The answer is largely the same whether we consider ℓ1 or ℓ2 regularisation, so I will just speak generally about regularisation.
Mean square error for training data
Given some training data {(xi,yi)}ni=1, a linear regression line Y=aX+b fit using the least squares method looks for coefficients that minimise the sum of squares, i.e. they are the minimisers given by
argmina,b∑i=1n(yi−(axi+b))2.
This gives the same coefficients as minimising the mean square error
MSE((x1,y1),…,(xn,yn))=1n∑i=1n(yi−(axi+b))2.
So, by definition, the coefficients (a,b) minimise the MSE on the training data. Any regularisation will only increase the MSE on the training data.
Generalisation performance
The main point of regularisation is to prevent overfitting on the data and improve the generalisation performance (i.e. on the test set).
With an appropriate parameter for regularisation, you may obtain a smaller MSE on the test set. This depends on your dataset and the parameters you choose: strong regularisation may lead to underfitting, whereas weak regularisation might not make much difference to the coefficients that you fit.