Tag: optimization
- Can Mean Square Error cause underfitting
- Is it possible to learn with batch size = 1
- Estimate MLE of discrete distribution with two parameters in R
- Why does stochastic gradient descent lead us to a minimum at all
- Why is a 2nd order derivative optimization better for no hidden layer neural networks
- Delta method for estimating a ratio involving variance and mean
- How to find the optimal values for $\beta$ and $\beta_0$ for sparse linear regression model? Where does the mean of $\lambda$ come into account?
- Pre-processing of features with cross-validation
- Stochastic Gradient Descent Code Check for Least Squares
- Use dev set to tune hyperparameters