Analytics Vidhya

Analytics Vidhya is a community of Generative AI and Data Science professionals. We are building the next-gen data science ecosystem https://www.analyticsvidhya.com

Optimization Algorithms for Deep Learning

8 min read · Jul 23, 2019

The figure above shows the basic strategy when gradient descent is used as the optimization algorithm.
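
As a minimal sketch of this strategy, the snippet below runs full-batch gradient descent on a toy linear-regression problem. The data, the mean-squared-error cost, and the names `gradient_descent`, `lr`, and `epochs` are assumptions made for illustration and are not taken from the figure.

```python
def gradient_descent(xs, ys, lr=0.05, epochs=2000):
    """Fit y ~ w*x + b by full-batch gradient descent on mean squared error."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Gradients of the MSE cost, computed over the ENTIRE training set.
        dw = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        db = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        # Exactly one parameter update per pass over the data.
        w -= lr * dw
        b -= lr * db
    return w, b


# Toy data generated from y = 2x + 1 (illustrative only).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = gradient_descent(xs, ys)
print(f"w = {w:.4f}, b = {b:.4f}")
```

Every update here needs a full pass over the training set, which is the cost that the mini-batch approach is meant to reduce.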
The figure above shows the basic strategy when mini-batch gradient descent is used as the optimization algorithm.
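
A minimal sketch of the mini-batch variant, under the same illustrative assumptions (toy linear-regression data and a mean-squared-error cost). The function name, `batch_size`, and the fixed `seed` are hypothetical choices, not part of the original figure.

```python
import random


def minibatch_gradient_descent(xs, ys, batch_size=2, lr=0.02, epochs=2000, seed=0):
    """Fit y ~ w*x + b, updating the parameters once per mini-batch."""
    rng = random.Random(seed)      # fixed seed so runs are repeatable
    data = list(zip(xs, ys))
    w, b = 0.0, 0.0
    for _ in range(epochs):
        rng.shuffle(data)          # new batch composition every epoch
        for start in range(0, len(data), batch_size):
            batch = data[start:start + batch_size]
            m = len(batch)         # the last batch may be smaller
            # Gradients are estimated from the current mini-batch only.
            dw = sum(2 * (w * x + b - y) * x for x, y in batch) / m
            db = sum(2 * (w * x + b - y) for x, y in batch) / m
            w -= lr * dw
            b -= lr * db
    return w, b


# Toy data generated from y = 2x + 1 (illustrative only).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = minibatch_gradient_descent(xs, ys)
print(f"w = {w:.4f}, b = {b:.4f}")
```

The only structural change from plain gradient descent is the inner loop: the parameters are updated several times per epoch, each time from a gradient estimated on a small slice of the data.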

Note that the mini-batch approach can be combined with the Momentum optimizer, since only the parameter-update rule changes.

The figure above shows the basic strategy when Momentum is used as the optimization algorithm.
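
A minimal sketch of a Momentum update, assuming the exponentially weighted average form (velocity = β · velocity + (1 − β) · gradient). The toy data, the mean-squared-error cost, and the names `momentum_descent`, `v_dw`, and `v_db` are illustrative assumptions; other formulations of momentum omit the (1 − β) factor.

```python
def momentum_descent(xs, ys, lr=0.05, beta=0.9, epochs=2000):
    """Fit y ~ w*x + b using gradient descent with momentum."""
    w, b = 0.0, 0.0
    v_dw, v_db = 0.0, 0.0          # running averages of the gradients
    n = len(xs)
    for _ in range(epochs):
        dw = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        db = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        # Exponentially weighted average of past gradients.
        v_dw = beta * v_dw + (1 - beta) * dw
        v_db = beta * v_db + (1 - beta) * db
        # Step along the averaged gradient instead of the raw one.
        w -= lr * v_dw
        b -= lr * v_db
    return w, b


# Toy data generated from y = 2x + 1 (illustrative only).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = momentum_descent(xs, ys)
print(f"w = {w:.4f}, b = {b:.4f}")
```

Only the two update lines differ from plain gradient descent, so the same change can be dropped into a mini-batch loop without touching how the gradients are computed.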

Note that here the Y-axis represents the bias b and the X-axis represents the weight W, which makes it easier to visualize how the parameters are updated. Oscillations caused by any bias b (b1, b2, …, bn), any weight W (W1, W2, …, Wn), or both can be damped in the same way. As before, the mini-batch approach can still be combined with the RMSprop optimizer, since only the parameter-update rule changes.

The figure above shows the basic strategy when RMSprop is used as the optimization algorithm.
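
A minimal sketch of an RMSprop update, assuming the usual form in which each gradient is divided by the square root of a running average of its square. The toy data, the mean-squared-error cost, the small constant `eps`, and the names `rmsprop`, `s_dw`, and `s_db` are illustrative assumptions rather than details taken from the figure.

```python
import math


def rmsprop(xs, ys, lr=0.001, beta=0.9, eps=1e-8, epochs=8000):
    """Fit y ~ w*x + b using RMSprop-style scaled updates."""
    w, b = 0.0, 0.0
    s_dw, s_db = 0.0, 0.0          # running averages of squared gradients
    n = len(xs)
    for _ in range(epochs):
        dw = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        db = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        # Exponentially weighted average of the squared gradients.
        s_dw = beta * s_dw + (1 - beta) * dw * dw
        s_db = beta * s_db + (1 - beta) * db * db
        # A large recent gradient in one direction shrinks the step in
        # that direction; eps guards against division by zero.
        w -= lr * dw / (math.sqrt(s_dw) + eps)
        b -= lr * db / (math.sqrt(s_db) + eps)
    return w, b


# Toy data generated from y = 2x + 1 (illustrative only).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = rmsprop(xs, ys)
print(f"w = {w:.3f}, b = {b:.3f}")
```

Because the step is normalized by the gradient's recent magnitude, a constant learning rate tends to leave the parameters hovering in a small neighbourhood of the minimum rather than settling on it exactly, so the result should be read as approximate.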

Note that two different values of β are used in the calculations: β1 for the Momentum terms and β2 for the RMSprop terms. As before, the mini-batch approach can still be combined with the Adam optimizer, since only the parameter-update rule changes.

The figure above shows the basic strategy when Adam is used as the optimization algorithm.
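
A minimal sketch of an Adam update that combines the two ideas: β1 drives the Momentum-style average of the gradients and β2 drives the RMSprop-style average of their squares. The bias-correction step, the default values 0.9 and 0.999, the constant `eps`, the toy data, and the name `adam` are assumptions based on the commonly used formulation, not details confirmed by the figure.

```python
import math


def adam(xs, ys, lr=0.01, beta1=0.9, beta2=0.999, eps=1e-8, epochs=5000):
    """Fit y ~ w*x + b using Adam-style updates."""
    w, b = 0.0, 0.0
    v_dw = v_db = 0.0              # Momentum terms (use beta1)
    s_dw = s_db = 0.0              # RMSprop terms (use beta2)
    n = len(xs)
    for t in range(1, epochs + 1):
        dw = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        db = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        # Momentum part: running average of the gradients.
        v_dw = beta1 * v_dw + (1 - beta1) * dw
        v_db = beta1 * v_db + (1 - beta1) * db
        # RMSprop part: running average of the squared gradients.
        s_dw = beta2 * s_dw + (1 - beta2) * dw * dw
        s_db = beta2 * s_db + (1 - beta2) * db * db
        # Bias correction offsets the zero initialization of the averages.
        v_dw_hat = v_dw / (1 - beta1 ** t)
        v_db_hat = v_db / (1 - beta1 ** t)
        s_dw_hat = s_dw / (1 - beta2 ** t)
        s_db_hat = s_db / (1 - beta2 ** t)
        w -= lr * v_dw_hat / (math.sqrt(s_dw_hat) + eps)
        b -= lr * v_db_hat / (math.sqrt(s_db_hat) + eps)
    return w, b


# Toy data generated from y = 2x + 1 (illustrative only).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = adam(xs, ys)
print(f"w = {w:.3f}, b = {b:.3f}")
```

With bias correction, the very first step has magnitude close to `lr` in each parameter regardless of the gradient's scale, which is one reason the learning rate in this scheme is often easier to choose than in plain gradient descent.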
Contours of a loss surface and time evolution of different optimization algorithms
