27. Backpropagation: Find Partial Derivatives

MIT OpenCourseWare

4.92 million Subscribers

53,032 views since Nov 26, 2023

MIT 18.065 Matrix Methods in Data Analysis, Signal Processing, and Machine Learning, Spring 2018
Instructor: Gilbert Strang
View the complete course: https://ocw.mit.edu/18-065S18
YouTube Playlist:    • MIT 18.065 Matrix Methods in Data Ana...  

In this lecture, Professor Strang presents Professor Sra's theorem which proves the convergence of stochastic gradient descent (SGD). He then reviews backpropagation, a method to compute derivatives quickly, using the chain rule.

Note: Videos of Lectures 28 and 29 are not available because those were in-class lab sessions that were not recorded.

License: Creative Commons BY-NC-SA
More information at https://ocw.mit.edu/terms
More courses at https://ocw.mit.edu