Commit c8f8a90

Copilot and mmcky authored
Fix broken links in back_prop.md - URL encoding and dead domain (#602)
* Initial plan

* Fix broken links in back_prop.md - URL encoding and dead domain

Co-authored-by: mmcky <[email protected]>

---------

Co-authored-by: copilot-swe-agent[bot] <[email protected]>
Co-authored-by: mmcky <[email protected]>
1 parent 6f78812 commit c8f8a90

1 file changed: +2 -2 lines changed

lectures/back_prop.md

Lines changed: 2 additions & 2 deletions
@@ -201,7 +201,7 @@ $$ (eq:sgd)

 where $\frac{d {\mathcal L}}{dx_{N+1}}=-\left(x_{N+1}-y\right)$ and $\alpha > 0 $ is a step size.

-(See [this](https://en.wikipedia.org/wiki/Gradient_descent#Description) and [this](https://en.wikipedia.org/wiki/Newton%27s_method) to gather insights about how stochastic gradient descent
+(See [this](https://en.wikipedia.org/wiki/Gradient_descent#Description) and [this](https://en.wikipedia.org/wiki/Newton's_method) to gather insights about how stochastic gradient descent
 relates to Newton's method.)

 To implement one step of this parameter update rule, we want the vector of derivatives $\frac{dx_{N+1}}{dp_k}$.
@@ -540,7 +540,7 @@ Image(fig.to_image(format="png"))
 It is fun to think about how deepening the neural net for the above example affects the quality of approximation


-* If the network is too deep, you'll run into the [vanishing gradient problem](https://neuralnetworksanddeeplearning.com/chap5.html)
+* If the network is too deep, you'll run into the [vanishing gradient problem](https://en.wikipedia.org/wiki/Vanishing_gradient_problem)
 * Other parameters such as the step size and the number of epochs can be as important or more important than the number of layers in the situation considered in this lecture.
 * Indeed, since $f$ is a linear function of $x$, a one-layer network with the identity map as an activation would probably work best.

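
Note on the first hunk: the two versions of the Newton's method link appear to differ only in whether the apostrophe is percent-encoded (`%27`) or written literally. The sketch below is not part of the commit. It uses Python's standard `urllib.parse` to show that the two spellings correspond to the same page title. The variable names are illustrative assumptions.

```python
from urllib.parse import quote, unquote

# The two spellings of the page title that appear in the first hunk.
raw_title = "Newton's_method"
encoded_title = "Newton%27s_method"

# quote() percent-encodes the apostrophe, which is not in its default safe set.
encoded_from_raw = quote(raw_title)

# unquote() reverses the percent-encoding.
decoded_from_encoded = unquote(encoded_title)

# Both directions should agree with the spellings seen in the diff.
same_title = (encoded_from_raw == encoded_title) and (decoded_from_encoded == raw_title)

base = "https://en.wikipedia.org/wiki/"
print(base + encoded_from_raw)
print("same title:", same_title)
```

Whether a given Markdown renderer or link checker accepts the literal apostrophe is tool-dependent, which is presumably why the encoding mattered here.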

0 commit comments
