The quadratic sandwich

fedemagnani.github.io

91 points by cpp_frog 3 days ago


laGrenouille - 4 hours ago

Great visualizations. Really enjoyed having a well-written example where mathematical proofs directly help with understanding a practical application.

I wonder what would happen with this analysis if a momentum term was added to the gradient descent. It seems that it would fix the specific failure modes in the examples, but I wonder if there's a corresponding mathematical way of categorizing what kinds of functions can(not) be quickly optimized with GD + momentum.

xuzhenpeng - 6 hours ago

The animation is very good, making the article easy to understand

CarVac - 2 hours ago

Simplex methods can handle those tough situations, though.

xzp12138 - 5 hours ago

[flagged]