Given that you can compute both the objective function's gradient and directional derivative, what would be the approach to take in order to overcome the step size issue?
MathComputational mathOptimization
Pitfalls of gradient descent
Good steps
Report a typo
Select one option from the list
___
By continuing, you agree to the JetBrains Academy Terms of Service as well as Hyperskill Terms of Service and Privacy Policy.
Create a free account to access the full topic
By continuing, you agree to the JetBrains Academy Terms of Service as well as Hyperskill Terms of Service and Privacy Policy.