
Explanation:
Option A is correct because the weight update in neural networks follows the gradient descent formula:
Where:
Calculation:
Option B (0.195) would result from adding the gradient instead of subtracting it.
Option C (0.225) would result from incorrectly multiplying the learning rate by the weight instead of the gradient.
Ultimate access to all questions.
No comments yet.