Questions and Answers
What do (z_1)² and (z_2)² represent in the context of the neural network?
The sums of the multiplication between every input x_i with the corresponding weight (W_ij)¹ for layer 2
Which part of the neural network produces the predicted value?
Output layer
What is the purpose of forward propagation in a neural network?
To evaluate the predicted output s against an expected output y
Which popular NLP model is not specifically mentioned in the text?
Signup and view all the answers
What use case of transformers is NOT mentioned in the text?
Signup and view all the answers
In what context did Denis Rothman deliver AI solutions for Moët et Chandon and Airbus?
Signup and view all the answers
Что является первой главной компонентой в методе главных компонент (PCA)?
Signup and view all the answers
Как можно определить i-ю главную компоненту в методе главных компонент (PCA)?
Signup and view all the answers
Чем PCA тесно связано с анализом факторов?
Signup and view all the answers
The relative error is a more appropriate metric than the absolute difference when comparing numerical and analytic gradients.
Signup and view all the answers
It is recommended to track the difference between the numerical and analytic gradients directly to determine their compatibility.
Signup and view all the answers
Using double precision floating-point arithmetic can reduce relative errors in gradient checking.
Signup and view all the answers
The deeper the neural network, the lower the relative errors are expected to be during gradient checking.
Signup and view all the answers
Normalizing the loss function over the batch can reduce relative errors in gradient computations.
Signup and view all the answers