






Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
CS 7643 Quiz 2 | Actual Questions and Answers Latest Updated 2025/2026 (Graded A+) Georgia Institute of Technology
Typology: Exams
1 / 10
This page cannot be seen from the preview
Don't miss anything!







1. Which of the following are common issues while optimizing the weights of a deep neural network? (Select all that apply) A. Existence of local minima B. Ill-conditioned loss surface C. Noisy gradient estimates D. Saddle points **Correct Answer: B, C, D
A. ReLU activations B. Tanh and Sigmoid activations C. Linear transformations D. Max pooling layers Correct Answer: B
4. Which of the following best describes batch normalization? A. Adds noise to gradients during backpropagation B. Normalizes inputs within a mini-batch to stabilize training C. Increases the learning rate dynamically D. Removes neurons to prevent overfitting **Correct Answer: B
A. Ensure weights are all positive B. Keep the variance of activations consistent across layers C. Reduce training time by skipping normalization D. Initialize all weights at zero Correct Answer: B
10. Why are residual connections (ResNets) effective in deep architectures? A. They prevent underfitting B. They allow gradients to flow directly, mitigating vanishing gradients C. They reduce the number of parameters D. They remove the need for backpropagation **Correct Answer: B
12. In dropout regularization, neurons are: A. Permanently removed from the network B. Randomly deactivated during training to prevent co- adaptation C. Replaced with noise during forward propagation D. Normalized across mini-batches **Correct Answer: B
18. Which of the following problems do ReLU activations help mitigate? A. Vanishing gradient B. Exploding gradient C. Overfitting D. Saddle points **Correct Answer: A
21. Which initialization is typically best for ReLU activations? A. Xavier (Glorot) B. He initialization C. Zero initialization D. Random small constants **Correct Answer: B