








Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
CS-7643 QUIZ 4 DEEP LEARNING OPTIMIZATION REGULARIZATION FINAL STUDY GUIDE 2026 SOLVED QUESTIONS FULLY CORRECT
Typology: Exams
1 / 14
This page cannot be seen from the preview
Don't miss anything!









⫸ RL: Evaluative Feedback. Answer: - Pick an action, receive a reward
⫸ MDP. Answer: Framework underlying RL S: Set of states A: Set of actions R: Distribution of Rewards T: Transition probabiliity y: Discount property Markov Property: Current state completely characterizes state of the environment ⫸ RL: Equations relating optimal quantities. Answer: 1. V(S) = max_a(Q(s, a)
⫸ Pseudo-labeling for Unlabeled Data (Semi-Supervised Learning). Answer: - Learn a model on labeled training data
⫸ Multi-View Pseudo-Labeling Key Details for Success. Answer: - Pseudo-labeling without augmentation isn't very effective --> need good data augmentation algos