Deep Learning - Architectures & Methods, Prüfungen von Deep Learning

Deep Learning Klausur Midterm Exam 2017 2018

Art: Prüfungen

2017/2018

Hochgeladen am 23.08.2021

domi877
domi877 🇩🇪

1 dokument

1 / 6

Toggle sidebar

Diese Seite wird in der Vorschau nicht angezeigt

Lass dir nichts Wichtiges entgehen!

bg1
CS 231N
Convolutional Neural Networks for Visual Recognition
Spring 2017 Sample Midterm Exam
May 9, 2017
Full Name:
Question Score
Multiple Choice (20 pts)
True/False (20 pts)
Short Answer (60 pts)
Total (100 pts)
Welcome to the CS231N Midterm Exam!
The exam is 1 hour 15 minutes.
No notes or electronic devices are allowed.
I understand and agree to uphold the Stanford Honor Code during this exam.
Signature: Date:
Good luck!
1
pf3
pf4
pf5

Unvollständige Textvorschau

Nur auf Docsity: Lade Deep Learning - Architectures & Methods und mehr Prüfungen als PDF für Deep Learning herunter!

CS 231N

Convolutional Neural Networks for Visual Recognition

Spring 2017 Sample Midterm Exam

May 9, 2017

Full Name:

Question Score

Multiple Choice (20 pts)

True/False (20 pts)

Short Answer (60 pts)

Total (100 pts)

Welcome to the CS231N Midterm Exam!

  • The exam is 1 hour 15 minutes.
  • No notes or electronic devices are allowed.

I understand and agree to uphold the Stanford Honor Code during this exam.

Signature: Date:

Good luck!

1 Multiple Choice (20 points)

Circle the letters of your choice.

Each question is worth 2 points. Each one of the four individual choices is 0.5 points for a correct answer, or 0 points otherwise.

  1. You start training your Neural Network but the loss is almost completely flat. What could be the cause?

(a) The learning rate could be too low (b) The regularization strength could be too high (c) The class distribution could be very uneven in the dataset (d) The weight initialization scale could be incorrectly set

  1. A VGGNet only uses a sequence of 3x3 CONV with stride 1 pad 1 and 2x2 POOL stride 2 pad 0 layers. It eventually transitions to Fully Connected layers and the classifier. There are 5 POOL layers in total. On ImageNet, the VGGNet takes 224x224 images. If we tried to run the VGGNet on a 32x input (e.g. CIFAR-10 image):

(a) The code would crash on the very first CONV layer because 3x3 filters with stride 1 pad 1 wouldn’t ”fit” across 32x32 input (b) The amount of memory needed to store the forward activations in the first CONV layer would be reduced by a factor of 7 (since 224/32 = 7) (c) The network would run fine until the very first Fully Connected layer, where it would crash (d) The network would run forward just fine but its predictions would, of course, be ImageNet class predictions

  1. A max pooling layer in a ConvNet:

(a) Is approximately as fast to compute in both forward and backward pass as a CONV layer (with the same filter size and strides). (b) Is similar to batch normalization in that it will keep all of your neuron activities in a similar range. (c) Could contribute to difficulties during gradient checking. (d) Could contribute to the vanishing gradient problem (recall: this is a problem where by the end of a backward pass the gradients are very small)

3 Short Answer (60 points)

Answer each question in provided space.

3.1 Backpropagation

Fill in the missing gradients underneath the forward pass activations in each circuit diagram. The gradient of the output with respect to the loss is one (1.00) for every circut, and has already been filled in.

3.2 Convolutional Architectures

Consider the convolutional network defined by the layers in the left column below. Fill in the size of the activation volumes at each layer, and the number of parameters at each layer. You can write your answer as a multiplication (e.g. 128x128x3).

  • CONV5-N denotes a convolutional layer with N neurons, each having 5x5xD filters, where D is the depth of the activation volume at the previous layer. Padding is 2, and stride is 1.
  • POOL2 denotes a 2x2 max-pooling layer with stride 2 (pad 0)
  • FC-N denotes a fully-connected layer with N neurons.

Layer Activation Volume Dimensions (memory) Number of parameters INPUT 32x32x1 0 CONV5- POOL CONV5- POOL FC-

3.3 Simple ConvNet (12 points)

Consider the following 1-dimensional ConvNet, where all variables are scalars:

x 1 x 2 x 3 x 4 x 5

k, b

z 1

k, b

z 2

k, b

z 3

max max

w, a

f

v 1 v 2

y ˆ

y

L

conv

max pool, relu

fully connected

Loss

L =

(y − yˆ)^2

y ˆ =

[

w 1 w 2

]

[

v 1 v 2

]

  • a

[

v 1 v 2

]

[

max{z 1 , z 2 , 0 } max{z 2 , z 3 , 0 }

]

z 1 z 2 z 3

k 1 k 2 k 3 0 0 0 k 1 k 2 k 3 0 0 0 k 1 k 2 k 3

x 1 x 2 x 3 x 4 x 5

b b b

(a) (1 point) List the parameters in this network.

(b) (3 points) Determine the following

∂L ∂w 1

∂L

∂w 2

∂L

∂a

(c) (3 points) Given the gradients of the loss L with respect to the second layer activations v, derive the gradient of the loss with respect to the first layer activations z. More precisely, given ∂L ∂v 1

= δ 1

∂L

∂v 2

= δ 2

Determine the following ∂L ∂z 1

∂L

∂z 2

∂L

∂z 3