









Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
An overview of Convolutional Neural Networks (CNNs) for image recognition, focusing on the architectures of LeNet-5, AlexNet, GoogLeNet, VGGNet, and ResNet. the basics of CNNs, including the use of convolutional kernels, activation functions, pooling layers, and fully connected layers. It also discusses various datasets, such as MNIST, CIFAR10, and ImageNet, and their significance in CNN research. The document concludes with case studies of each network, highlighting their contributions and achievements.
Typology: Cheat Sheet
1 / 17
This page cannot be seen from the preview
Don't miss anything!










Reference http://cs231n.github.io/convolutional-networks/ By Yunzhe Xue
This is a gif image
Padding on the input volume with zeros in such way that the conv layer does not alter the spatial dimensions of the input
CIFAR10 dataset and state of the art The CIFAR-10 dataset consists of 60000 32x32 color images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test images.
Case studies
Convolutional Network from Szegedy et al. from Google. Its main contribution was the development of an Inception Module that dramatically reduced the number of parameters in the network (4M, compared to AlexNet with 60M). Additionally, this paper uses Average Pooling instead of Fully Connected layers at the top of the ConvNet, eliminating a large amount of parameters that do not seem to matter much. There are also several followup versions to the GoogLeNet, most recently Inception-v4.
VGG- GoogleNe t ResNet