Deep face recognition, Slides of Computer science

Slides from lecture on ethics and computer science

Typology: Slides

2020/2021

Uploaded on 10/15/2023

crazystraw01
crazystraw01 🇺🇸

1 document

1 / 30

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Omkar M. Parkhi, Andrea Vedaldi, Andrew Zisserman
Deep face recognition
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13
pf14
pf15
pf16
pf17
pf18
pf19
pf1a
pf1b
pf1c
pf1d
pf1e

Partial preview of the text

Download Deep face recognition and more Slides Computer science in PDF only on Docsity!

Omkar M. Parkhi, Andrea Vedaldi, Andrew Zisserman

Deep face recognition

Key Questions

  • (^) Can large scale datasets be built with minimal human intervention?
  • (^) Can we propose a convolutional neural network which can compete with that of internet giants like Google and Facebook? Yes! Yes!

Can large scale datasets be built with minimal human intervention?

Dataset Collection

1. Candidate list generation : Finding names of celebrities

  • (^) Tap the knowledge on the web
  • (^) 5000 identities Robert Downey Jr. Allison Hannigan Ashley Hamilton Amitabh Bacchan Barack Obama Vladimir Putin

Dataset Collection

Popular Celebrity

Robert Downey Jr.

Dataset Collection

Not so popular

celebrity

Ashley Hamilton

3. Rank image sets

  • (^) 2000 images per identity
  • (^) Searching by appending keyword “actor”
  • (^) Learning classifier using data obtained the previous step. c
  • (^) Ranking 2000 images and selecting top 1000 images
  • (^) Approx. 2.6 Million images of 2622 celebrities Dataset Collection

Dataset Collection

4. Near duplicate removal

  • (^) VLAD descriptor based near duplicate removal

5. manual filtering

  • (^) Curating the dataset further using manual

checks

**No. Aim Mode # Persons

images

/person Total # images Anno. effort** 1 Candidate list generation Auto 5000 200 1,000,000 - 2 Candidate list filtering Manual 2622 - - 4 days 3 Rank image sets Auto 2622 1000 2,622,000 - 4 Near duplicate removal Auto 2622 623 1,635,159 - 5 Manual filtering Manual 2622 375 982,803 10 days

Dataset Collection

Example Images From Our Dataset

Can we propose a convolutional neural network which can compete with that of internet giants like Google and Facebook etc.?

Convolutional Neural Network

Training

  • (^) MatConvNet Tootlbox
    • (^) Nvidia CuDNN bindings
    • (^) Multi GPU Training (approx 3.5x speedup)
    • (^) Nvidia Titan Black
    • (^) 7 days of training
  • (^) Random Gaussian Initialization
  • (^) Stochastic Gradient Descent with back prop.
    • (^) Accumulator Descent for large batch sizes
  • (^) Batch Size: 256
  • (^) Incremental FC layer training
  • (^) 2622 way multi class criterion (soft max) image Conv- maxpool fc- fc- Softmax Conv- Conv- maxpool Conv- Conv- maxpool Conv- Conv- maxpool Conv- Conv- Conv- maxpool Conv- Conv- Matconvnet – convolutional neural networks for matlab.^ fc- A Vedaldi and K. Lenc. Arxiv - 2014.

Convolutional Neural Network

Training: Learning Task Specific Embedding

  • (^) Learning embedding by minimizing triplet loss
  • (^) Learning a projection from 4096 to 1024 dimensions
  • (^) On line triplet formation at the beginning of each iteration
  • (^) Fine tuned on target datasets
  • (^) Only the projection layers learnt

Labeled Faces In the Wild Dataset (LFW)

  • (^) Face Verification: Given a pair of images specify

whether they belong to the same person

  • (^) 13K images, 5.7K people
  • (^) Standard benchmark in the community
  • (^) Several test protocols depending upon availability of training data

within and outside th

dataset. [ G. Huang, M. Ramesh, T. Berg and E. Learned-Miller - Tech Report 07]

Effects of design choices (LFW Unrestricted Protocol) No. Network Config. Dataset Face Align Training Face Align Testing Embedding 100%-EER

1 A Curated No No No 92.

2 A Full No No No 95.

3 A Full No Yes No 96.

4 B Full No Yes No 97.

5 B Full Yes Yes No 97.

6 D Full No Yes No 96.

7 B Full No Yes Yes 99.