back propagation for, Study Guides, Projects, Research of Computer science

back propagation method for a students

Typology: Study Guides, Projects, Research

2025/2026

Uploaded on 04/13/2026

rubina-akhtar-1
rubina-akhtar-1 ๐Ÿ‡ฎ๐Ÿ‡ณ

1 document

1 / 40

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
DS 4400
David Liu
Khoury College of Computer Science
Northeastern University
April 2 2024
Machine Learning and Data Mining I
Spring 2024
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12
pf13
pf14
pf15
pf16
pf17
pf18
pf19
pf1a
pf1b
pf1c
pf1d
pf1e
pf1f
pf20
pf21
pf22
pf23
pf24
pf25
pf26
pf27
pf28

Partial preview of the text

Download back propagation for and more Study Guides, Projects, Research Computer science in PDF only on Docsity!

DS 4400

David Liu

Khoury College of Computer Science

Northeastern University

April 2 2024

Machine Learning and Data Mining I

Spring 2024

Outline

  • Training Neural Networks
    • Demo
    • Backpropagation
    • Parameter Initialization
    • Derivation for feed-forward neural network for

binary classification (sigmoid activation)

  • Stochastic Gradient Descent
    • Gradient descent variants
  • Regularization methods for neural networks
    • Weight decay
    • Dropout

Reminder: Logistic Regression

Cross-entropy loss

N

N

[๐‘ฆ

log โ„Ž

)log 1 โˆ’ โ„Ž

]

loss

loss

Gradient Descent

  • Converges for convex objective
  • Could get stuck in local minimum for non-convex objectives

Training Neural Networks

  • Training data ๐‘ฅ

!

, ๐‘ฆ

!

, โ€ฆ ๐‘ฅ

"

, ๐‘ฆ

"

  • Training example ๐‘ฅ

= ๐‘ฅ

#!

, โ€ฆ ๐‘ฅ

#$

, label ๐‘ฆ

  • One forward pass through the network
    • Compute prediction !๐‘ฆ

= โ„Ž(๐‘ฅ

)

  • Loss function for each example
    • ๐ฟ ๐‘ฆ, ๐‘ฆ = โˆ’[ 1 โˆ’ ๐‘ฆ log 1 โˆ’ ๐‘ฆ + ๐‘ฆ log ๐‘ฆ*]
  • Loss function for training data
    • ๐ฝ ๐‘Š, ๐‘ =

โˆ‘

๐ฟ ( ๐‘ฆ!

, ๐‘ฆ

)

Cross-entropy loss

GD for Neural Networks

  • Initialization
    • For all layers โ„“
      • Initialize ๐‘Š

[โ„“]

[โ„“]

  • Backpropagation
    • Fix learning rate ๐›ผ
    • Repeat
      • For all layers โ„“

GD for Neural Networks

  • Initialization
    • For all layers โ„“
      • Set ๐‘Š

[โ„“]

[โ„“]

at random

  • Backpropagation
    • Fix learning rate ๐›ผ
    • Repeat
      • For all layers โ„“ (starting backwards)
      • ๐‘Š

[โ„“]

[โ„“]

$%&

' ()(

,

!

,,

!

)

(/

โ„“

  • ๐‘

[โ„“]

[โ„“]

$%&

' ()(

,

!

,,

!

)

( 0

โ„“

This is

expensive!

Stochastic Gradient Descent

  • Initialization
    • For all layers โ„“
      • Set ๐‘Š

[โ„“]

[โ„“]

at random

  • Backpropagation
    • Fix learning rate ๐›ผ
    • Repeat
      • For all layers โ„“ (starting backwards)
        • For all training examples ๐‘ฅ

$

$

[โ„“]

[โ„“]

$

$

โ„“

[โ„“]

[โ„“]

$

$

โ„“

Incremental

version of GD

Gradient Descent Variants

Review of a Perceptron

1

Backpropagation Intuition

๐‘ง

$

[$]

โ†’ ๐‘Ž

$

[$]

๐›ฟ

$

[$]

๐‘ง

'

[$]

โ†’ ๐‘Ž

'

[$]

๐›ฟ

'

[$]

๐‘ง

$

[']

โ†’ ๐‘Ž

$

[']

๐›ฟ

$

[']

๐‘ง

'

[']

โ†’ ๐‘Ž

'

[']

๐›ฟ

'

(')

๐‘ง

$

[(]

โ†’ ๐‘Ž

$

[(]

๐›ฟ

$

(()

&

[ 6 ]

&

[ 6 ]

7&

[ 6 ]

7

[ 7 ]

&

[ 6 ]

W

7&

[ 6 ]

๐›ฟ

$

[(]

๐›ฟ

'

[']

Backpropagation Intuition

๐‘ง

$

[$]

โ†’ ๐‘Ž

$

[$]

๐›ฟ

$

[$]

๐‘ง

'

[$]

โ†’ ๐‘Ž

'

[$]

๐›ฟ

'

($)

๐‘ง

$

[']

โ†’ ๐‘Ž

$

[']

๐›ฟ

$

[']

๐‘ง

'

[']

โ†’ ๐‘Ž

'

[']

๐›ฟ

'

(')

๐‘ง

$

[(]

โ†’ ๐‘Ž

$

[(]

๐›ฟ

$

[()

7

[&]

7&

[ 7 ]

&

[ 7 ]

77

[ 7 ]

7

[ 7 ]

7&

[ 6 ]

๐›ฟ

$

[(]

๐›ฟ

'

[']

77

[ 7 ]

7&

[ 7 ]

๐›ฟ

'

[$]