Hybrid Recurrent Neural Networks: Combining the Strengths of RNNs and Hidden Markov Models (Thesis, Discrete Mathematics)

This thesis introduces hybrid recurrent neural networks (HRNNs), which combine the strengths of recurrent neural networks (RNNs) and hidden Markov models (HMMs) for modelling time-varying sequences. It covers the theoretical foundations of RNNs, the hybrid architecture of HRNNs, and their applications. It also discusses the use of evolutionary neural learning for training RNNs and the encoding of symbolic knowledge in recurrent neural networks.

Modelling of Deterministic, Fuzzy and Probabilistic Dynamical Systems
Rohitash Chandra
A Thesis in the Field of Computing Science
for the Degree of Master of Science in Computing Science
The University of Fiji
August, 2007
Supervisor: Prof. Christian Omlin.

Author's Biographical Sketch

The author is from the Fiji islands. He was born in Nausori and attended Saraswati Primary and Saraswati College for his primary and secondary school education. The author graduated with a Bachelor of Science Degree from the University of the South Pacific in April 2006. He joined the University of Fiji as a tutor in computing science in early 2007. His research work has been published in numerous international conference proceedings in the field of artificial intelligence. Apart from his contribution to the field of computing science, the author has published poetry in a number of international literary journals. He has released two books of poetry in the years of 2006 and 2007: “Barefoot on Soft River Sand” and “A Hot Pot of Roasted Poems”. He is also the editor and publisher of The Blue Fog Journal.

To Roni.

Table of Contents

  • List of Figures
  • Chapter 1 Introduction
    • 1.1 Motivation
    • 1.2 Premises
      • 1.2.1 Recurrent Neural Networks
      • 1.2.2 Hidden Markov Models
      • 1.2.3 Finite-State Automata and Knowledge Representation
      • 1.2.4 Hybrid Systems
      • 1.2.5 Speech Recognition
    • 1.3 Research Hypothesis
      • 1.3.1 Learning Finite-State Automata
      • 1.3.2 Extraction of Finite Automaton
      • 1.3.3 Evolutionary Training of Recurrent Neural Networks
      • 1.3.4 Hybrid Systems of Recurrent Networks and Hidden Markov Models
    • 1.4 Technical Goals
    • 1.5 Research Methodology
      • 1.5.1 Gradient Descent and Evolutionary Training
      • 1.5.2 Extraction of Fuzzy Finite Automaton
      • 1.5.3 The Hybrid Recurrent Neural Networks Architecture
      • 1.5.4 Application to Speech Phoneme Recognition
    • 1.6 Accomplishments
    • 1.7 Thesis Overview
  • Chapter 2 Neural Network Fundamentals
    • 2.1 Introduction
    • 2.2 Processing Elements
    • 2.3 Topologies
      • 2.3.2 Recurrent Neural Networks
    • 2.4 Learning Algorithm
    • 2.4 Computational Capabilities
    • 2.5 Summary
  • Chapter 3 Hybrid Systems
    • 3.1 Introduction
    • 3.2 Symbolic Connectionist Learning
      • 3.2.1 General Paradigm
      • 3.2.2 The Significance and Insertion of Prior Knowledge
      • 3.2.3 Knowledge Extraction
      • 3.2.4 Knowledge Refinement
    • 3.3 Neural Expert Systems
    • 3.4 Neuro-Fuzzy Systems
    • 3.5 Evolutionary Neural Networks
      • 3.5.1 Evolutionary Neural Learning
      • 3.5.2 Evolutionary Neural Topologies
    • 3.6 Hybrid Recurrent Neural Networks Inspired by Hidden Markov Models
      • 3.6.1 Motivation
      • 3.6.2 Significance of Hybrid Recurrent Neural Networks
      • 3.6.3 The Derivation for Hybrid Recurrent Neural Networks
      • 3.5.4 Training Hybrid Recurrent Neural Networks
    • 3.6 Summary
  • Chapter 4 Automata Theory
    • 4.1 Introduction
    • 4.2 Formal Languages
    • 4.3 Finite-State Automata
      • 4.3.1 Deterministic Finite-State Automata
      • 4.3.2 Fuzzy Finite-State Automata
    • 4.4 Finite State Machines: Hidden Markov Models
    • 4.5 Summary
  • Chapter 5 Recurrent Neural Networks
    • 5.1 Introduction
    • 5.2 Architectures
      • 5.2.1 First-Order Recurrent Neural Networks
      • 5.2.2 Second-Order Recurrent Neural Networks
      • 5.2.3 Locally Recurrent Neural Networks
      • 5.2.4 NARX Recurrent Networks
      • 5.2.5 Long Short Term Memory
    • 5.3 Learning
      • 5.3.1 Introduction
      • 5.3.2 Backpropagation-Through-Time
      • 5.3.3 Real Time Recurrent Learning
      • 5.3.4 Evolutionary Neural Learning
        • 5.3.4.1 Genetic Algorithms
        • 5.3.4.2 Training Neural Networks with Genetic Algorithms
    • 5.4 Recurrent Neural Networks as Models of Computation
    • 5.5 Knowledge Extraction from Recurrent Neural Network
      • 5.5.1 Introduction
      • 5.5.2 Knowledge Extraction Using Machine Learning
    • 5.6 Applications of Recurrent Neural Networks
      • 5.6.1 Recurrent Neural Networks for Speech Recognition
      • 5.6.2 Recurrent Neural Networks for Control
      • 5.6.3 Molecular Biology
      • 5.6.4 Signature Verification
    • 5.7 Summary
  • Chapter 6 Training and Extraction of Finite State Automata
    • 6.1 Introduction
    • 6.2 Gradient Descent Training of Recurrent Neural Networks
      • 6.2.1 Training on Deterministic Finite-State Automata
      • 6.2.2 Training on Fuzzy Finite Automaton
    • 6.3 Extraction of Finite Automaton from Trained Recurrent Neural Networks
      • 6.3.1 Extraction of Deterministic Finite Automaton
      • 6.3.2 Extraction of Fuzzy Finite Automaton
    • 6.4 Evolutionary Training of Hybrid Recurrent Neural Networks
      • 6.4.1 Introduction
      • 6.4.2 Learning Deterministic Finite Automaton
    • 6.5 Extraction of Deterministic Finite Automaton
    • 6.6 Summary
  • Chapter 7 Real World Application: Speech Recognition
    • 7.1 Introduction
    • 7.2 Speech Recognition Systems
    • 7.3 Feature Extraction from Speech Sequences
      • 7.3.1 Introduction
      • 7.3.2 Mel Frequency Cepstral Coefficients
      • 7.3.3 The TIMIT Database
      • 7.3.4 Empirical Results
    • 7.4 An Application to Speech Phoneme Classification
      • 7.4.1 Empirical Results and Discussion
    • 7.5 Summary
  • Chapter 8 Conclusion and Directions for Future Research
    • 8.1 Accomplishments and Open Problems
      • 8.1.1 Open Problems
    • 8.2 Derivation of Gradient Descent Learning of the Hybrid Architecture
    • 8.3 Applications to Other Real World Time Series
  • References
List of Figures

  • Figure 1 Output of a single neuron
  • Figure 2 Neural network topologies
  • Figure 3 Architectures of recurrent neural networks
  • Figure 4 The framework for combining symbolic and neural learning
  • Figure 5 Neural expert system
  • Figure 6 An example of a neuro-fuzzy system
  • Figure 7 Hybrid recurrent neural networks
  • Figure 8 Deterministic finite-state automata
  • Figure 9 Fuzzy finite-state automaton with weight state transitions
  • Figure 10 Equivalent deterministic acceptor
  • Figure 11 A first-order discrete Markov model
  • Figure 12 A first-order hidden Markov model
  • Figure 13 First-order recurrent neural networks
  • Figure 14 Second-order recurrent neural networks
  • Figure 15 Locally recurrent networks
  • Figure 16 NARX network architecture
  • Figure 17 Example of a three layer LSTM topology
  • Figure 18 Unfolding a recurrent neural network in time
  • Figure 19 Crossover and mutation operators for genetic algorithms
  • Figure 20 Crossover operator for evolutionary neural learning
  • Figure 21 Knowledge extraction through machine learning
  • Figure 22 Prefix tree
  • Figure 23 DFA induction
  • Figure 24 The 7 state deterministic finite automaton
  • Figure 25 The 7 state fuzzy finite automaton
  • Figure 26 Hybrid recurrent neural networks
  • Figure 27 Windowing
  • Figure 28 The process of MFCC feature extraction
  • Figure 29 Hybrid recurrent neural networks

1.2 Premises

Recurrent neural networks on their own can represent dynamical systems such as finite automata very well. They have been applied to a wide range of real-world problems with dynamical characteristics, including speech, signature and gesture recognition [1, 2, 3]. Hidden Markov models, on the other hand, have been popular tools for modelling speech sequences [6]. Finite-state automata have been a useful paradigm for studying recurrent neural networks and their dynamical characteristics. Hybrid systems combine useful features of at least two paradigms. Examples of hybrid systems in machine learning include evolutionary neural learning, neural expert systems, neuro-fuzzy systems, and symbolic connectionist learning.

1.2.1 Recurrent Neural Networks

Neural networks are loosely modelled on the brain. They learn by training on past experience and can demonstrate good generalization performance when presented with data not included in the training process. Neural networks can be divided into two classes: feedforward and recurrent neural networks. Feedforward networks are used in applications where the data does not contain time-variant information, while recurrent neural networks model time series sequences and possess dynamical characteristics. Recurrent neural networks contain feedback connections, which give them the ability to maintain information from past states for the computation of future state outputs. It has been shown that recurrent neural networks can model non-linear dynamical systems. Recurrent neural networks have been successfully applied to a wide range of applications including speech, gesture and signature recognition [1, 2, 3]. One limitation of neural networks is the difficulty of training them using gradient descent learning; a network may get trapped in a local minimum, resulting in poor training and generalization performance.
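The role of the feedback connections can be illustrated with a minimal sketch of a first-order recurrent update, written in plain Python. The function names, the tanh activation, and the weight values are illustrative assumptions, not the architecture used in this thesis.

```python
import math

def rnn_step(x, h_prev, W_in, W_rec, b):
    """One recurrent update: h_t = tanh(W_in x_t + W_rec h_{t-1} + b)."""
    h = []
    for i in range(len(b)):
        total = b[i]
        total += sum(W_in[i][j] * x[j] for j in range(len(x)))
        # Feedback term: the previous hidden state enters the new state.
        total += sum(W_rec[i][k] * h_prev[k] for k in range(len(h_prev)))
        h.append(math.tanh(total))
    return h

def run_rnn(sequence, W_in, W_rec, b):
    """Feed a whole sequence through the network; return the final hidden state."""
    h = [0.0] * len(b)
    for x in sequence:
        h = rnn_step(x, h, W_in, W_rec, b)
    return h

# Hypothetical weights: one input unit, two hidden units.
W_in = [[1.0], [0.5]]
W_rec = [[0.0, 0.8], [-0.8, 0.0]]
b = [0.0, 0.0]

# The same inputs in a different order lead to different final states,
# because the hidden state carries information about earlier inputs.
h_a = run_rnn([[1.0], [0.0]], W_in, W_rec, b)
h_b = run_rnn([[0.0], [1.0]], W_in, W_rec, b)
print(h_a, h_b)
```

A feedforward network given only the last input could not distinguish the two sequences; the recurrent term is what makes the final state depend on the order of the inputs.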

1.2.2 Hidden Markov Models

Hidden Markov models have been popular tools for automatic speech recognition [6]. In a regular Markov model, the state is directly visible to the observer, so the state transition probabilities are the only parameters. In a hidden Markov model, the state is not directly visible, but the variables influenced by the states are. Each state has a probability distribution over the possible output tokens, so the sequence of tokens generated by a hidden Markov model gives some information about the sequence of states. In a first-order hidden Markov model, the state at time t+1 depends only on the state at time t, regardless of the states at earlier times [12]. This first-order assumption is generally inappropriate for speech signals, where dependencies often extend through several states; nevertheless, hidden Markov models have been very successful for certain types of speech recognition [13].
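As a small illustration of these parameters, the sketch below computes the probability of an observed token sequence under a first-order hidden Markov model with the standard forward algorithm. The two-state model, its state names, and all probabilities are made-up values chosen only for the example.

```python
def forward(observations, start_p, trans_p, emit_p):
    """Return P(observations) under a first-order HMM (forward algorithm)."""
    states = list(start_p)
    # alpha[s] = P(tokens seen so far, current hidden state = s)
    alpha = {s: start_p[s] * emit_p[s][observations[0]] for s in states}
    for obs in observations[1:]:
        # First-order assumption: the next state depends only on the current one.
        alpha = {
            s: emit_p[s][obs] * sum(alpha[r] * trans_p[r][s] for r in states)
            for s in states
        }
    return sum(alpha.values())

# Hypothetical model: hidden states "A" and "B", output tokens "x" and "y".
start_p = {"A": 0.6, "B": 0.4}
trans_p = {"A": {"A": 0.7, "B": 0.3}, "B": {"A": 0.4, "B": 0.6}}
emit_p = {"A": {"x": 0.9, "y": 0.1}, "B": {"x": 0.2, "y": 0.8}}

p_xy = forward(["x", "y"], start_p, trans_p, emit_p)
print(p_xy)
```

The observer never sees whether the model was in "A" or "B"; the forward pass sums over every hidden state path that could have produced the tokens.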

1.2.3 Finite-State Automata and Knowledge Representation

Finite-state automata represent dynamical behaviour and are a useful framework for studying recurrent neural networks as no feature extraction is necessary. A deterministic finite automaton is a finite automaton in which exactly one transition to a next state exists for each pair of state and input symbol. A deterministic finite automaton reads in a string of input symbols and performs a state transition for each input symbol. When the last input symbol has been received, the automaton either accepts or rejects the string depending on the output of the state it has reached. A fuzzy finite automaton is a finite-state automaton in which, for each pair of state and input symbol, there exists a set of possible successor states.
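The two kinds of automaton can be contrasted with a short sketch. Both machines below are hypothetical examples, not automata from this thesis, and the max-min rule used to combine fuzzy transition weights is one common convention that is assumed here for illustration.

```python
def dfa_accepts(string, delta, start, accepting):
    """Run a DFA: exactly one successor per (state, symbol) pair."""
    state = start
    for symbol in string:
        state = delta[(state, symbol)]
    return state in accepting

# Hypothetical DFA: accepts binary strings with an even number of 1s.
delta = {
    ("even", "0"): "even", ("even", "1"): "odd",
    ("odd", "0"): "odd", ("odd", "1"): "even",
}

def ffa_membership(string, fuzzy_delta, start, accepting):
    """Degree to which a fuzzy automaton accepts a string (max-min, assumed)."""
    degrees = {start: 1.0}
    for symbol in string:
        nxt = {}
        for state, degree in degrees.items():
            # A (state, symbol) pair may have several weighted successors.
            for succ, weight in fuzzy_delta.get((state, symbol), {}).items():
                nxt[succ] = max(nxt.get(succ, 0.0), min(degree, weight))
        degrees = nxt
    return max((d for s, d in degrees.items() if s in accepting), default=0.0)

# Hypothetical fuzzy automaton with weighted transitions.
fuzzy_delta = {
    ("q0", "a"): {"q1": 0.7, "q2": 0.3},
    ("q1", "b"): {"q2": 0.5},
    ("q2", "b"): {"q2": 1.0},
}

print(dfa_accepts("1001", delta, "even", {"even"}))
print(ffa_membership("ab", fuzzy_delta, "q0", {"q2"}))
```

The deterministic machine returns a crisp accept or reject, while the fuzzy machine returns a membership degree between 0 and 1 for the same kind of input string.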

Symbolic or expert knowledge can be inserted into neural networks prior to training for better training and generalization performance. It has been shown that deterministic finite-state automata can be directly encoded into recurrent neural networks

Compared to gradient descent training of neural networks, evolutionary neural learning tends to drive the network out of local minima, resulting in better generalization performance.
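The idea of evolutionary neural learning can be sketched with a very small genetic algorithm that searches the weight space of a network directly, without gradients. Everything in this sketch is an assumption made for illustration: the tiny feedforward network, the XOR data, truncation selection, one-point crossover, Gaussian mutation, and all parameter values. It is not the training procedure used in this thesis.

```python
import math
import random

def predict(weights, x):
    """Tiny 2-2-1 network with tanh units; weights is a flat list of 9 values."""
    h0 = math.tanh(weights[0] * x[0] + weights[1] * x[1] + weights[2])
    h1 = math.tanh(weights[3] * x[0] + weights[4] * x[1] + weights[5])
    return math.tanh(weights[6] * h0 + weights[7] * h1 + weights[8])

def error(weights, data):
    """Sum of squared errors over the data set (lower is fitter)."""
    return sum((predict(weights, x) - y) ** 2 for x, y in data)

def evolve(data, pop_size=30, generations=200, sigma=0.3, seed=0):
    """Evolve weight vectors; return the best one and the best error per generation."""
    rng = random.Random(seed)
    population = [[rng.uniform(-1, 1) for _ in range(9)] for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        population.sort(key=lambda w: error(w, data))
        history.append(error(population[0], data))
        parents = population[: pop_size // 2]  # fitter half survives unchanged
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, 9)  # one-point crossover
            child = [w + rng.gauss(0, sigma) for w in a[:cut] + b[cut:]]  # mutation
            children.append(child)
        population = parents + children
    population.sort(key=lambda w: error(w, data))
    history.append(error(population[0], data))
    return population[0], history

XOR = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]
best, history = evolve(XOR)
print(history[0], history[-1])
```

Because the search keeps a population of candidate weight vectors and perturbs them randomly, it is not tied to the gradient at a single point, which is the intuition behind its ability to leave local minima.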

1.2.5 Speech Recognition

A speech sequence contains a huge amount of irrelevant information. Feature extraction reduces speech to its salient features, which are then used for modelling. Recurrent neural networks and hidden Markov models have been successfully applied to modelling speech sequences [1, 6]. They have been applied to recognize words and phonemes. The performance of a speech recognition system can be measured in terms of accuracy and speed. Recurrent neural networks are capable of modelling complicated sequences. They have shown higher recognition accuracy than hidden Markov models on low-quality, noisy data. However, hidden Markov models have been shown to perform better when it comes to large vocabularies. Speech recognition has been researched extensively for more than forty years; however, scientists have been unable to implement systems which perform well in environments with background noise and come anywhere near human recognition performance.

1.3 Research Hypothesis

Modelling of real-world time-varying sequences such as speech, signature and gesture is difficult. These sequences have dynamical characteristics which can be modelled by recurrent neural networks and hidden Markov models. In Section 1.2, the limitations of both these systems have been discussed. Hybrid systems aim at combining the strengths of different paradigms while, at the same time, alleviating their respective weaknesses. The combination of recurrent neural networks and hidden Markov models may yield a powerful structure which may deal with the individual limitations of these systems.

Finite-state automata represent dynamical behaviour and are useful models for studying recurrent neural networks. Recurrent neural networks can learn and represent finite-state automata in their internal states. These issues are addressed through the following hypotheses:

1.3.1 Learning Finite-State Automata

In Section 1.2.1, it has been discussed that recurrent neural networks can represent dynamical systems. Finite-state automata represent dynamical behaviour and are useful frameworks for studying recurrent neural networks as no feature extraction is necessary. Recurrent neural networks can represent deterministic finite automata in their internal structure after training on sample strings which represent such automata.

The hypothesis is that recurrent neural networks can also learn and represent fuzzy finite automata despite the fact that computation in recurrent neural networks is deterministic.
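For the deterministic case, the "sample strings which represent such automata" can be pictured as a labelled data set: every string up to some length, paired with the automaton's accept or reject decision. The sketch below builds such a set for a hypothetical two-state automaton; the function name, the automaton, and the length limit are illustrative assumptions and do not describe the data sets used in this thesis.

```python
from itertools import product

def label_strings(delta, start, accepting, alphabet, max_len):
    """List (string, label) pairs for all strings up to max_len symbols."""
    samples = []
    for n in range(max_len + 1):
        for symbols in product(alphabet, repeat=n):
            state = start
            for s in symbols:
                state = delta[(state, s)]
            # Label 1 if the automaton accepts the string, 0 otherwise.
            samples.append(("".join(symbols), 1 if state in accepting else 0))
    return samples

# Hypothetical target automaton: accepts strings with an even number of 1s.
delta = {
    ("even", "0"): "even", ("even", "1"): "odd",
    ("odd", "0"): "odd", ("odd", "1"): "even",
}

samples = label_strings(delta, "even", {"even"}, "01", max_len=3)
print(len(samples), samples[:4])
```

A recurrent network trained on such pairs receives each string one symbol at a time and is asked to reproduce the label after the last symbol. For a fuzzy automaton the label would be a membership degree rather than 0 or 1.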