Docsity
Docsity

Prepare-se para as provas
Prepare-se para as provas

Estude fácil! Tem muito documento disponível na Docsity


Ganhe pontos para baixar
Ganhe pontos para baixar

Ganhe pontos ajudando outros esrudantes ou compre um plano Premium


Guias e Dicas
Guias e Dicas


Neural Networks in Power System Protection: An Overview, Notas de estudo de Engenharia Elétrica

An overview of the use of neural networks in power system protection, focusing on digital relaying and sampling techniques. It discusses the importance of neural networks in detecting hidden failures and improving the performance of power system protection systems.

Tipologia: Notas de estudo

2011

Compartilhado em 10/08/2011

alcides-neto-10
alcides-neto-10 🇧🇷

4.7

(7)

29 documentos

1 / 18

Toggle sidebar

Esta página não é visível na pré-visualização

Não perca as partes importantes!

bg1
5
Digital Relaying
Jame s S. Thorp
Virginia Polytechnic Institute
5.1 Sampling .............................................................................. 5-2
5.2 Antialiasing Filters............................................................... 5-2
5.3 Sigma-Delta A=D Converters............................................. 5-2
5.4 Phasors from Samples......................................................... 5-4
5.5 Symmetrical Components .................................................. 5-5
5.6 Algorithms ........................................................................... 5-7
Parameter Estimation . Least Squares Fitting . DFT .
Differential Equations . Kalman Filters . Wavelet
Transforms . Neural Networks
Digital relaying had its origins in the late 1960s and early 1970s with pioneering papers by
Rockefeller (1969), Mann and Morrison (1971), and Poncelet (1972) and an early field experiment
(Gilcrest et al., 1972; Rockefeller and Udren, 1972). Because of the cost of the computers in those
times, a single high-cost minicomputer was proposed by Rockefeller (1969) to perform multiple
relaying calculations in the substation. In addition to having high cost and high power requirements,
early minicomputer systems were slow in comparison with modern systems and could only perform
simple calculations. The well-founded belief that computers would get smaller, faster, and cheaper
combined with expectations of benefits of computer relaying kept the field moving. The third IEEE
tutorial on microprocessor protection (Sachdev, 1997) lists more then 1100 publications in the area
since 1970. Nearly two thirds of the papers are devoted to developing and comparing algorithms. It is
not clear this trend should continue. Issues beyond algorithms should receive more attention in
the future.
The expected benefits of microprocessor protection have largely been realized. The ability of a digital
relay to perform self-monitoring and checking is a clear advantage over the previous technology. Many
relays are called upon to function only a few cycles in a year. A large percentage of major disturbances
can be traced to ‘‘hidden failures’ in relays that were undetected until the relay was exposed to certain
system conditions (Tamronglak et al., 1996). The ability of a digital relay to detect a failure within itself
and remove itself from service before an incorrect operation occurs is one of the most important
advantages of digital protection.
The microprocessor revolution has created a situation in which digital relays are the relays of choice
because of economic reasons. The cost of conventional (analog) relays has increased while the hardware
cost of the most sophisticated digital relays has decreased dramatically. Even including substantial
software costs, digital relays are the economic choice and have the additional advantage of having
lower wiring costs. Prior to the introduction of microprocessor-based systems, several panels of space
and considerable wiring was required to provide all the functions needed for each zone of transmission
line protection. For example, an installation requiring phase distance protection for phase-to-phase and
three-phase faults, ground distance, ground-overcurrent, a pilot scheme, breaker failure, and reclosing
logic demanded redundant wiring, several hundred watts of power, and a lot of panel space. A single
microprocessor system is a single box, with a ten-watt power requirement and with only direct wiring,
has replaced the old system.
ß2006 by Taylor & Francis Group, LLC.
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12

Pré-visualização parcial do texto

Baixe Neural Networks in Power System Protection: An Overview e outras Notas de estudo em PDF para Engenharia Elétrica, somente na Docsity!

Digital Relaying

James S. Thorp

Virginia Polytechnic Institute

5.1 Sampling .............................................................................. 5 - 5.2 Antialiasing Filters............................................................... 5 - 5.3 Sigma-Delta A=D Converters............................................. 5 - 5.4 Phasors from Samples......................................................... 5 - 5.5 Symmetrical Components .................................................. 5 - 5.6 Algorithms ........................................................................... 5 - Parameter Estimation.^ Least Squares Fitting.^ DFT. Differential Equations.^ Kalman Filters.^ Wavelet Transforms.^ Neural Networks

Digital relaying had its origins in the late 1960s and early 1970s with pioneering papers by Rockefeller (1969), Mann and Morrison (1971), and Poncelet (1972) and an early field experiment (Gilcrest et al., 1972; Rockefeller and Udren, 1972). Because of the cost of the computers in those times, a single high-cost minicomputer was proposed by Rockefeller (1969) to perform multiple relaying calculations in the substation. In addition to having high cost and high power requirements, early minicomputer systems were slow in comparison with modern systems and could only perform simple calculations. The well-founded belief that computers would get smaller, faster, and cheaper combined with expectations of benefits of computer relaying kept the field moving. The third IEEE tutorial on microprocessor protection (Sachdev, 1997) lists more then 1100 publications in the area since 1970. Nearly two thirds of the papers are devoted to developing and comparing algorithms. It is not clear this trend should continue. Issues beyond algorithms should receive more attention in the future. The expected benefits of microprocessor protection have largely been realized. The ability of a digital relay to perform self-monitoring and checking is a clear advantage over the previous technology. Many relays are called upon to function only a few cycles in a year. A large percentage of major disturbances can be traced to ‘‘hidden failures’’ in relays that were undetected until the relay was exposed to certain system conditions (Tamronglak et al., 1996). The ability of a digital relay to detect a failure within itself and remove itself from service before an incorrect operation occurs is one of the most important advantages of digital protection. The microprocessor revolution has created a situation in which digital relays are the relays of choice because of economic reasons. The cost of conventional (analog) relays has increased while the hardware cost of the most sophisticated digital relays has decreased dramatically. Even including substantial software costs, digital relays are the economic choice and have the additional advantage of having lower wiring costs. Prior to the introduction of microprocessor-based systems, several panels of space and considerable wiring was required to provide all the functions needed for each zone of transmission line protection. For example, an installation requiring phase distance protection for phase-to-phase and three-phase faults, ground distance, ground-overcurrent, a pilot scheme, breaker failure, and reclosing logic demanded redundant wiring, several hundred watts of power, and a lot of panel space. A single microprocessor system is a single box, with a ten-watt power requirement and with only direct wiring, has replaced the old system.

Modern digital relays can provide SCADA, metering, and oscillographic records. Line relays can also provide fault location information. All of this data can be available by modem or on a WAN. A LAN in the substation connecting the protection modules to a local host is also a possibility. Complex multifunction relays can have an almost bewildering number of settings. Techniques for dealing with setting management are being developed. With improved communication technology, the possibility of involving microprocessor protection in wide-area protection and control is being considered.

5.1 Sampling

The sampling process is essential for microprocessor protection to produce the numbers required by the processing unit to perform calculations and reach relaying decisions. Both 12 and 16 bit A=D converters are in use. The large difference between load and fault current is a driving force behind the need for more precision in the A=D conversion. It is difficult to measure load current accurately while not saturating for fault current with only 12 bits. It should be noted that most protection functions do not require such precise load current measurement. Although there are applications, such as hydro generator protection, where the sampling rate is derived from the actual power system frequency, most relay applications involve sampling at a fixed rate that is a multiple of the nominal power system frequency.

5.2 Antialiasing Filters

ANSI=IEEE Standard C37.90, provides the standard for the Surge Withstand Capability (SWC) to be built into protective relay equipment. The standard consists of both an oscillatory and transient test. Typically the surge filter is followed by an antialiasing filter before the A=D converter. Ideally the signal x(t) presented to the A=D converter x(t) is band-limited to some frequency vc, i.e., the Fourier transform of x(t) is confined to a low-pass band less that vc such as shown in Fig. 5.1. Sampling the low-pass signal at a frequency of vs produces a signal with a transform made up of shifted replicas of the low-pass transform as shown in Fig. 5.2. If vs  vc > vc, i.e., vs > 2 vc as shown, then an ideal low-pass filter applied to z(t) can recover the original signal x(t). The frequency of twice the highest frequency present in the signal to be sampled is the Nyquist sampling rate. If vs < 2 vc the sampled signal is said to be ‘‘aliased’’ and the output of the low-pass filter is not the original signal. In some applications the frequency content of the signal is known and the sampling frequency is chosen to avoid aliasing (music CDs), while in digital relaying applications the sampling frequency is specified and the frequency content of the signal is controlled by filtering the signal before sampling to insure its highest frequency is less than half the sampling frequency. The filter used is referred to as an antialising filter. Aliasing also occurs when discrete sequences are sampled or decimated. For example, if a high sampling rate such as 7200 Hz is used to provide data for oscillography, then taking every tenth sample provides data at 720 Hz to be used for relaying. The process of taking every tenth sample (decimation) will produce aliasing unless a digital anti- aliasing filter with a cut-off frequency of 360 Hz is provided.

5.3 Sigma-Delta A=D Converters

There is an advantage in sampling at rates many times the Nyquist rate. It is possible to exchange speed of sampling for bits of resolution. So called Sigma-Delta A=D converters are based on one bit sampling at very high rates. Consider a signal x(t) sampled at a high rate T ¼ 1 =fs, i.e., x[n] ¼ x(nT) with the difference between the current sample and a times the last sample given by

X(ω)

−ωc ωc ω

FIGURE 5.1 The Fourier Transform of a band-limited function.

5.4 Phasors from Samples

A phasor is a complex number used to represent sinu- soidal functions of time such as AC voltages and currents. For convenience in calculating the power in AC circuits from phasors, the phasor magnitude is set equal to the rms value of the sinusoidal waveform. A sinusoidal quan- tity and its phasor representation are shown in Fig. 5.5, and are defined as follows:

Sinusoidal quantity Phasor

y(t) ¼ Y (^) m cos (vt þ f) Y ¼ Y (^) m ffiffiffi 2

p ejf^

A phasor represents a single frequency sinusoid and is not directly applicable under transient conditions. However, the idea of a phasor can be used in transient conditions by considering that the phasor represents an estimate of the fundamental frequency component of a waveform observed over a finite window. In case of N samples yk , obtained from the signal y(t) over a period of the waveform:

Y ¼

ffiffiffi 2

p

N

XN

k¼ 1

y (^) k ejk^

(^2) Np (5:4)

or,

Y ¼

ffiffiffi 2

p

N

XN

k¼ 1

y (^) k cos k2p N

 j

XN

k¼ 1

y (^) k sin k2p N

Using u for the sampling angle 2p=N, it follows that

Y ¼

ffiffiffi 2

p

N

Yc  jY (^) s

where

Yc ¼

XN

k¼ 1

y (^) k cos (ku)

Y (^) s ¼

XN

k¼ 1

y (^) k sin (ku)

Note that the input signal y(t) must be band-limited to Nv=2 to avoid aliasing errors. In the presence of white noise, the fundamental frequency component of the Discrete Fourier Transform (DFT) given by Eqs. (5.4)–(5.7) can be shown to be a least-squares estimate of the phasor. If the data window is not a multiple of a half cycle, the least-squares estimate is some other combination of Y (^) c and Ys , and is no longer given by Eq. (5.6). Short window (less than one period) phasor computations are of interest in some digital relaying applications. For the present, we will concentrate on data windows that are multiples of a half cycle of the nominal power system frequency. The data window begins at the instant when sample number 1 is obtained as shown in Fig. 5.5. The sample set y (^) k is given by

φ

φ

Y

Y

FIGURE 5.5 Phasor representation.

y (^) k ¼ Y (^) m cos (ku þ f) (5:8)

Substituting for y (^) k from Eq. (5.8) in Eq. (5.4),

Y ¼

ffiffiffi 2

p

N

XN

k¼ 1

Y (^) m cos (ku þ w)ejku^ (5:9)

or

Y ¼

ffiffiffi 2

p Y (^) m e jf^ (5:10)

which is the familiar expression Eq. (5.3), for the phasor representation of the sinusoid in Eq. (5.3). The instant at which the first data sample is obtained defines the orientation of the phasor in the complex plane. The reference axis for the phasor, i.e., the horizontal axis in Fig. 5.5, is specified by the first sample in the data window. Equations (5.6)–(5.7) define an algorithm for computing a phasor from an input signal. A recursive form of the algorithm is more useful for real-time measurements. Consider the phasors computed from two adjacent sample sets: y (^) kfk ¼ 1,2,    , Ng and, y^0 kfk ¼ 2,3,    , N þ 1 g, and their corresponding phasors Y 1 and Y 20 respectively:

Y^1 ¼

ffiffiffi 2

p

N

XN

k¼ 1

y (^) k ejku^ (5:11)

Y^2

0 ¼

ffiffiffi 2

p

N

XN

k¼ 1

y (^) kþ 1 ejku^ (5:12)

We may modify Eq. (5.12) to develop a recursive phasor calculation as follows:

Y 2 ¼ Y 2

0 eju^ ¼ Y 1 þ

ffiffiffi 2

p

N

y (^) Nþ 1  y (^1)

eju^ (5:13)

Since the angle of the phasor Y^20 is greater than the angle of the phasor Y 1 by the sampling angle u, the phasor Y 2 has the same angle as the phasor Y 1. When the input signal is a constant sinusoid, the phasor calculated from Eq. (5.13) is a constant complex number. In general, the phasor Y, corresponding to the data y (^) kfk ¼ r, r þ 1, r þ 2,    , N þ r  1 g is recursively modified into Y rþ^1 according to the formula

Yrþ^1 ¼ Y r^ eju^ ¼ Y r^ þ

ffiffiffi 2

p

N

y (^) Nþr  y (^) r

eju^ (5:14)

The recursive phasor calculation as given by Eq. (5.13) is very efficient. It regenerates the new phasor from the old one and utilizes most of the computations performed for the phasor with the old data window.

5.5 Symmetrical Components

Symmetrical components are linear transformations on voltages and currents of a three phase network. The symmetrical component transformation matrix S transforms the phase quantities, taken here to be voltages Ef, (although they could equally well be currents), into symmetrical components ES :

systems so that crucial performance features of the network, such as the power flows in transmission lines could be determined with a degree of confidence. Many other important measures of power system performance, such as contingency evaluation, stability margins, etc., can be expressed in terms of the state of the power system, i.e., the phasors. Accuracy of time synchronization directly translates into the accuracy with which phase angle differences between various phasors can be measured. Phase angles between the ends of transmission lines in a power network may vary between a few degrees, and may approach 180 8 during particularly violent stability oscillations. Under these circumstances, assuming that one may wish to measure angular differences as little as 1 8 , one would want the accuracy of measurement to be better than 0.1 8. Fortunately, synchronization accuracies of the order of 1 msec are now achievable from the Global Positioning System (GPS) satellites. One microsecond corresponds to 0.022 8 for a 60 Hz power system, which more than meets our needs. Real-time phasor measurements have been applied in static state estimation, frequency measurement, and wide area control.

5.6 Algorithms

5.6.1 Parameter Estimation

Most relaying algorithms extract information about the waveform from current and voltage waveforms in order to make relaying decisions. Examples include: current and voltage phasors that can be used to compute impedance, the rms value, the current that can be used in an overcurrent relay, and the harmonic content of a current that can be used to form a restraint in transformer protection. An approach that unifies a number of algorithms is that of parameter estimation. The samples are assumed to be of a current or voltage that has a known form with some unknown parameters. The simplest such signal can be written as

y(t) ¼ Y (^) c cos v 0 t þ Y (^) s sin v 0 t þ e(t) (5:19)

where v 0 is the nominal power system frequency, Y (^) c and Y (^) s are unknown quantities, and e(t) is an error signal (all the things that are not the fundamental frequency signal in this simple model). It should be noted that in this formulation, we assume that the power system frequency is known. If the numbers, Y (^) c and Ys were known, we could compute the fundamental frequency phasor. With samples taken at an interval of T seconds,

y (^) n ¼ y(nT) ¼ Y (^) c cos nu þ Ys sin nu þ e(nT) (5:20)

where u ¼ v 0 T is the sampling angle. If signals other than the fundamental frequency signal were present, it would be useful to include them in a formulation similar to Eq. (5.19) so that they would be included in e(t). If, for example, the second harmonic were included, Eq. (5.19) could be modified to

y (^) n ¼ Y (^) 1c cos nu þ Y1s sin nu þ Y (^) 2c cos 2nu þ Y (^) 2s sin 2nu þ e(nT) (5:21)

It is clear that more samples are needed to estimate the parameters as more terms are included. Equation (5.21) can be generalized to include any number of harmonics (the number is limited by the sampling rate), the exponential offset in a current, or any known signal that is suspected to be included in the post-fault waveform. No matter how detailed the formulation, e(t) will include unpredictable contri- butions from:

. (^) The transducers (CTs and PTs) . (^) Fault arc . (^) Traveling wave effects . (^) A=D converters

. (^) The exponential offset in the current . (^) The transient response of the antialising filters . (^) The power system itself

The current offset is not an error signal for some algorithms and is removed separately for some others. The power system generated signals are transients depending on fault location, the fault incidence angle, and the structure of the power system. The power system transients are low enough in frequency to be present after the antialiasing filter. We can write a general expression as

y (^) n ¼

XK

k¼ 1

sk (nT)Y (^) k þ e (^) n (5:22)

If y represents a vector of N samples, and Y a vector of K unknown coefficients, then there are N equations in K unknowns in the form

y ¼ SY þ e (5:23)

The matrix S is made up of samples of the signals s (^) k.

S ¼

s 1 (T) s 2 (T)    sK (T) s 1 (2T) s 2 (2T)    sK (2T) ...^ ...^ ... s 1 (NT) s 2 (NT)    sK (NT)

The presence of the error e and the fact that the number of equations is larger than the number of unknowns (N > K) makes it necessary to estimate Y.

5.6.2 Least Squares Fitting

One criterion for choosing the estimate ^YY is to minimize the scalar formed as the sum of the squares of the error term in Eq. (5.23), viz.

eT^ e ¼ (y  SY) T^ (y  SY) ¼

XN

n¼ 1

e (^2) n (5:25)

It can be shown that the minimum least squared error [the minimum value of Eq. (5.25)] occurs when

YY^ ^ ¼ (ST^ S)^1 S T^ y ¼ By (5:26)

where B ¼ (S TS)^1 S T. The calculations involving the matrix S can be performed off-line to create an ‘‘algorithm,’’ i.e., an estimate of each of the K parameters is obtained by multiplying the N samples by a set of stored numbers. The rows of Eq. (5.26) can represent a number of different algorithms depending on the choice of the signals s (^) k(nT) and the interval over which the samples are taken.

5.6.3 DFT

The simplest form of Eq. (5.26) is when the matrix S TS is diagonal. Using a signal alphabet of cosines and sines of the first N harmonics of the fundamental frequency over a window of one cycle of the fundamental frequency, the familiar Discrete Fourier Transform (DFT) is produced. With

Using trapezoidal integration to evaluate the integrals (assuming t is small)

ðt^2

t (^1)

v(t)dt ¼ R

ðt^2

t 1

i(t)dt þ L(i(t 2 )  i(t 1 )) (5:33)

ðt^2

t (^1)

v(t)dt ¼ R

ðt^2

t 1

i(t)dt þ L(i(t 2 )  i(t 1 )) (5:34)

R and L are given by

R ¼

(v (^) kþ 1 þ v (^) k )(i (^) kþ 2  ikþ 1 )  (v (^) kþ 2 þ v (^) kþ 1 )(ikþ 1  ik ) 2 i (^) k ikþ 2  i^2 kþ 1

L ¼

T

(v (^) kþ 2 þ v (^) kþ 1 )(ikþ 1 þ ik )  (v (^) kþ 1 þ v (^) k )(ikþ 2 þ ikþ 1 ) 2 ik ikþ 2  i^2 kþ 1

It should be noted that the sample values occur in both numerator and denominator of Eqs. (5.35) and (5.36). The denominator is not constant but varies in time with local minima at points where both the current and the derivative of the current are small. For a pure sinusoidal current, the current and its derivative are never both small but when an offset is included there is a possibility of both being small once per period. Error signals for this algorithm include terms that do not satisfy the differential equation such as the currents in the shunt elements in the line model required by long lines. In intervals where the denominator is small, errors in the numerator of Eqs. (5.35) and (5.36) are amplified. The resulting estimates can be quite poor. It is also difficult to make the window longer than three samples. The complexity of solving such equations for a larger number of samples suggests that the short window results be post processed. Simple averaging of the short-window estimates is inappropriate, however. A counting scheme was used in which the counter was advanced if the estimated R and L were in the zone and the counter was decreased if the estimates lay outside the zone (Chen and Breingan, 1979). By requiring the counter to reach some threshold before tripping, secure operation can be assured with a cost of some delay. For example, if the threshold were set at six with a sampling rate of 16 times a cycle, the fastest trip decision would take a half cycle. Each ‘‘bad’’ estimate would delay the decision by two additional samples. The actual time for a relaying decision is variable and depends on the exact data. The use of a median filter is an alternate to the counting scheme (Akke and Thorp, 1997). The median operation ranks the input values according to their amplitude and selects the middle value as the output. Median filters have an odd number of inputs. A length five median filter has an input-output relation between input x[n] and output y[n] given by

y[n] ¼ median{x[n  2], x[n  1], x[n], x[n þ 1], x[n þ 2]} (5:37)

Median filters of length five, seven, and nine have been applied to the output of the short window differential equation algorithm (Akke and Thorp, 1997). The median filter preserves the essential features of the input while removing isolated noise spikes. The filter length rather than the counter scheme, fixes the time required for a relaying decision.

5.6.4.2 Transformer Protection Algorithms

Virtually all algorithms for the protection of power transformers use the principle of percentage differ- ential protection. The difference between algorithms lies in how the algorithm restrains the differential trip for conditions of overexcitation and inrush. Algorithms based on harmonic restraint, which

parallel existing analog protection, compute the second and fifth harmonics using Eq. (5.10) (Thorp and Phadke, 1982). These algorithms use current measurements only and cannot be faster than one cycle because of the need to compute the second harmonic. The harmonic calculation provides for secure operation since the transient event produces harmonic content which delays relay operation for about a cycle. In an integrated substation with other microprocessor relays, it is possible to consider transformer protection algorithms that use voltage information. Shared voltage samples could be a result of multiple protection modules connected in a LAN in the substation. The magnitude of the voltage itself can be used as a restraint in a digital version of a ‘‘tripping suppressor’’ (Harder and Marter, 1948). A physical model similar to the differential equation model for a faulted line can be constructed using the flux in the transformer. The differential equation describing the terminal voltage, v(t), the winding current, i(t), and the flux linkage L(t) is:

v(tÞ  L di(t) dt

dL(t) dt

where L is the leakage inductance of the winding. Using trapezoidal integration for the integral in Eq. (5.38)

ðt^2

t (^1)

v(t)dt  L[i(t 2 )  i(t 1 )] ¼ L(t 2 )  L(t 1 ) (5:39)

gives

L(t 2 )  L(t 1 ) ¼

T

[v(t 2 ) þ v(t 1 )]  L[i(t 2 )  i(t 1 )] (5:40)

or

Lkþ 1 ¼ Lk þ

T

[v (^) kþ 1 þ v (^) k ]  L[ikþ 1  ik ] (5:41)

Since the initial flux L 0 in Eq. (5.41) cannot be known without separate sensing, the slope of the flux current curve is used

dL di

k

T

½v (^) k þ v (^) k 1 Š ik  i (^) k 1

 L (5:42)

The slope of the flux current characteristic shown in Fig. 5.7 is different depending on whether there is a fault or not. The algorithm must then be able to dif- ferentiate between inrush (the slope alternates between large and small values) and a fault (the slope is always small). The counting scheme used for the differential equation algorithm for line protection can be adapted to this application. The counter increases if the slope is less than a threshold and the differential current indi- cates trip, and the counter decreases if the slope is greater than the threshold or the differential does not indicate trip.

Λ

Fault

i

FIGURE 5.7 The flux-current characteristic compared to fault conditions.

and

Hk ¼ [ cos (kC) sin (kC)1] (5:50)

The measurement covariance matrix was

Rk ¼ KekDt=T^ (5:51)

with T chosen as half the line time constant and different Ks for voltage and current. The Kalman filter estimates phasors for voltage and current as the DFT algorithms. The filter must be started and terminated using some other software. After the calculations begin, the data window continues to grow until the process is halted. This is different from fixed data windows such as a one cycle Fourier calculation. The growing data window has some advantages, but has the limitation that if started incorrectly, it has a hard time recovering if a fault occurs after the calculations have been initiated. The Kalman filter assumes an initial statistical description of the state x, and recursively updates the estimate of state. The initial assumption about the state is that it is a random vector independent of the processes wk and v (^) k and with a known mean and covariance matrix, P 0. The recursive calculation involves computing a gain matrix K (^) k. The estimate is given by

^xxkþ 1 ¼ Fk^xx (^) k þ Kkþ 1 [z (^) kþ 1  H (^) kþ 1 ^xxk ] (5:52)

The first term in Eq. (5.52) is an update of the old estimate by the state transition matrix while the second is the gain matrix K (^) kþ 1 multiplying the observation residual. The bracketed term in Eq. (5.52) is the difference between the actual measurement, z (^) k, and the predicted value of the measurement, i.e., the residual in predicting the measurement. The gain matrix can then be computed recursively. The amount of computation involved depends on the state vector dimension. For the linear problem described here, these calculations can be performed off-line. In the absence of the decaying measurement error, the Kalman filter offers little other than the growing data window. It has been shown that at multiples of a half cycle, the Kalman filter estimate for a constant error covariance is the same as that obtained from the DFT.

5.6.6 Wavelet Transforms

The Wavelet Transform is a signal processing tool that is replacing the Fourier Transform in many applications including data compression, sonar and radar, communications, and biomedical applica- tions. In the signal processing community there is considerable overlap between wavelets and the area of filter banks. In applications in which it is used, the Wavelet Transform is viewed as an improvement over the Fourier Transform because it deals with time-frequency resolution in a different way. The Fourier Transform provides a decomposition of a time function into exponentials, e jvt, which exist for all time. We should consider the signal that is processed with the DFT calculations in the previous sections as being extended periodically for all time. That is, the data window represents one period of a periodic signal. The sampling rate and the length of the data window determine the frequency resolution of the calculations. While these limitations are well understood and intuitive, they are serious limitations in some applications such as compression. The Wavelet Transform introduces an alternative to these limitations. The Fourier Transform can be written

X ðv Þ ¼

ð^1



x(t)ejvt^ dt (5:53)

The effect of a data window can be captured by imagining that the signal x(t) is windowed before the Fourier Transform is computed. The function h(t) represents the windowing function such as a one- cycle rectangle.

X(v, t) ¼

ð^1



x(t)h(t  t)ejvt^ dt (5:54)

The Wavelet Transform is written

X(s, t) ¼

(^1) ð



x(t)

ffiffi s

p h t  t s

dt (5:55)

where s is a scale parameter and t is a time shift. The scale parameter is an alternative to the frequency parameter of the Fourier Transform. If h(t) has Fourier Transform H(v), then h(t=s) has Fourier Transform H(sv). Note that for a fixed h(t) that large, s compresses the transform while small s spreads the transform in frequency. There are a few requirements on a signal h(t) to be the ‘‘mother wavelet’’ (essentially that h(t) have finite energy and be a bandpass signal). For example, h(t) could be the output of a bandpass filter. It is also true that it is only necessary to know the Wavelet Transform at discrete values of s and t in order to be able to represent the signal. In particular

s ¼ 2 m^ , t ¼ n2m^ m ¼... ,  2,0,1,2,3,... n ¼... ,  2,0,1,2,3,...

where lower values of m correspond to smaller values of s or higher frequencies. If x(t) is limited to a band B Hz, then it can be represented by samples at TS ¼ 1 =2B sec.

x(n) ¼ x(nT (^) s )

Using a mother wavelet corresponding to an ideal bandpass filter illustrates a number of ideas. Figure 5. shows the filters corresponding to m ¼ 0,1,2, and 3 and Fig. 5.9 shows the corresponding time functions. Since x(t) has no frequencies above B Hz, only positive values of m are necessary. The structure of the process can be seen in Fig. 5.10. The boxes labeled LPFR and HPFR are low and high pass filters with cutoff frequencies of R Hz. The circle with the down arrow and a 2 represents the process of taking every other sample. For example, on the first line the output of the bandpass filter only has a bandwidth of B= 2 Hz and the samples at T (^) S sec can be decimated to samples at 2T (^) S sec. Additional understanding of the compression process is possible if we take a signal made of eight numbers and let the low pass filter be the average of two consecutive samples (x(n) þ x(n þ 1))= 2 and the high pass filter to be the difference (x(n)  x(n þ 1))=2 (Gail and Nielsen, 1999). For example, with

x(n) ¼ [  2  28  46  44  20 12 32 30]

we get

h 1 ðk 1 Þ ¼ [13  1  16 1] h 2 ðk 2 Þ ¼ [7  8 :5] h 3 ðk 3 Þ ¼ [7:75] l 3 ðk 3 Þ ¼ [ 0 :75]

If we truncate to form

h 1 ðk 1 Þ ¼ [16 0  16 0] h 2 ðk 2 Þ ¼ [8  8] h 3 ðk 3 Þ ¼ [8] l 3 ðk 3 Þ ¼ [0]

and reconstruct the original sequence

~xx(n) ¼ [0  32  48  48  24 8 32 32]

The original and reconstructed compressed waveform is shown in Fig. 5.11. Wavelets have been applied to relaying for systems grounded through a Peterson coil where the form of the wavelet was chosen to fit unusual waveforms the Peterson coil produces (Chaari et al., 1996).

5.6.7 Neural Networks

Artificial Neural Networks (ANNs) had their beginning in the ‘‘perceptron,’’ which was designed to recognize patterns. The number of papers suggesting relay application have soared. The attraction is the

x(n) 2

2

2

2

2

2

HPF (^) B/

LPFB/2 HPFB/

HPFB/

h 1 (k 1 )

h 2 (k 2 )

h 3 (k 3 )

l 3 (k 3 )

LPFB/

LPFB/

FIGURE 5.10 Cascade filter structure.

40 30 20 10 0 − 10 Original Compressed − 20 − 30 − 40 − 50 − 60

FIGURE 5.11 Original and compressed signals.

use of ANNs as pattern recognition devices that can be trained with data to recognize faults, inrush, or other protection effects. The basic feed forward neural net is composed of layers of neurons as shown in Fig. 5.12. The function F is either a threshold function or a saturating function such as a symmetric sigmoid function. The weights w (^) i are determined by training the network. The training process is the most difficult part of the ANN process. Typically, simulation data such as that obtained from EMTP is used to train the ANN. A set of cases to be executed must be identified along with a proposed structure for the net. The structure is described in terms of the number of inputs, neuron in layers, various layers, and outputs. An example might be a net with 12 inputs, and a 4, 3, 1 layer structure. There would be 4  12 plus 4  3 plus 3  1 or 63 weights to be determined. Clearly, a lot more than 60 training cases are needed to learn 63 weights. In addition, some cases not used for training are needed for testing. Software exists for the training process but judgment in determining the training sequences is vital. Once the weights are learned, the designer is frequently asked how the ANN will perform when some combination of inputs are presented to it. The ability to answer such questions is very much a function of the breadth of the training sequence. The protective relaying application of ANNs include high-impedance fault detection (Eborn et al., 1990), transformer protection (Perez et al., 1994), fault classification (Dalstein and Kulicke, 1995), fault direction determination, adaptive reclosing (Aggarwal et al., 1994), and rotating machinery protection (Chow and Yee, 1991).

References

Aggarwal, R.K., Johns, A.T., Song, Y.H., Dunn, R.W., and Fitton, D.S., Neural-network based adaptive single-pole autoreclosure technique for EHV transmission systems, IEEE Proceedings—C, 141, 155, 1994. Akke, M. and Thorp, J.S., Improved estimates from the differential equation algorithm by median post- filtering, IEEE Sixth Int. Conf. on Development in Power System Protection, Univ. of Nottingham, UK, March 1997. Chaari, O, Neunier, M., and Brouaye, F., Wavelets: A new tool for the resonant grounded power distribution system relaying, IEEE Trans. on Power Delivery, 11, 1301, July 1996. Chen, M.M. and Breingan, W.D., Field experience with a digital system with transmission line protec- tion, IEEE Trans. on Power Appar. and Syst., 98, 1796, Sep.=Oct. 1979. Chow, M. and Yee, S.O., Methodology for on-line incipient fault detection in single-phase squirrel-cage induction motors using artificial neural networks, IEEE Trans. on Energy Conversion, 6, 536, Sept. 1991. Dalstein, T. and Kulicke, B., Neural network approach to fault classification for high speed protective relaying, IEEE Trans. on Power Delivery, 10, 1002, Apr. 1995.

x

w 1

Inputs

Input Layer Output Layer

Outputs

Hidden Layer

Φ

Φ(Σ

n i= w 2 wi xi) x

xn

FIGURE 5.12 One neuron and a neural network.