2 – Preliminaries




Abstract




Bivariate or multivariate frequency analysis entails univariate distributions that are determined by empirical fitting to data. The fitting, in turn, requires the determination of distribution parameters and the assessment of the goodness of fit. In practical applications, such as hydrologic design, risk analysis is also needed. The objective of this chapter, therefore, is to briefly discuss these basic elements, which are needed for frequency analysis and will be needed in subsequent chapters.





2 Preliminaries




2.1 Univariate Probability Distributions


Among the univariate distributions, we will briefly discuss the most commonly applied continuous univariate distributions, especially in univariate hydrological frequency analyses (Kite, 1977; Singh, 1998; Rao and Hamed, 2000; Singh and Zhang, 2016). In what follows, we will use X as an independent identically distributed (IID) random variable with probability density function (PDF) f(x)fx and cumulative distribution function (CDF) F(x)Fx.



2.1.1 Normal Distribution


Normal distribution: The PDF and CDF of the normal distribution can be given as follows:


fx=1σ2π0.5exp−x−μ22σ2;Fx=Φx−μσ;μ∈ℝ,σ>0(2.1)

In Equation (2.1), ΦΦ represents the standard normal distribution, and μ, σμ,σ are the location and scale parameters having the connotation of mean and standard deviation of the random variable, respectively. Defining the standard normal variable z = (x − μ)/σz=x−μ/σ, Equation (2.1) can be written as


fz=12πexp−z22;Fz=12π∫−∞zexp−t22dt;F−z=1−Fz(2.1a)

Abramowitz and Stegun (1965) have numerically approximated F(z) with an error less than 7.5 × 10−57.5×10−5 as



F(z) = 1 − f(z)[a1z + a2Z2 + a3z3 + a4z4 + a5z5] + ϵ(z)
Fz=1−fza1z+a2z2+a3z3+a4z4+a5z5+ϵz
(2.1b)

where a1 − 0.319381530, a2 =  − 0.356563782, a3 = 1.781477937, a4 =  − 1.821255978, a5 = 1.330274429a1=−0.319381530,a2=−0.356563782,a3=1.781477937,a4=−1.821255978,a5=1.330274429, and ϵ(z)ϵz is the error of approximation.


In hydrological frequency analysis, the normal distribution has been commonly applied in two scenarios:




  1. 1. Normal distribution with mean of zero is the classic assumption for time series analysis and regression analysis. As a simple example, let Y be the response or prediction variable and X be the predictor variable. Then, a simple linear regression can be expressed as


    EYX=Ŷ=a+bx;e=Y−Ŷande~N0σe2(2.2)
    where e is the residual or error and e~N0σe2 denotes that e is distributed normally with mean 0 and variance σe2. E[YX]EYX denotes the conditional expectation of Y given X. denotes the predicted response through simple linear regression with intercept of a and slope of b.


    For example, a stationary time series {Xtt = 1, 2, …}Xtt=12… modeled by an Autoregressive and Moving Average (ARMA) model with (p, q) (Box et al., 2007) as follows:


    xt=c+ϕ1xt−1+…+ϕpxt−p+et+θ1et−1+…+θqet−q;et~N0σet2(2.3)


    In Equation (2.3), c is the long-term average of the time series, and ϕ1, …, ϕp; θ1, …, θqϕ1,…,ϕp;θ1,…,θq are, respectively, the coefficients for autoregressive and moving average terms. More specifically, in Equations (2.2) and (2.3), the residual e, following normal distribution with mean of 0 is commonly called white Gaussian noise.



  2. 2. After certain monotone transformation (e.g., Box–Cox or probability integral transformation), the normal distribution (Equation (2.1)) may be applied to model the nonnormally distributed hydrologic variables (e.g., Hazen, 1914; Markovic, 1965).



2.1.2 Log-Normal Distribution


Let Y =  ln (x).Y=lnx. If X follows the log-normal distribution, then its logarithm follows the normal distribution, whose PDF can be written as follows:


fx=1xσ2π0.5exp−lnx−μ22σ2;x>0(2.4)

The CDF of the log-normal distribution can be computed again through the standard normal distribution as follows:


Fx=Φlnx−μσ(2.5)

The logarithm of the random variable X is a special case of the Box–Cox transformation (Box and Cox, 1964) with λ = 0λ=0:


xT=xλ−1λ,λ≠0lnx,λ=0(2.5a)

The log-normal distribution has been widely used in hydrological frequency analysis (e.g., Chow, 1954).



2.1.3 Student t Distribution


Similar to the normal distribution, the Student t distribution is also bell-shaped (Hogg and Craig, 1978). However, it possesses the heavy tail, i.e., excess kurtosis is greater than 0. The PDF of the standard Student t distribution is given as follows:


fxν=Γν+12νπ0.5Γν21+x2ν−ν+12(2.6a)

And its CDF is given as follows:


Fxν=12+xΓν+12F2112ν+1232−x2ννπ0.5Γν2(2.6b)

In Equations (2.6a) and (2.6b), νν represents the degree of freedom. It is worth to note that with the degree of freedom, the Student t distribution will converge to normal distribution, i.e., the excess kurtosis is approaching 0. It may be explained using the excess kurtosis of Student t distribution as follows: limν→∞exkurtosis=limν→∞6ν−4=0. And 2F1F21 represents the hypergeometric function as follows:


F2112ν+1232−x2ν=∑n=0∞12nν+12n−x2νn32nn!(2.6c)

In Equation (2.6c), the Pochhammer symbol is defined as follows:


an=1n=0aa+1⋯a+n−1n>0(2.6d)


2.1.4 Exponential and Gamma Distributions


The exponential distribution is a special case of the gamma distribution (Hogg and Craig, 1978). These two distributions have been commonly applied in rainfall and flood frequency analyses. The gamma distribution can be given as follows:


fx=1Γαβαxα−1exp−xβ;x≥0,α,β:shapeandscale paramters,α,β>0(2.7)

When the shape parameter α = 1α=1, the gamma distribution is reduced to the exponential distribution as follows:


fx=1βexp−xβ(2.7a)

whose CDF is simply


Fx=1−exp−xβ(2.7b)

The CDF of the gamma distribution can be expressed as follows:


Fx=γαxβΓα(2.8)

where


γαxβ=∫0xβtα−1e−tdt−Lower incomplete gamma function(2.8a)

The gamma function can be expressed as follows:


Γα=∫0∞tα−1e−tdt(2.8b)

with the following properties:


Γα+1=αΓα,α>0;Γα=Γα+1α,α<1andΓn=n−1!;Γ2=Γ1=1;Γ12=π;

n is an integer. Abramowitz and Stegun (1965) have numerically approximated the gamma function for 0 < α ≤ 10<α≤1 with an absolute error less than 3 × 10−73×10−7 as Γα=1+∑i=18aiαi+ϵα,



where a1 =  − 0.57191652, a2 = 0.988205891, a3 =  − 0.897056937, α4 = 0.918206857
wherea1=−0.57191652,a2=0.988205891,a3=−0.897056937,α4=0.918206857


a5 =  − 0.756704078, α6 = 0.482199394, α − 7 =  − 0.193527818, α8 = 0.035868343
a5=−0.756704078,α6=0.482199394,α−7=−0.193527818,α8=0.035868343

For other values of α, the gamma function properties can be used to compute the gamma function. For example,



Γ(4.25) = 3.25Γ(3.25) = 3.25(2.25)Γ(2.25) = 3.25(2.25)(1.25)Γ(1.25).
Γ4.25=3.25Γ3.25=3.252.25Γ2.25=3.252.251.25Γ1.25.

Besides the exponential distribution being a special case of Gamma distribution, the chi-square distribution is also a special case of gamma distribution by setting α=k2, where k denotes the degree of freedom and usually taking the integers, and β = 2.



2.1.5 Generalized Extreme Value (GEV) and Extreme Value (EV) Distributions


Introduced by Jenkinson (1955) and recommended by the Natural Environment Research Council (1975) of Great Britain, the GEV distribution has been widely applied for flood frequency analysis. The EV distributions may be directly obtained from the GEV distribution. The PDF and CDF of the GEV distribution can be written as follows:


fxabc=1a1−bx−ca1−bbexp−1−bx−ca1b,1−bx−ca>0(2.9a)

Fx=exp−1−bx−ca1b(2.9b)

In Equations (2.9a) and (2.9b), a, b, and c are the scale, shape, and location parameters, respectively, and the range of variable X depends on the sign of parameter b.


The EV distributions can be derived, depending on the shape parameter b.



EV I Distribution (b = 0)

The EV I distribution may also be called the Gumbel distribution (Gumbel, 1941). It is a popular distribution for flood, drought, and rainfall frequency analyses. The PDF and CDF of EV 1 distribution can be written as follows:


fxac=1aexp−x−ca−exp−x−ca;x≥c(2.10a)

Fxac=exp−exp−x−ca(2.10b)

The coefficient of skewness is 1.1396 and the X ranges as x ∈ [c, ∞)x∈c∞.



EV II Distribution (b < 0)

The EV II distribution is also called Fréchet distribution (Gumbel, 1958) that has also been applied to frequency analysis. The PDF and CDF of the EV II distribution can be written as follows:


fxacβ=−1b=βax−ca−β−1exp−−x−ca−β,a,β>0(2.11a)

Fxacβ=exp−x−ca−β(2.11b)

The coefficient of skewness is greater than 1.1396 and X can take on values in the range x∈c+ak∞, which makes it appropriate for flood frequency analysis.



EV III Distribution (b > 0)

Belonging to the Weibull family (i.e., inverse Weibull distribution), the EV III distribution is usually applied for low-flow frequency analysis (Singh, 1998). The PDF and CDF of the EV III distribution can be written as follows:


fxacβ=1b=βa−x−caβ−1exp−−x−caβ;x≤c(2.12a)

Fxacβ=exp−−x−caβ(2.12b)

The coefficient of skewness is less than 1.396 and variable X ranges as x∈−∞c+αβ, which does not render it suitable for flood frequency analysis.



2.1.6 Weibull Distribution


The Weibull distribution (Rosin and Rammler, 1933) is commonly applied for low-flow frequency analysis, hazard functional analysis, as well as risk and reliability analysis. The PDF and CDF of the Weibull distribution can be written as follows:


fxab=abxba−1exp−xba;x>0,a,b>0(2.13a)

Fxab=1−exp−xba(2.13b)

The Weibull distribution is a reverse GBV distribution.



Pearson and Log-Pearson Type III Distributions

These two distributions are commonly applied for flood frequency analysis (Singh, 1998). The log-Pearson type III distribution is the standard method for flood frequency analysis in the United States, whereas the Pearson type III distribution is the standard method in China.



Pearson Type III Distribution

The PDF and CDF of Pearson type III distribution can be written as follows:


fxabc=1aΓbx−cab−1exp−x−ca;x≥c,a>0,b>0(2.14a)

Fx=1Γbγbx−ca(2.14b)

Using y = (x − c)/ay=x−c/a Equations (2.14a) and (2.14b) can be written as


fy=1aΓbyb−1exp−y(2.14c)

Fy=1Γb∫0ytb−1exp−tdt=γbyΓb(2.14d)

The value of F(y) can be determined in the same way as for the gamma distribution discussed earlier.



Log-Pearson Type III Distribution

Similar to the log-normal distribution, if random variable X follows the log-Pearson type III distribution, then its logarithm Y =  ln XY=lnX follows the Pearson type III distribution. The PDF and CDF of log-Pearson type III distribution can be written as follows:


fxabc=1axΓblnx−cab−1exp−lnx−ca;x>expc,a>0,b>0(2.15a)

Fxabc=1Γbγblnx−ca(2.15b)


2.1.7 Burr XII Distribution


The PDF and CDF of Burr XII distribution (Burr, 1942) can be written as follows:


fxabc=bc1+xac−k−1xc−1ac,x≥0,a,b,c>0(2.16a)

Fxabc=1−1+xac−k(2.16b)


2.1.8 Log-Logistic Distribution


The log-logistic distribution is also known as Fisk distribution (Shoukri et al., 1988). Its PDF and CDF can be written as follows:


fxab=baxab−11+xab2;x>0,a>0,b>0(2.17a)

Fxab=xbab+xb(2.17b)

Equation (2.17b) can be used to directly express a quantile. Equations (2.17) can also be generalized by including the location parameter.



2.1.9 Pareto Distribution


There are four distributions in the Pareto family (Arnold, 1983). The two- and three-parameter Pareto distributions have been used for modeling large floods. The PDF and CDF of the two-parameter Pareto distribution can be written as follows:


fx=axmaxa+1,x≥xm;fx=0ifx>xm(2.18a)

Fx=1−xmxa,x≥xm;Fx=0ifx<xm(2.18b)

There are many other distributions that have been applied in frequency analysis (Singh and Zhang, 2016), besides the distributions illustrated in this section.



2.2 Bivariate Distributions


Here we discuss the commonly applied bivariate distributions in bivariate hydrologic analyses.



2.2.1 Bivariate Gamma Distribution


Several different bivariate gamma distributions have been applied in bivariate hydrological analyses. For all the bivariate gamma distributions introduced, their margins (or marginals) are univariate gamma distribution with the PDF and CDF given as Equations (2.7) and (2.8).



Izawa Bigamma Model

The joint PDF of Izawa bigamma model (Izawa, 1965) is given for random variables X and Y as follows:


fxy=xyn−12xmexp−βxx+βyy1−ηΓnΓmβxβy−n+12βx−m1−ηηn−12×∫011−tn−12tm−1expβxηxt1−ηIn−12βxβyηxy1−t1−ηdt(2.19)

where


Ish=∑k=0∞h2s+2kk!Γs+k+1(2.19a)

η=ραxαy;0≤ρ<1;0≤η<1,αx≤αy(2.19b)

In the preceding expressions, Is(⋅)Is⋅ is the modified Bessel function of the first kind; ηη is the association parameter between XX and YY; ρρ is Pearson’s product-moment correlation coefficient of X and Y; X~gamma(xαxβx); and Y~gamma(yαyβy)X~gammaxαxβx;andY~gammayαyβy.


The limitations of the Izawa bigamma distribution are that (i) the shape parameter of X is less than that of Y; and (ii) it may only model the positively correlated random variables.



Moran Model

The PDF of the Moran model (Moran, 1969) of X and YXandY with the gamma marginals can be written as


fxy=11−ρN2fXxαxβxfYyαyβyexp−ρNx′2−2ρNx′y′+ρNy′221−ρN2(2.20)

where x = Φ−1(FX(xαxβx)), y = Φ−1(FY(yαyβy))x′=Φ−1FXxαxβx,y′=Φ−1FYyαyβy, ρNρN represents Pearson’s product-moment correlation coefficient of the transformed variables x and yx′andy′.



Smith–Adelfang–Tubbs (SAT) Model

Again with gamma marginals, Smith et al. (1982) developed the another bivariate model (i.e., the SAT model). Its PDF and CDF of the SAT model can be expressed as follows:


fxy=K1K2∑j=0∞∑k=0∞cjkβxxjηβyyj+k,0<η<1fXxαxβxfYyαyβy,η=0(2.20a)

FxyJ∑j=0∞∑j=0∞djkHx1−ηαx+jHy1−ηαy+j+k;0<η<1FXxαxβxFYyαyβy,η=0(2.20b)

K1=βxxλx−1βyyλy−1exp−βx+βy1−η(2.20c)


K2 = (1 − η)αxΓ(αx)Γ(αy − αx)
K2=1−ηαxΓαxΓαy−αx
(2.20d)


cjk=ηj+kΓαy−αx+k1−η2j+kΓαy+j+kj!k!(2.20e)

η=ραy/αx(2.20f)

J=1−ηαyΓαxΓαy−αx(2.20g)

djk=ηj+kΓαy−αx+kΓαy+j+kj!k!(2.20h)

Hza=∫0zta−1e−tdt(2.20i)


Farlie–Gumbel–Morgenstern (FGM) Model

This bivariate model was first proposed by Morgenstern (1956). Its PDF and CDF of the FGM model for random variables X and Y can be expressed as follows:



f(xy) = fX(x)fY(y)(1 + η (2FX(x) − 1)(2FY(y) − 1)))
fxy=fXxfYy1+η2FXx−12FYy−1
(2.21a)


F(xy) = FX(x)FY(y)(1 + η(1 − FX(x))(1 − FY(y)))
Fxy=FXxFYy1+η1−FXx1−FYy
(2.21b)

where {fX(x), fY(y)}fXxfYy and {FX(x), FY(y)}FXxFYy are the marginal PDFs and CDFs of XX and YY, respectively, and ηη is the correlation coefficient between XX and YY.



Gumbel Mixed (GM) Model

The GM model has been applied to model the bivariate flood frequency analysis (Yue et al., 1999). The CDF of the GM model may be expressed as follows:


Fxy=FXxFYyexp−θ1lnFXx+1lnFYy−1;0≤θ≤1(2.22)

where θθ is the association parameters of the GM model, which describes the dependence between random variables XX and YY as follows:


θ=21−cosπρ6,0≤ρ≤23(2.22a)

where ρρ is Pearson’s product moment correlation coefficient.


It should be noted that the marginal CDFs of random variable X and Y are the Gumbel distribution (i.e., Equation (2.10b)) in the case of the conventional GM model.



Gumbel Logistic (GL) Model

The Gumbel logistic model was first proposed by Gumbel (Gumbel, 1960, 1961). With the Gumbel-distributed marginals (Equation (2.10)), the CDF of the GL model can be expressed as follows:


Fxy=exp−−lnFXxη+−lnFYyη1η;η≥1(2.23)

where


η=11−ρ;0≤ρ≤1(2.23a)

As the association parameter of the GL model, ηη describes the dependence between two random variables.



Bivariate Exponential Model

Marshall and Ingram (1967), Singh and Singh (1991), and Bacchi et al. (1994) proposed the bivariate exponential distribution that can be expressed as follows:



fXY(xy) = ab[(1 + acx)(1 + bcy) − ρ] exp (−ax − by − abcxy)
fXYxy=ab1+acx1+bcy−ρexp−ax−by−abcxy
(2.24a)

where X and Y are exponentially distributed as



fX(x) = a exp  (−ax); fY(y) = b  exp (−by)
fXx=aexp−ax;fYy=bexp−by
(2.24b)

and c represents the association between 0 and 1 between X and Y defined through the coefficient of correlation as


ρ=−1+∫0∞11+cxexp−xdx(2.24c)

This bivariate model is valid for ρ between 0 and –0.404.



Nagao–Kadoya Bivariate Exponential (BVE) Model

With the exponential distributed random variables XX and YY (Equation (2.7a)), the PDF of the BEV model (Balakrisinan and Lai, 2009) can be expressed as follows:


fxy=αβ1−ρexp−αx+βy1−ρI02ραβxy1−ρ,x≥0,y≥0,0≤ρ<1(2.25)

where


I02ραβxy1−ρ=∑k=0∞ραβxy1−ρ2k1k!2(2.25a)

In Equations (2.25) and (2.25a), ρρ is the Pearson correlation coefficient between X and Y; α, βα,β are the parameters of exponential variables X and Y, respectively, as X~ exp (α), Y~ exp (β)X~expα,Y~expβ from Equation (2.7a); and I0andI0 is the modified Bessel function of the first kind.



2.2.2 Bivariate Normal Distribution


The bivariate normal distribution is also applied in bivariate hydrological frequency analysis. Let X and Y follow normal distribution (Equation (2.1)). Then the bivariate normal distribution can be written as follows:


fxy=12πσXσY1−ρ2exp−qxy2(2.26)

qxy=11−ρ2expx−μXσX2−2ρx−μXσXy−μYσY+y−μYσY2(2.26a)


2.2.3 Bivariate Log-Normal Distribution


For the log-normally distributed random variables X and Y (Equation (2.4)), the joint distribution may be expressed with the bivariate log-normal distribution as follows:


fxy=12πσX1σY11−ρ2explnx−μX1σX12−2ρlnx−μX1σX1lny−μY1σY1lny−μY1σY12(2.27)

μX1=lnμX1+σX2μX20.5;μY1=lnμY1+σY2μY20.5(2.27a)

σX1=ln1+σX2μX212;σY1=ln1+σY2μY212(2.27b)

where μX, σX; μY, σYμX,σX;μY,σY are the mean and standard deviations of random variables X and Y; and ρρ is the Pearson correlation coefficient of (lnX, lnY)lnXlnY.


From the preceding commonly applied bivariate probability distribution models, it is seen that (1) the bivariate gamma and exponential family may only model the positive dependence; (2) the bivariate normal and log-normal distribution may model the dependence in the entire range; and (3) the marginal distributions of all the models belong to the same type of univariate distribution, i.e., gamma, exponential, normal, and log-normal distributions.



2.3 Estimation of Parameters of Probability Distributions


The dependence of the commonly applied conventional bivariate distributions are associated with the Pearson correlation coefficient of the bivariate random variables. In this section, we will only briefly review the parameter estimation for univariate probability distributions.


There are a number of methods that may be applied to estimate the parameters of univariate distributions (Singh, 1998; Rao and Hamed, 2000). These methods are (1) method of moments (MOM), (2) method of maximum likelihood estimation (MLE), (3) method of probability weighted moments (PWM), (4) method of L-moments (LM), (5) method of least squares (LS), (6) method of maximum entropy (MAX_ENT), (7) method of mixed moments (MIX), (8) the generalized method of moments (GMM), and (9) incomplete means method (ICM). Let X be a random variable with density function f(xα1α2, …, αk)fxα1α2…αk in which ααs are the parameters and X = [x1x2, …, xn]X=x1x2…xn is the sample drawn from the population. In what follows, we will introduce the four most commonly applied methods in hydrology and water resources engineering, i.e., the MOM, MLE, PWM, and LM methods.



2.3.1 Method of Moments


The MOM is a natural and relatively easy parameter estimation method for univariate distributions. However, MOM is usually inferior in quality and not as efficient as the MLE, especially for distributions with a large number of parameters (three or more). This is partly because higher-order moments are more likely to be biased for relatively small samples (Rao and Hamed, 2000).


MOM assumes that the sample moments are equal to the population moments, that is, the sample is sufficiently large to be representative of the population. Given the probability distribution with k parameters α1, …, αkα1,…,αk, we can compute k sample moments from the sample X = {x1, …, xn},X=x1…xn, such as sample mean , sample standard deviation (SX)SX, sample skewness coefficient (g1)g1, and sample kurtosis. The relation between moments and parameters of the probability distribution is then established by simultaneously solving k equations for the unknown parameters: α1, …, αkα1,…,αk. It is worth noting that the first moment is computed about the origin, while the other sample moments are about the first moment (mean). We will illustrate parameter estimation by MOM for normal, gamma, Weibull, and Gumbel distributions as examples.


The rth-moment ratio, denoted as CrCr, is defined as follows:


CrX=μrXμ2Xr2(2.28)

From Equation (2.28), we can see the following:


Coefficient of variation:C1=μ1Xμ2X0.5=μσ(2.28a)

Coefficient of skewness:C3=Cs=μ3Xμ2X1.5(2.28b)


Coefficient of kurtosis:C4=Ck′=μ4Xμ2X2(2.28c)

In addition, the classical moment diagram is graphed using the possible pairs (β1β2)β1β2, which are related to C3C3 and C4C4 as follows:


β1=C32=Cs2,β2=C4=Ck′(2.28d)



Example 2.1 Estimate parameters of the normal distribution by MOM.


Solution: With the PDF of normal distribution given in Equation (2.1) and letting α1 = μ; α2 = σα1=μ;α2=σ, we can estimate parameters α1 and α2α1andα2 by the solving the following two equations:


μ1=∫−∞+∞x1α22π0.5exp−x−α122α22dx:population mean(2.29a)

μ2=∫−∞+∞x−μ121α22π0.5exp−x−α122α22dx:population variance(2.29b)

The following equates the sample mean to the population mean and the sample variance VAR(X) to the population variance:


X¯=1N∑i=1Nxi=μ1=∫−∞+∞x1α22π0.5exp−x−α122α22dx(2.29c)

VARX=1N∑i=1Nxi−X¯2=μ2=∫−∞+∞x−μ121α22π0.5exp−x−α122α22dx(2.29d)

In Equation (2.29), N is replaced by (N−1) to correct for the bias due to sample size.


Solving Equations (2.29c) and (2.29d) simultaneously, we get the following:


α̂1=m1=1N∑i=1Nxi;α̂22=m2=1N−1∑i=1Nxi−m12(2.29e)



Example 2.2 Estimate parameters of gamma distribution by MOM.


Solution: The PDF of gamma distribution is given as Equation (2.7). Let α1 = αα1=α and α2 = β.α2=β. The first moment of gamma distribution can be written as follows:


μ1=∫0∞xfxdx=∫0∞α2α1xα1Γα1exp−α2xdx=α2α1Γα1∫0∞xα1exp−α2xdx=α2α1Γα1Γα1+1α2α1+1=α1α2(2.30a)

The variance of gamma distribution can be given as follows:


μ2=∫0∞x−μ12fxdx=∫0∞x−α1α22α2α1xα1−1Γα1exp−α2xdx=α1α22(2.30b)

Substituting the sample mean and variance as m1 = μ1; m2 = μ2m1=μ1;m2=μ2, we can estimate the parameters by solving Equations (2.30a) and (2.30b) simultaneously as follows:


α2=m1m2=∑i=1Nxi∑i=1Nxi−X¯2;α1=∑i=1NxiX¯∑i=1Nxi−X¯2;X¯=1N∑i=1Nxi(2.30c)

It is worth noting that the exponential distribution is a special case of gamma distribution with α1 = 1α1=1, and α2 = 1/m1α2=1/m1.




Example 2.3 Estimate parameters of Weibull distribution using MOM.


Solution: The PDF of Weibull distribution is given as Equation (2.13a). Let α1 = a, α2 = b.α1=a,α2=b. Then we can write the population mean as follows:


μ1=∫0∞xfxdx=∫0∞xα1α2xα2α1−1exp−xα2α1dx(2.31a)

Only gold members can continue reading. Log In or Register to continue

Oct 12, 2020 | Posted by in Water and Sewage | Comments Off on 2 – Preliminaries
Premium Wordpress Themes by UFO Themes