10 – Rainfall Frequency Analysis

Abstract

In this chapter, we will illustrate the application of copulas in rainfall frequency analysis. This chapter is divided into two parts: (1) rainfall depth-duration frequency (DDF) analysis; and (2) multivariate rainfall frequency (i.e., four-dimensional) analysis. The rainfall data from the watersheds in the United States are collected and applied for analyses. The Archimedean, meta-elliptical, and vine copulas are applied to model the dependence among rainfall variables. Application shows that the DDF may be modeled by the Gumbel–Hougaard copula. Both vine and meta-elliptical copulas may be applied to model the spatial dependence of rainfall variables. Compared to the vine copula, modeling is easier to do when applying the meta-elliptical copula.

10 Rainfall Frequency Analysis

10.1 Introduction

Rainfall frequency analysis is of fundamental importance for hydrologic and hydraulic engineering design. In what follows, we will first introduce some examples with regard to rainfall analysis. Rainfall intensity-duration-frequency (IDF) or rainfall depth-duration frequency (DDF) curves published by National Oceanographic Atmospheric Administration (NOAA) are classic examples of rainfall frequency analysis. The IDF (or DDF) curves are derived first by separating rainfall events based on their durations (e.g., 15 minutes, 30 minutes, one hour, etc.) and then by fitting a univariate probability distribution to the rainfall depth or intensity data of a certain duration. The fitted univariate distribution is applied to produce a family of rainfall depth-frequency curves. In this manner, the two-dimensional depth-duration analysis is reduced to a one-dimensional analysis, involving only intensity (or depth) corresponding to a fixed duration. As described by the NOAA documents (e.g., TP-40), the IDF (or DDF) curves may be estimated from either annual maximum series or partial duration series. The IDF (or DDF) curves are widely applied in hydrological and hydraulic engineering design.

The rational method relates rainfall intensity (I) of a given duration (normally equal to the time of concentration) of a certain return period to peak runoff (discharge) (Q), where the peak runoff is assumed as a linear function of rainfall (Q = CIA)Q=CIA), where A is the area of the drainage basin. In this method, rainfall of a certain return period results in the runoff peak of exactly the same return period. To date, the rational method is commonly applied in urban hydrology (e.g., urban rainfall and runoff analysis) and urban hydraulic engineering design (e.g., detention/retention basin design, storm sewer design, and highway drainage design).

The SCS method, developed by Soil Conservation Service (now, the Natural Resources Conservation Service), may be applied to larger areas compared to the rational method (usually less than 60 acres [about 25 hectares]) for estimating runoff of a given rainfall amount. This method estimates the amount of surface runoff (or excess rainfall) through what is called the Curve Number (CN), which is related to land use and land cover, antecedent soil moisture, hydrologic condition, and soil moisture retention capacity.

The probable maximum precipitation (PMP) method, which does not rely on the IDF (DDF) curve, estimates the maximum amount of precipitation that may probably occur. The PMP analysis is required for the design of dams, dam breach analysis, spillway analysis, design of nuclear power plants, etc. These examples may be considered to illustrate applications of univariate rainfall analysis in hydrologic and hydraulic engineering design.

In the past three decades, bivariate (and multivariate) rainfall frequency analysis has attracted significant attention, because rainfall variables may be correlated and may significantly affect surface runoff (Cordova and Rodriguez-Iturbe, 1985). In the early days, the bivariate exponential distribution was applied to model the correlation structure of extreme rainfall variables (e.g., Hashino, 1985; Singh and Singh, 1991; Bacchi et al., 1994). Later, other bivariate rainfall models were investigated to model the relation between rainfall intensity and rainfall duration, for example, improved derived flood frequency distribution (DFFD) model by Kurothe et al. (1997) and Goel et al. (2000); Yue (2000a, 2000b, 2000c) investigated the applicability of bivariate normal, Gumbel logistic, and Gumbel mixed distributions. Besides the application to river discharge (Favre et al., 2004), the copula theory has been applied to bivariate and multivariate rainfall analysis (Grimaldi et al., 2005; Zhang and Singh, 2007a, 2007b, 2007c; Kao and Govindaraju, 2007, 2008; Cong and Brady, 2012; Zhang et al., 2012; Hao and Singh, 2013; Zhang et al., 2013; Abdul Rauf and Zeephongsekul, 2014; Cantet and Arnaud, 2014; Khedun et al., 2014; Moazami et al., 2014; Vernieuwe et al., 2015; among others).

With the advantages of the copula theory discussed in the preceding chapters, we will illustrate the application of copula theory to bivariate (or multivariate) rainfall frequency analysis. It is assumed that rainfall variables are continuous variates. However, rainfall variables may actually be discrete in nature.

10.2 Rainfall Depth-Duration Frequency (DDF) Analysis

Many studies have employed copulas for bivariate (multivariate) rainfall analysis based on annual maximum series (AMS). In this section, we will use Partial Duration Series (PDS) to illustrate the copula application to derive the DDF curves. The rainfall data with a 15-minute interval were collected for the rain gauge station: coop-166394 near Morgan City, Louisiana. The recorded data cover a period from May 8, 1971, to January 1, 2014. The rainfall data are available upon request from National Climate Data Center (NCDC).

The general procedure for DDF analysis includes the following steps:

1. Separate the rainfall records collected into independent rainfall events. Extract the rainfall depth and rainfall duration from these independent rainfall events obtained.
2. Evaluate the marginal rainfall depth and rainfall duration variables and corresponding marginal distributions.
3. Evaluate the rank-based correlation of rainfall depth and rainfall duration. Choose the possible copula candidates.
4. Perform the rainfall depth and rainfall duration analysis with the use of the possible copula candidates. Select the best-fitted copula functions.
5. Estimate the rainfall depth of given rainfall duration for a given return period.

In what follows, we will discuss how to perform the DDF analysis in detail.

10.2.1 Rainfall Data Processing

Before analyzing bivariate rainfall variables (i.e., rainfall depth and duration), we need to separate the rainfall data into individual rainfall events first. As commonly done, a six-hour duration of no rain is considered as the criterion to separate any two events. From a total of 12,089 available rainfall records for rain gage coop-166394, a total of 2,816 events were identified for the 43-year duration. Table 10.1 illustrates the rainfall event separation year 1971 as an example. From Table 10.1, it can be seen that there are nine independent rainfall events identified from May 22, 1971, to June 29, 1971. Table 10.2 lists the nine rainfall events separated. As an example, we consider the No. 5 event, which started on June 20, 1971, at 14:15 and ended on 15:15 on the same day. Summing up the incremental rainfall depths within this time window, we have the following:

depth=2.54+2.54+10.16+7.62=22.86mm.maximum rainfall intensity=10.16/0.25=40.64mm/h.duration=1h.

Table 10.1. Illustration of rainfall event separation.

Event no.	Date	Rainfall amount (mm)^a	Interarrival time (h)^b
	19710522 15:30	7.62	—
	19710601 00:15	0	—
1	19710605 16:15	2.54	336.75
2	19710616 13:45	2.54	261.50
3	19710618 16:00	5.08	50.25
	19710618 17:30	2.54	1.50
4	19710619 13:30	2.54	20.00
	19710619 13:45	2.54	0.25
	19710619 14:30	2.54	0.75
5	19710620 14:30	2.54	24.00
	19710620 14:45	2.54	0.25
	19710620 15:00	10.16	0.25
	19710620 15:15	7.62	0.25
6	19710621 10:00	2.54	18.75
	19710621 10:15	7.62	0.25
7	19710622 20:00	12.7	33.75
	19710622 20:15	5.08	0.25
	19710622 20:30	5.08	0.25
	19710622 20:45	5.08	0.25
	19710622 21:00	2.54	0.25
	19710622 21:15	2.54	0.25
	19710622 22:00	2.54	0.75
	19710622 22:30	2.54	0.50
8	19710624 15:30	2.54	41.00
9	19710629 12:00	5.08	116.50

Notes:

^a Incremental rainfall depth with 15-minute interval until the time stated.

^b Difference between current day and time with the previous day and time.

Table 10.2. Nine rainfall events separated based on Table 10.1.

No	Depth (mm)	Duration (h)	Max. Intensity (mm/h)^a	Start	End
1	2.54	0.25	10.16	6/5/71 16:00	6/5/71 16:15
2	2.54	0.25	10.16	6/16/71 13:30	6/16/71 13:45
3	7.62	1.75	20.32	6/18/71 15:45	6/18/71 17:30
4	7.62	1.25	10.16	6/19/71 13:15	6/19/71 14:30
5	22.86	1.00	40.64	6/20/71 14:15	6/20/71 15:15
6	10.16	0.50	30.48	6/21/71 9:45	6/21/71 10:15
7	38.1	2.75	50.8	6/22/71 19:45	6/22/71 22:30
8	2.54	0.25	10.16	6/24/71 15:14	6/24/71 15:30
9	5.08	0.25	20.32	6/29/71 11:45	6/29/71 12:00

Note: ^a Maximum average intensity of 15-minute interval.

Similarly, all rainfall events may be separated based on the six-hour duration of no rain criterion. Setting the threshold for the identified rainfall events as follows:

Threshold = median (rainfall depth) + std (rainfall depth)

Threshold=medianrainfall depth+stdrainfall depth(10.1)

From the record, we have median = 7.62 mm and standard deviation = 23.87 mm, which yield the threshold = 31.49 mm. Applying this threshold, we reduced the number of rainfall events to 378 that is roughly about nine events per year. With the partial duration rainfall series thus identified, we can then start to investigate bivariate rainfall characteristics through (i) the investigation of the marginal distribution and (ii) the investigation of dependence.

10.2.2 Investigation of Marginal Distributions: Depth and Duration

Before we investigate marginal distributions, we will first look at a scatter plot of rainfall variables (Figure 10.1(a)). Zooming in on the lower-right corner (Figure 10.1(b)), we see there are ties in both rainfall depth and rainfall duration variables. The kernel density (nonparametric probability density estimation (Wand and Jones, 1995) is applied to approximate the nonparametric probability density and distribution function for univariate rainfall variables. The kernel density function is given as follows:

f̂x=1nh∑i=1nKx−xih

(10.2)

In Equation (10.2), K(.)K. is the kernel function. Here we use the commonly applied K(x) = ϕ(x)Kx=ϕx, i.e., the normal kernel (the normal density function); h is the smoothing parameter, which is also called bandwidth (h = 6.086mm, 1.797hrh=6.086mm,1.797hr for rainfall depth and rainfall duration respectively); and n is the sample size.

Figure 10.1 Scatter plot for rainfall depth and rainfall duration: (a) original; (b) zoomed in at lower-right corner.

To compute the probability density and marginal probability using the kernel density, the MATLAB function is applied as follows:

pdf = ksdensity(x, x₁,^‘support^‘,^‘positive^‘)

pdf=ksdensityxx1,′support′,′positive′(10.2a)

cdf = ksdensity(x, x₁,^‘function^‘,^‘cdf^‘,^‘support^‘, ^‘positive^‘)

cdf=ksdensityxx1,′function′,′cdf′,′support′′positive′(10.2b)

In Equations (10.2a) and (10.2b), x and x₁ represent the random variable and the data points where the nonparametric pdf and cdf need to be evaluated. Figure 10.2 plots the density function as well as the cumulative probabilities for both rainfall variables. The CDF estimated from the kernel density is applied for bivariate analysis using copulas.

Figure 10.2 Frequency and cumulative probability plots with kernel density function for rainfall depth and rainfall duration series.

10.2.3 Bivariate Rainfall Frequency Analysis

The scatter plot in Figure 10.1 indicates positive dependence between rainfall depth and rainfall duration. The rank-based Kendall correlation coefficient is computed as τ_n≈0.32τn≈0.32. Among the copula candidates (i.e., Gumbel–Hougaard, Clayton, Frank, Gaussian, and Student’s t copulas), the Frank copula is found to better model the bivariate rainfall characteristics. Figure 10.3 compares the empirical CDF estimated from the kernel density with the bivariate random variables simulated from the fitted Frank copula with its parameter value of 3.529. Comparison shows that (i) the simulated random variates cover the overall dependence fairly well; and (ii) the tie existing in both rainfall depth and rainfall duration variables may impact the concordance of the bivariate rainfall variables.

Figure 10.3 Comparison of bivariate empirical distribution using kernel density with the random variables simulated from the fitted Frank copula.

However, with the continuous assumption, we will proceed to estimate the rainfall depth for a given duration of a given return period. The exceedance probability (P_exPex) corresponding to a given return period (T) for the partial duration series may be written as follows:

Pex=1μT

(10.3)

In Equation (10.3), μ≈9μ≈9, the average number of events per year.

Equating Equation (10.3) to the exceedance probability of rainfall depth of a given rainfall duration, we have the following:

Praindep>xRaindur=d=1μT

(10.4)

Equation (10.4) is equivalent to the following:

CFrankFdep≤FdepxFdur=Fdurd=1−1μT

(10.5)

In Equation (10.5), C^Frank(F_dep ≤ F_dep(x)| F_dur = F_dur(d)) = P(dep ≤ x| dur = d)CFrankFdep≤FdepxFdur=Fdurd=Pdep≤xdur=d. The conditional copula in Equation (10.5) is listed as #5 in Table 4.2. Applying the kernel density to the given durations of 1, 2, 3, 6, 12, and 24 fours, we have F_dur(d)Fdurd computed as follows:

F_dur(d) = [0.0818, 0.1385, 0.2079, 0.4362, 0.7480, 0.9551]

Fdurd=0.08180.13850.20790.43620.74800.9551.

For the return period of 1, 2, 5, 10, 25, 50, and 100 years, we have the exceedance probability computed using Equation (10.3) directly as follows:

P_ex = [0.8862, 0.9431, 0.9772, 0.9886, 0.9954, 0.9997, 0.9989]

Pex=0.88620.94310.97720.98860.99540.99970.9989.

Substituting F_dur(d), P_exFdurd,Pex into Equation (10.5), we can compute F_dep(x)Fdepx numerically using the bisection method. Finally, we can estimate the corresponding rainfall depth using the inverse of the kernel density (fitted to the observed rainfall depth) with the computed F_dep(x)Fdepx. Table 10.3 lists the estimated F_dep(x)Fdepx and the corresponding estimated rainfall depth. Figure 10.4 compares the rainfall depth estimated from copula-based analysis with the published DDF of partial duration for Morgan City, Louisiana (http://hdsc.nws.noaa.gov/hdsc/pfds/pfds_map_cont.html?bkmrk=la). Comparison shows that (i) for the storms with shorter duration and return periods less than 10 years, the copula estimates are either closely following the NOAA estimates or well within NOAA 90% bounds; (ii) for short durations (i.e., D = 1 and 2 hours) and higher return periods (T ≥ 25 yr)T≥25yr), the copula estimates are higher than the NOAA upper 90% bounds; and (iii) as the storm duration increases, the copula estimates for higher return periods get closer to either NOAA upper 90% bounds or actually closely follow the NOAA estimates.

Table 10.3. Estimated probability distribution of rainfall depth and estimated rainfall depth of given duration with given return period.

	1-yr	2-yr	5-yr	10-yr	25-yr	50-yr	100-yr
				F_dep(x)Fdepx
1-hr	0.5988	0.7364	0.8656	0.9252	0.9677	0.9834	0.9916
2-hr	0.6512	0.7793	0.8921	0.9413	0.9751	0.9873	0.9936
3-hr	0.7034	0.8195	0.9153	0.9548	0.9811	0.9904	0.9952
6-hr	0.8296	0.9064	0.9599	0.9794	0.9916	0.9958	0.9979
12-hr	0.9250	0.9622	0.9848	0.9924	0.9969	0.9985	0.9992
24-hr	0.9594	0.9802	0.9922	0.9961	0.9984	0.9992	0.9996

	Rainfall depth (mm)
1-hr	55.30	68.64	92.78	116.32	166.01	206.38	246.33
2-hr	59.54	74.73	101.12	128.39	182.26	221.99	261.82
3-hr	64.72	81.89	110.86	143.97	198.76	238.62	277.24
6-hr	84.01	106.74	151.78	193.73	246.69	284.32	314.98
12-hr	116.25	155.60	211.31	251.90	299.25	327.08	350.48
24-hr	150.98	195.86	250.48	287.99	326.54	350.01	370.53

Figure 10.4 Comparison of copula estimates with the NOAA estimations with a 90% confidence bound.

The differences between the NOAA-DDF and the copula-based DDF curves may be due to the following:

i. The NOAA-DDF analysis only extracts rainfall events for certain durations. These extracted events are then treated as univariate random variables and are fitted by univariate probability distributions.
ii. In the copula-based DDF analysis, on the other hand, rainfall events extracted may yield different rainfall durations. The bivariate rainfall depth-duration model is then constructed, and the rainfall depth of a given duration is estimated from the conditional probability function of f(depth < depth^∗| duration = duration^∗)fdepth<depth∗duration=duration∗. In this analysis, the duration can take on any value.
iii. The ties that may exist in the NOAA-DDF extracted events may not have the same degree of impact as that of copula-based DDF events. As discussed earlier, there may be many ties in the rainfall depth and duration of the extracted rainfall events (partial duration or annual maximum series), and these tied values may distort the concordance of the bivariate rainfall analysis. Additionally, the rainfall variables (especially rainfall duration) may be discrete in nature.

Even with the differences between the NOAA and copula-based DDF curves constructed for the partial duration time series, the copula-based method may be considered as a rational alternative for rainfall DDF (or IDF) construction with simpler and faster rainfall separation (events regardless of the length of rainfall duration) compared to that of NOAA analysis (rainfall duration–based directly).

10.3 Spatial Analysis of Annual Precipitation

With the assumption of annual precipitation amount as a random variable, the general procedure for spatial analysis of annual precipitation includes the following steps:

1. Select the region of interest, identify the rain gauges, and collect the annual precipitation records.
2. Evaluate the pairwise rank-based correlation coefficient of annual precipitation.
3. Identify the possible vine structure based on the rank-based correlation coefficients computed, and select possible copula candidates for T1 first, and then proceed with the analysis for the rest of the tree structure as discussed in Chapter 5.
4. Identify the proper tree structure for the asymmetric Archimedean copula and then proceed with the analysis as discussed in Chapter 5 for the asymmetric Archimedean copula.
5. Construct the meta-elliptical copula for the multivariate precipitation variables.
6. Compare the performance of different copula construction approaches.

To illustrate the spatial analysis of annual precipitation (rainfall), we will use four NOAA rainfall stations located in the Cuyahoga River Watershed, Ohio (see Table 10.4). The copula model is constructed from the annual rainfall data collected from 1953 to 2012 from NCDC. In this case study, we will apply D-vine, meta-elliptical copulas (i.e., meta-Gaussian and meta-Student T) and asymmetric Archimedean copulas. The reason that a D-vine copula is chosen from the pair copula construction is that there is no obvious center variable governing the dependence structure among all four rainfall stations (see the rank-based Kendall correlation coefficient listed in Table 10.5).

Table 10.4. Annual rainfall amount (mm) at four rain gauges.

Year	Rain gauges
Year	R330058	R336949	R333780	R331458
1953	668.528	634.746	536.702	744.728
1954	855.726	970.788	915.416	970.28
1955	705.866	943.61	943.864	872.998
1956	1,071.88	1,110.996	1,197.61	950.468
1957	735.584	954.532	859.79	839.47
1958	838.962	996.95	1,021.588	923.798
1959	1,025.906	1,334.008	1,240.282	1,075.182
1960	504.952	524.51	791.464	603.504
1961	716.026	773.43	909.828	930.656
1962	567.944	655.828	641.096	678.18
1963	437.134	535.94	499.11	503.428
1964	895.096	817.88	784.352	721.614
1965	754.38	757.682	912.876	757.682
1966	621.792	657.352	787.654	745.236
1967	676.656	684.784	822.452	917.194
1968	796.544	852.932	1,136.142	817.372
1969	738.632	786.384	1,152.652	768.604
1970	879.348	929.132	926.338	754.634
1971	684.022	826.77	786.638	628.142
1972	1,003.554	937.514	1,070.864	934.72
1973	841.248	940.054	1,013.714	1041.146
1974	899.922	1,041.146	1,021.588	1048.004
1975	933.196	1,049.782	998.474	1032.764
1976	759.714	892.556	767.588	1045.21
1977	54.864	957.072	1,027.684	1169.924
1978	699.262	829.31	910.336	805.434
1979	876.046	1,065.53	1,082.294	1132.586
1980	854.71	875.538	863.092	848.106
1981	881.38	914.4	970.534	897.89
1982	733.806	881.38	833.12	845.058
1983	885.19	984.504	1,007.11	851.916
1984	753.364	833.12	827.532	1026.922
1985	852.424	939.8	923.29	1,005.586
1986	721.106	850.646	934.466	1,125.474
1987	618.744	670.56	807.212	897.382
1988	735.33	721.614	744.728	773.938
1989	884.428	897.128	811.022	993.14
1990	1,592.834	1,193.546	1,251.712	1,347.216
1991	530.86	628.142	680.212	716.28
1992	1,019.048	1,010.412	1,069.34	1,020.064
1993	898.652	915.416	811.276	805.434
1994	909.066	867.41	852.17	788.924
1995	813.308	918.718	799.592	830.072
1996	1,128.014	1,097.28	979.424	1,168.146
1997	797.56	861.822	875.284	968.248
1998	978.662	970.788	869.442	963.168
1999	751.586	806.45	791.972	987.044
2000	1,013.968	932.434	969.518	887.222
2001	764.54	771.906	778.256	774.192
2002	931.926	744.982	891.54	837.946
2003	1,149.604	1,142.492	1,205.484	1,036.828
2004	1,049.274	1,088.136	1,056.386	956.818
2005	897.382	982.726	908.812	910.59
2006	1,053.084	1,087.882	1,160.78	1,362.964
2007	918.21	970.534	985.266	1,145.794
2008	887.984	1,012.19	963.422	1,165.606
2009	781.558	899.668	941.832	965.708
2010	774.446	846.328	747.776	927.862
2011	1,360.678	406.908	1,551.432	315.722
2012	782.574	664.718	849.122	782.828

Table 10.5. Kendall’s tau correlation coefficient matrix.

	R330058	R336949	R333780	R331458
R330058	1	0.6418	0.5064	0.4151
R336949	0.6418	1	0.5631	0.5300
R333780	0.5064	0.5631	1	0.4490
R331458	0.4151	0.5300	0.4490	1

10.3.1 Application of D-Vine Copula to Four-Dimensional Rainfall Variables

Copula Identification for T₁

According to Kendall’s tau correlation coefficient matrix, the proper structure for T₁ is as follows: R330058 − R333780 − R336949 − R331458R330058−R333780−R336949−R331458 (i.e., the bivariate pairs for T₁ are [R330058, R333780]; [R333780, R336949]; [R336949, R331458]). Using the empirical marginals (Weibull plotting position formula), let U₁, U₂, U₃, U₄U1,U2,U3,U4 represent the empirical marginals as follows:

U1=F̂nR330058;U2=F̂nR333780;U3=F̂nR336949;andU4=F̂nR331458.

The D-vine structure for this example is the same as in Figure 10.5. In this case study, we choose Archimedean copulas for dealing with the positive dependence (Gumbel–Hougaard, Clayton, Frank, Joe, and BB1 copulas) as the candidates. Chapter 4 listed the one-parameter Archimedean copulas candidates. Hence we only give the formula for BB1 copula, which is a two-parameter Archimedean copula with the limiting conditions of either the Clayton or Gumbel–Hougaard copula. The BB1 copula (Joe, 1997) can be formulated as follows:

Cuvθ1θ2=1+u−θ1−1θ2+v−θ1−1θ21θ2−1θ1;θ1>0;θ2≥1

(10.6)

The BB1 copula converges to (i) the Gumbel–Hougaard copula if θ₁→0θ1→0; and (ii) the Clayton copula if θ₂ = 1θ2=1.

Figure 10.5 D-vine structure for four-dimensional rainfall variables: (1) R330058, (2) R336949, (3) R333780, and (4) R331458.

In addition, the BB1 copula has both upper- and lower-tail dependence coefficients, as follows:

λL=2−1θ1θ2,λU=2−21θ2

The parameters of T₁ are estimated with the pseudo-MLE through the empirical marginals for all the copula candidates (Table 10.5). Table 10.6 also lists the log-likelihood, AIC, and BIC values with the best-fitted copula highlighted. From Table 10.6, we see that the two-parameter BB1 copula is the best-fitted copula for stations (R330058, R333780, R333780, and R336949), and the Gumbel–Hougaard copula is the best-fitted copula for stations R336949 and R331458.

Table 10.6 Estimation results for copula candidates.

Variables	Copulas	θθ	LL	AIC	BIC
	Gumbel-Hougaard (GH)	2.7782	35.0603	−68.1206	−66.0601
	Clayton (C)	2.9226	33.9990	−65.9979	−63.9375
U₁ v. s. U₂U1v.s.U2	Frank (F)	8.8627	31.7661	−61.5322	−59.4718
	Joe (J)	3.3021	28.8828	−55.7655	−53.7051
	BB1	[1.0203, 1.9788]	39.0411	−74.0821	−69.9612

	Gumbel-Hougaard (GH)	2.3336	26.0645	−50.1290	−48.0685
	Clayton (C)	1.8549	20.2556	−38.5113	−36.4508
U₂ v. s. U₃U2v.s.U3.	Frank (F)	6.8861	22.8811	−43.7622	−41.7017
	Joe (J)	2.8627	23.0147	−44.0294	−41.9689
	BB1	[0.3841,2.0196]	26.7981	−49.5963	−45.4754

	Gumbel-Hougaard (GH)	1.7527	22.9526	−43.9052	−41.8447
	Clayton (C)	1.2274	12.7401	−23.4801	−21.4197
U₃ v. s. U₄U3v.s.U4	Frank (F)	4.9432	14.3614	−26.7228	−24.6623
	Joe (J)	1.9670	10.9197	−19.8394	−17.7790
	BB1	[0.4963,1.4682]	15.1711	−26.3423	−22.2214

Based on the AIC/BIC model selection criteria, we again find that (1) the BB1 copula reaches the lowest AIC/BIC values for pairs (U₁, U₂U1,U2); (2) the BB1 copula is also selected to model the pairs (U2 and U3) since it yields the compariable AIC/BIC and may capture the lower tail dependence, compared with Gumbel–Houggard copula and (3) the Gumbel–Hougaard copula reaches the lowest AIC/BIC for pair (U₃, U₄U3,U4).

Copula Identification for T₂

Using the best-fitted copulas for T₁, Table 10.7 lists the conditional probability computed for T₂.

Table 10.7. Conditional probability needed for T₂.

No.	(1)	(2)	(3)	(4)	No.	(1)	(2)	(3)	(4)
1	0.748	0.290	0.588	0.302	30	0.100	0.544	0.877	0.445
2	0.199	0.578	0.803	0.820	31	0.187	0.611	0.611	0.066
3	0.005	0.551	0.664	0.384	32	0.287	0.483	0.506	0.791
4	0.884	0.664	0.962	0.550	33	0.282	0.591	0.716	0.730
5	0.012	0.558	0.872	0.592	34	0.115	0.411	0.189	0.878
6	0.719	0.441	0.149	0.609	35	0.195	0.363	0.057	0.354
7	0.106	0.698	0.997	0.770	36	0.800	0.345	0.273	0.647
8	0.594	0.275	0.003	0.252	37	0.803	0.476	0.730	0.885
9	0.717	0.380	0.070	0.268	38	0.946	0.684	0.525	0.428
10	0.173	0.325	0.325	0.316	39	0.049	0.315	0.315	0.301
11	0.358	0.256	0.123	0.078	40	0.727	0.638	0.356	0.249
12	0.957	0.433	0.714	0.202	41	0.878	0.537	0.778	0.437
13	0.638	0.388	0.076	0.185	42	0.925	0.448	0.522	0.100
14	0.502	0.303	0.164	0.559	43	0.242	0.531	0.789	0.602
15	0.606	0.335	0.511	0.818	44	0.752	0.671	0.830	0.924
16	0.445	0.455	0.200	0.252	45	0.243	0.419	0.419	0.757
17	0.412	0.403	0.006	0.091	46	0.816	0.497	0.828	0.398
18	0.587	0.504	0.281	0.087	47	0.831	0.396	0.490	0.674
19	0.089	0.462	0.439	0.039	48	0.960	0.571	0.458	0.406
20	0.894	0.564	0.247	0.228	49	0.676	0.354	0.394	0.072
21	0.354	0.517	0.972	0.878	50	0.987	0.371	0.099	0.133
22	0.410	0.658	0.864	0.595	51	0.690	0.691	0.424	0.196
23	0.425	0.631	0.803	0.523	52	0.572	0.678	0.563	0.266
24	0.259	0.517	0.858	0.979	53	0.549	0.618	0.778	0.851
25	0.316	0.604	0.689	0.966	54	0.747	0.624	0.395	0.985
26	0.382	0.426	0.506	0.330	55	0.855	0.517	0.222	0.909
27	0.105	0.651	0.698	0.535	56	0.468	0.644	0.474	0.939
28	0.695	0.469	0.492	0.175	57	0.328	0.490	0.335	0.592
29	0.443	0.584	0.359	0.232	58	0.164	0.598	0.930	0.942

τ((1), (2)) = 0.1446; τ((3), (4)) = 0.2922.τ12=0.1446;τ34=0.2922.

Note: (1) C^BB1(U₁| U₂ = u₂; θ₁₂)CBB1U1U2=u2θ12; (2) C^BB1(U₃| U₂ = u₂; θ₂₃)CBB1U3U2=u2θ23;

(3) C^BB1(U₂| U₃ = u₃; θ₂₃)CBB1U2U3=u3θ23; (4) C^GH(U₄| U₃ = u₃; θ₃₄)CGHU4U3=u3θ34.

From Kendall’s correlation coefficient estimated in Table 10.7, we again have the positive dependence for [U₁|U₂, U₃|U₂]U1U2U3U2 and [U₂|U₃, U₄|U₃]U2U3U4U3. Using all copula candidates for T₁, Table 10.8 lists the results from pseudo-MLE for T₂. Based on AIC and BIC, Frank copula is found as the best fitted copula for T₂ variables as shown in Table 10.8. However the goodness-of-fit study shows that BB1 copula should be applied to model the dependence at T₂ (Table 10.8).

Table 10.8 Results of pseudo-maximum likelihood estimation for T2.

Variables	Copulas	θθ	LL	AIC	BIC
	Gumbel-Hougaard (GH)	1.1059	0.5369	0.9262	2.9867
	Clayton (C)	0.2371	1.0970	−0.1937	1.8667
	Frank (F)	1.3126	1.4286	−0.8571	1.2033
	Joe (J)	1.0796	0.1082	1.7835	3.8439
U₁\|U₂ v. s. U₃\|U₂U1U2v.s.U3U2	BB1	[0.2336, 1.0001]	1.0967	1.8069	5.9278

	Gumbel-Hougaard (GH)	1.2934	3.6637	−5.3274	−3.2670
	Clayton (C)	0.5281	2.9222	−5.9477	−3.8873
U₂\|U₃ v. s. U₄\|U₃U2U3v.s.U4U3	Frank (F)	2.7826	5.8333	−9.6668	−7.6064
	Joe (J)	1.3356	2.1710	−2.3419	−0.2815
	BB1	[0.3112, 1.1461]	4.3801	−4.7604	−0.6396

Copula Identification for T₃

Now we can move on to T₃. Similar to T₂, we first need to compute the conditional copula through the BB1 copula as follows:

FU1≤u1U2=u2U3=u3=∂C13∣2Fu1u2Fu3u2∂Fu3u2

FU4≤u4U2=u2U3=u3=∂C24∣3Fu4u3Fu2u3∂Fu2u3

The computed conditional probability distribution using the selected fitted copulas in T₁ and T₂ is listed in Table 10.9.

Table 10.9. Conditional probability distribution computed for parameter estimation of T₃.

No.	C(U₁\| U₂, U₃)CU1U2U3	C(U₄\| U₂, U₃)CU4U2U3	No.	C(U₁\| U₂, U₃)CU1U2U3	C(U₄\| U₂, U₃)CU4U2U3
1	0.8222	0.2421	30	0.1612	0.2798
2	0.2184	0.7546	31	0.1519	0.0328
3	0.0030	0.3020	32	0.2693	0.8071
4	0.9089	0.3088	33	0.2700	0.6734
5	0.0191	0.4304	34	0.0798	0.9377
6	0.6749	0.7570	35	0.1388	0.6254
7	0.2114	0.3717	36	0.7956	0.7325
8	0.5270	0.7909	37	0.8355	0.8653
9	0.6707	0.5060	38	0.9368	0.3979
10	0.1761	0.3532	39	0.0476	0.3403
11	0.3406	0.1462	40	0.6812	0.2610
12	0.9673	0.1203	41	0.8936	0.3145
13	0.5824	0.3821	42	0.9276	0.0656
14	0.4791	0.7055	43	0.2885	0.4868
15	0.6624	0.8341	44	0.7448	0.8926
16	0.3849	0.3462	45	0.2316	0.7938
17	0.3342	0.5159	46	0.8633	0.2583
18	0.5362	0.0943	47	0.8433	0.6859
19	0.0726	0.0241	48	0.9543	0.3989
20	0.8738	0.2856	49	0.6909	0.0563
21	0.6729	0.6668	50	0.9843	0.2652
22	0.4138	0.4388	51	0.6391	0.1787
23	0.4219	0.3929	52	0.5194	0.2126
24	0.3643	0.9708	53	0.5467	0.8060
25	0.2926	0.9680	54	0.7068	0.9923
26	0.3924	0.2969	55	0.8297	0.9524
27	0.0793	0.4514	56	0.4078	0.9552
28	0.6891	0.1395	57	0.2790	0.6537
29	0.3809	0.2404	58	0.2531	0.8725

Using the conditional probability listed in Table 10.9, Kendall’s correlation coefficient is computed as τ = − 0.06τ=−0.06, and we have negative dependent variables for T₃. Applying the Frank copula, we have θ = − 0.5062θ=−0.5062.

Applying the goodness-of-fit study for the fitted vine copula, the Rosenblatt transform approach is applied. The test results (Table 10.10) further confirmed the selected vine copula may properly study the dependence structure of the four-dimensional rainfall dataset.

Table 10.10. Goodness-of-fitted test for T₁, T₂, and T₃.

	Copula	P	S_nbSnb		Copula	P	S_nbSnb
	BB1 (U₁, U₂U1,U2)	0.59	0.23	T₂	BB1(U1\|U2, U3\|U2)	0.31	0.35
T₁	BB1(U₂, U₃U2,U3)	0.62	0.21		BB1(U2\|U3, U4\|U3)	0.58	0.23
	GH (U₃, U₄U3,U4)	0.41	0.12	T₃	Frank (U₁\|U₂,U₃, U₄\|U₂,U₃)	0.046	0.95

Applying the goodness-of-fit study, we have SnB=0.046,P=0.95. Thus, we have the four-dimensional fitted D-vine copula as BB1-BB1-GH (T₁), BB1-BB1 (T₂) and Frank (T₃) as shown in Figure 10.6.

Figure 10.6 Fitted D-vine copula for four-dimensional rainfall variables.

With the fitted D-vine copula in Figure 10.6, we can simulate the four-dimensional pseudo-rainfall variables (i.e., the marginal CDF of rainfall variables) as shown in Figure 10.7. Here, we will show how to simulate the random variates from the fitted D-vine copula given in Figure 10.6 with a simple example:

1. Generate four independent uniform random variables in [0,1]

W = [0.7582, 0.6289, 0.9611, 0.2743]
W=0.75820.62890.96110.2743.

For the independent random variables generated, we set the following:

u1=W1=0.7582

C(u₂| U₁ = u₁) = 0.6289
Cu2U1=u1=0.6289

C(u₃| U₁ = u₁, U₂ = u₂) = 0.9611;
Cu3U1=u1U2=u2=0.9611;

C(U₄| U₁ = u₁, U₂ = u₂, U₃ = u₃) = 0.2743
CU4U1=u1U2=u2U3=u3=0.2743.
2. Simulate u₂u2 from C(U₂| U₁ = u₁) = 0.6289CU2U1=u1=0.6289.

According to the fitted D-vine copula, we know [U₁, U₂]U1U2 is fitted with the BB1 copula (i.e., Equation (9.1)) with parameters [1.0203,1.9788]. Its conditional copula is then written as follows:

Cu2U1=u1=∂Cu1u2∂u1u1=0.6478=S1θ2−1u1−θ1−1θ2−1u1θ1+1S1θ2+11θ1+1(10.7)

where:S=1u1θ1−1θ2+1u2θ1−1θ2.

Substituting u₁ = 0.7582u1=0.7582 and C(u₂| U₁ = u₁) = 0.6289Cu2U1=u1=0.6289 into Equation (10.7), we can solve for u₂u2 numerically and obtain the following:

u2=0.7755.
3. Simulate u₃u3 from C(u₃| U₁ = 0.7582, U₂ = 0.7755)Cu3U1=0.7582U2=0.7755.

According to the fitted D-vine copula, we know U₂U2 is the one of the center variables, and from the probability density composition discussed in Chapter 5, we have the following:

Cu3U1=u1U2=u2=∂C13∣2C3∣2C1∣2∂C1∣2(10.8)

As seen in Equation (10.8), we may simulate u₃u3 with the following two steps:
1. i. Compute C_3 ∣ 2C3∣2 from C_13 ∣ 2C13∣2. According to Figure 10.6, we know that the BB1 copula with parameter [0.2336, 1.0001] properly models C_13 ∣ 2(C_3 ∣ 2, C_1 ∣ 2)C13∣2C3∣2C1∣2. With this in mind and after computing C_1 ∣ 2C1∣2 using the BB1 conditional copula (i.e., Equation (10.7)), we immediately have the following: C(U₁ ≤ 0.7582| U₂ = 0.7755) = 0.5465CU1≤0.7582U2=0.7755=0.5465. Given {C_3 ∣ 2, C_1 ∣ 2}C3∣2C1∣2 (i.e., one of the bivariate copulas at T₂) again modeled by the BB1 copula, C_3 ∣ 2C3∣2 can then be computed by substituting C_1 ∣ 2 = 0.5465C1∣2=0.5465 as u₁u1 and C_3 ∣ 2C3∣2 as u₂u2, and by equating Equation (10.8) to 0.9383. We can solve for C_3 ∣ 2C3∣2 numerically as C_3 ∣ 2 = 0.9636C3∣2=0.9636.
2. ii. Compute u₃u3 from C_3 ∣ 2C3∣2. From Figure 10.6, {U₂, U₃}U2U3 is also modeled with the BB1 copula; u₃u3 can then be solved for numerically by substituting u₂ = 0.7755u2=0.7755 as u₁u1, and u₃u3 as u₂u2 into Equation (10.7), and by setting the equation equal to C_3 ∣ 2 = 0.9636C3∣2=0.9636. We then have the following:
  
  u3=0.9383.
4. Simulate u₄u4 from C(u₄| U₁ = 0.7582, U₂ = 0.7755, U₃ = 0.6865)Cu4U1=0.7582U2=0.7755U3=0.6865.

From the fitted D-vine copula, we know the conditional copula C(u₄| U₁ = u₁, U₂ = u₂, U₃ = u₃)Cu4U1=u1U2=u2U3=u3 may be written using the probability function decomposition discussed in Chapter 5 as follows:

Cu4U1=u1U2=u2U3=u3=∂C14∣23C1∣23C4∣23∂C1∣23(10.9)

where:

C1∣23=∂C13∣2C1∣2C3∣2∂C3∣2;C4∣23=∂C24∣3C4∣3C2∣3∂C2∣3(10.9a)

We know from Equations (10.9) and (10.9a) that C_14 ∣ 23, C_1 ∣ 23, and C_4 ∣ 23C14∣23,C1∣23,andC4∣23 are modeled by bivariate Frank, BB1, and BB1 copulas, respectively (Figure 10.6). To this end, we can simulate u₄u4 with the steps given in what follows:
1. i. Simulate C_4 ∣ 23C4∣23 using C(u₄| U₁ = 0.7582₁, U₂ = 0.7755, U₃ = 0.9383) = 0.2743Cu4U1=0.75821U2=0.7755U3=0.9383=0.2743.
  
  With the previously simulated u₁, u₂, u₃u1,u2,u3, we first compute the conditional copula C_1 ∣ 23C1∣23 in Equation (10.9a). Applying the corresponding fitted BB1 copulas, we compute the conditional copula as follows: C_1 ∣ 2 = 0.5465, C_3 ∣ 2 = 0.9636, C_1 ∣ 23 = 0.4764C1∣2=0.5465,C3∣2=0.9636,C1∣23=0.4764. The given Frank copula may be applied to model C_14 ∣ 23C14∣23 of T₃, and Equation (10.9) may be rewritten using the conditional Frank copula as follows:
  
  Cu4U1=u1U2=u2U3=u3=e−θC1∣23e−θC4∣23−1e−θC1∣23+C4∣23−e−θC1∣23−e−θC4∣23+e−θ(10.10)
  Substituting θ = − 0.5062, C_1 ∣ 23 = 0.4764, C(u₄| U₁ = u₁, U₂ = u₂, U₃ = u₃) = 0.2743θ=−0.5062,C1∣23=0.4764,Cu4U1=u1U2=u2U3=u3=0.2743 into Equation (10.10), C_4 ∣ 23C4∣23 is solved for numerically as follows: C_4 ∣ 23 = 0.2777C4∣23=0.2777.
2. ii. Simulate C_4 ∣ 3C4∣3 using C(u₄| U₂ = 0.7755, U₃ = 0.9383) = 0.2777Cu4U2=0.7755U3=0.9383=0.2777.
  
  C_4 ∣ 3C4∣3 can be simulated from the conditional copula C_4 ∣ 23C4∣23 through C_24 ∣ 3C24∣3, given as Equation (10.9a). C_24 ∣ 3C24∣3 may be modeled with the bivariate BB1 copula through C_4 ∣ 3, C_2 ∣ 3C4∣3,C2∣3. Applying the BB1 copula to {U₂, U₃U2,U3}, we can easily compute C_2 ∣ 3 = C(U₂ ≤ 0.7755| U₃ = 0.9383) = 0.1736C2∣3=CU2≤0.7755U3=0.9383=0.1736. In model construction (e.g., Figure 10.6), C_24 ∣ 3C24∣3 is also modeled by the BB1 copula. Thus, we can solve for C_4 ∣ 3C4∣3 numerically as C_4 ∣ 3 = 0.1867C4∣3=0.1867.
3. iii. Simulate u₄u4 using C_4 ∣ 3 = 0.1867C4∣3=0.1867.
  
  We know that {U₃, U₄U3,U4} is modeled with the Gumbel–Hougaard copula (Figure 10.6), as shown in Chapter 4; the conditional Gumbel–Hougaard copula can be written as follows:
  
  Cu4U3=u3=−lnu3−1+θ(−lnu3)θ+−lnu4θ−1+1θu3exp−lnu3θ+−lnu4θ(10.11)
Substituting u₃ = 0.9383, C_4 ∣ 3 = 0.1867u3=0.9383,C4∣3=0.1867 into Equation (10.11), we have the following:

u4=0.6865

Figure 10.7 Comparison of simulated random variables with the pseudo-rainfall observations and simulated rank-based Kendall correlation coefficient with sample Kendall correlation coefficient.

Finally, with four independent uniform random variables W = [0.7582, 0.6289, 0.9611, 0.2743]W=0.75820.62890.96110.2743; we successfully simulate the pseudo-rainfall variables from the fitted D-vine copula as follows:

U = [0.7582, 0.7755, 0.9383, 0.6865]

U=0.75820.77550.93830.6865.

Comparison of the simulated copula random variables with the pseudo-rainfall variables (the upper triangle of Figure 10.7) shows that the fitted D-vine copula reasonably preserves the overall dependence. With the use of 200 simulations, the lower triangle of Figure 10.7 compares the Kendall correlation coefficient computed from the simulations with the sample Kendall correlation coefficient computed from the observed four-dimensional rainfall variables. Comparison through the Kendall’s correlation coefficient indicates the following:

1. The sample correlation coefficient is within 50% bound for all free bivariate variates in T₁, i.e.,

(U₁, U₂) : (R330058, R336949), (U₂, U₃) : (R336949, R333780), (U₃, U₄) : (R333780, R331358)
U1U2:R330058R336949,U2U3:R336949R333780,U3U4:R333780R331358;
2. The sample correlation coefficient is also within 50% bound for the bivariate variates through conditioning, i.e., (U₁, U₃) : (R330058, R333780), (U₁, U₄) : (R330058, R331358)U1U3:R330058R333780,U1U4:R330058R331358.
3. The sample correlation coefficient is very close to the 50% bound for the last pair of the bivariate variate through conditioning: (U₂, U₄) : (R336949, R331358)U2U4:R336949R331358.

The preceding comparison ensures the appropriateness of applying the fitted D-vine copula model to investigate the four-dimensional rainfall variables. In addition, with the closeness of rain gauges, it is reasonable to assume that there may exist the tail dependence among the rainfall variables (i.e., there is the concurrent tendency of extreme weather events, e.g., storm events). The possible tail dependence makes the BB1 copula the best choice for a majority of cases. We will provide a detailed discussion in this regard when we compare the fitted vine copula to meta-elliptical and asymmetric Archimedean copulas later in the chapter.

10.3.2 Application of Meta-Elliptical Copula to Four-Dimensional Rainfall Variables

In this section, we will apply the meta-Gaussian and meta-Student t copula to model the four-dimensional rainfall variables. Using the same empirical marginals as those in Section 10.3.1, Table 10.11 lists the parameters (i.e., the correlation matrix for meta-Gaussian copula, and the correlation matrix and degree of freedom for meta-Student t copulas). With the estimated parameters, Figures 10.8 and 10.9 compare the simulated copula random variables with the pseudo-rainfall random variables as well as the simulated Kendall correlation coefficient with the sample Kendall correlation coefficient.

Table 10.11. Parameters estimated for meta-Gaussian and meta-Student t copulas.

Stations	R330058	R336949	R333780	R331358	R330058	R336949	R333780	R331358
Stations	Meta-Gaussian copula				Meta-Student t copula
R330058	1	0.85	0.71	0.61	1	0.87	0.74	0.64
R336949	0.85	1	0.75	0.72	0.87	1	0.78	0.74
R333780	0.71	0.75	1	0.65	0.74	0.78	1	0.68
R331358	0.61	0.72	0.65	1	0.64	0.74	0.68	1
					d.f. ν = 62.16ν=62.16

Figure 10.8 Comparison with the fitted meta-Gaussian copula.

Figure 10.9 Comparison with the fitted meta-Student t copula.

Simulations shown in Figures 10.8 and 10.9 indicate that the overall dependence structure of rainfall variables is very well preserved. In the case of overall dependence structure, the meta-Gaussian and meta-Student t copula visually perform better than the previously fitted D-vine copula, e.g., all sample Kendall correlation coefficients are within 50% bounds of the simulated Kendall correlation coefficients (200 simulations). Furthermore, the goodness-of-fit studies using the Rosenblatt transform yield the following:

Meta-Gaussian copula: SnB = 0.0245, P=SnB=0.0245,P=0.964.

Meta-Student t copula: SnB = 0.094, P = 0.785SnB=0.094,P=0.785.

10.3.3 Application of the Asymmetric Archimedean Copula to Four-Dimensional Rainfall Variables

In this section, we will evaluate the performance of asymmetric Archimedean copulas. Here we will choose the following two types of asymmetric structures (Figure 10.10). In Figure 10.10, U₁, U₂, U₃, and U₄ represent R330058, R336949, R333780, and R331358, respectively, as that applied for the D-vine and meta-elliptical copulas.

Figure 10.10 Asymmetric Archimedean copula structure.

As seen in Section 10.3.1, the BB1 and Gumbel–Hougaard copulas are found to properly model the bivariate random variables in T₁. From Table 10.6, we see that the Gumbel–Hougaard copula comes to the second place to model (R330058, R336949), (R336949, R333780)R330058R336949,R336949R333780, and the BB1 copula comes as the second place to model (R333780, R331358)R333780,R331358). Given the possible difficulties to assess that the parameter of the higher level are lower than the parameter in the lower level (i.e., the parameter of C2 should be less than that of C1) for the two-parameter copulas, we will apply the second-best Gumbel–Hougaard copula for analysis. Applying the Gumbel–Hougaard copula and letting θ₁, θ₂, θ₃θ1,θ2,θ3 represent the parameter for C₁, C₂, C₃C1,C2,C3 respectively, the nested asymmetric Gumbel–Hougaard copula for the four-dimensional case can be written as follows:

Cu1u2u3u4θ1θ2θ3=C3GHC2GHC1GHu1u2θ1u3θ2u4θ3

(10.12)

where θ₁ ≥ θ₂ ≥ θ₃θ1≥θ2≥θ3

Table 10.12 lists the parameters as well as the Kendall correlation coefficient estimated for each level. The parameters listed in Table 10.12 fulfills the conditions for the nested asymmetric Archimedean copula (i.e., given as part of Equation (10.12)). Applying the goodness-of-fit study through the Rosenblatt transform, we obtain SnB = 0.048, P = 0.323.SnB=0.048,P=0.323. The goodness of fit results indicate the appropriateness of the fitted four-dimensional asymmetric Gumbel–Hougaard copula.

Table 10.12. Results from the nested asymmetric Gumbel–Hougaard copula.

Parameters	C1	C2	C3
θθ	2.78	2.22	1.65
ττ	0.64	0.56	0.51

Furthermore, according to the discussion in Chapter 5, we may conclude that (i) pairs (R330058, R333780) and (R336949, R333780) should follow the Gumbel–Houggard copula with parameter θ₂θ2; (ii) and pairs (R330058, R331358), (R336949, R331358), and (R333780, R331358) should all follow the Gumbel–Hougaard copula with parameter θ₃θ3. Figure 10.11 compares the asymmetric Gumbel–Hougaard copula and bivariate Gumbel–Hougaard copula with the pseudo-observations. The scatter plots in Figure 10.11 show that the overall positive dependence may be captured; however, the box plots for the Kendall correlation coefficient indicate that (i) the sample Kendall correlation coefficient obviously falls out of the upper 50% bound for (R336949, R331358), and (R333780, R331358); (ii) the sample Kendall correlation coefficient is slightly higher than the upper 50% bound for (R336949, R333780); and (iii) the sample Kendall correlation coefficient is slightly higher than the 75% bound for (R336949, R331358).

Figure 10.11 Comparison of the asymmetric Gumbel–Hougaard copula with the pseudo-rainfall variables.

10.3.4 Comparison of D-vine, Meta-Elliptical, and Asymmetric Archimedean Copulas

In this section, we will compare the performances of fitted D-vine, meta-elliptical, and asymmetric Archimedean copulas for modeling the dependence in higher dimensions (i.e., four dimensions in this case study). Given all three types of fitted copulas passing the goodness-of-fit study, we will focus on the performance, freedom, and complexity of copula functions.

Flexibility and Complexity of Copula Functions

First, as discussed in Chapter 5, the vine copula is constructed, based on the probability density function decomposition, such that only bivariate copulas are considered as the building blocks either unconditionally (base level T₁) or conditionally (upper levels). The bivariate copulas (i.e., the building blocks) are allowed for free specification (i.e., the copulas do not need to belong to the same family at all). Additionally, there are many choices for model construction. For example, in our four-dimension example illustrated here, we may be able to build 24 different D-vine copula structures through different pairing schemes.

Second, the meta-elliptical copula is only dependent on the correlation matrix for the meta-Gaussian copula, and the correlation matrix and degree of freedom for meta-Student t copula. In addition, its parameter estimation is easier than that of a vine copula.

Third, there are constraints on the asymmetric Archimedean copula. In addition, there are implications for the dependence for indirectly connected bivariate random variates (as discussed in the previous section).

Overall, the vine copula is most complex with the most flexibility of model construction. The meta-elliptical copula may always be able to capture the overall dependence. The asymmetric Archimedean copula has the least flexibility for model construction, and the dependence structure may not be properly captured due the theoretical constraints of the asymmetric copula function.

Comparison of Copula Performances

Applying Equation (5.61) in Chapter 5 to the fitted D-vine copula in Section 10.3.1, we will be able to compute the joint CDF for the four-dimensional rainfall variables. Similar to application of Equation (7.32) and Equation (7.46) in Chapter 7 and Equation (10.12), we can compute the joint CDF fitted by the meta-Gaussian, meta-Student t and asymmetric Gumbel–Hougaard copulas for the four-dimensional rainfall variables. Figure 10.12 compares the fitted parametric four-dimensional copula function with the nonparametric empirical copulas. Table 10.13 lists the RMSE computed between the parametric and empirical copulas. Figure 10.12 shows that (1) there is minimal visual difference between the performance of meta-Gaussian and that of meta-Student T copulas; (2) there is visual difference between the performance of fitted D-vine copula and asymmetric GH copula; and (3) the fitted D-vine copula may underestimate the JCDF for higher orders (>35) more than the asymmetric GH copula. The RMSE results listed in Table 10.13 further confirm the findings visually seen in Figure 10.12.

Figure 10.12 Comparison of vine, meta-elliptical, and asymmetric Archimedean copulas with empirical copula.

Table 10.13. RMSE computed between parametric and empirical copulas.

Copula	D-vine	Asymmetric GH	Meta-Gaussian	Meta-Student t
RSME	0.040	0.032	0.029	0.027

Comparing all three types of the copulas, one may directly apply the meta-elliptical copula for higher dimensions as the following:

1. The variance–covariance structure may be very well preserved (Figures 10.8 and 10.9).
2. A meta-elliptical copula is easy to construct, compared to both vine and asymmetric Archimedean copulas.
3. A meta-elliptical copula yields the overall best performance.

10.4 Summary

In this chapter, we discussed the application of copula to (1) the partial duration rainfall sequences to construct the DDF curve, and (2) the spatial dependence of precipitation measured from multiple rain gauge stations (i.e., four stations are selected in the case study).

The study shows the following:

i. Even with the differences between the NOAA and copula-based DDF curves constructed for the partial duration time series, the copula-based method may be considered as a rational alternative for rainfall DDF (or IDF) construction with simpler and faster rainfall separation (events regardless of the length of rainfall duration) compared to that of NOAA analysis (rainfall duration based directly).
ii. Applying vine, meta-elliptical, and asymmetric copulas to model the spatial dependence, we have found that the vine copula is most complex and most flexible at the same time. In regard to the copula performance, one may directly apply the meta-elliptical copula, given the simplicity of the parameter estimation and the capture of pairwise dependence structure for all correlated random variables.

References

Abdul Rauf, U. F. A. and Zeephongsekul, P. (2014). Copula based analysis of rainfall severity and duration: a case study. Theoretical and Applied Climatology, 115(1–2), 153–166.

Bacchi, B., Becciu, G., and Kottegoda, N. T. (1994). Bivariate exponential model applied to intensities and durations of extreme rainfall. Journal of Hydrology, 155, 225–236.

Cantet, P. and Arnaud, P. (2014). Extreme rainfall analysis by a stochastic model: impact of the copula choice on the sub-daily rainfall generation. Stochastic Environmental Research and Risk Assessment, 28, 1479–1492.

Cong, R.-G. and Brady, M. (2012). The interdependence between rainfall and temperature: copula analysis. Scientific World Journal, 405675, doi:10.1100/2012/405675.

Cordova, J. R. and Rodriguez-Iturbe, I. (1985). On probabilistic structure of storm surface runoff. Water Resources Research, 21(5), 755–763.

Favre, A.-E., El Adlouni, S., Perreault, L, Thiemonge, N., and Bobee, B. (2004). Multivariate hydrological frequency analysis using copulas. Water Resources Research, 40, W01101.

Goel, N. K., Kurothe, R. S., Mathur, B. S., and Vogel, R.M. (2000). A derived flood frequency distribution for correlated rainfall intensity and duration. Journal of Hydrology, 228, 56–67.

Grimaldi, S., Serinaldi, F., Napolitano, F., and Ubertine, L. (2005). A 3-copula function application for design hyetograph analysis. IAHS Publication, 293, 1–9.

Hao, Z. and Singh, V. P. (2013). Entropy-based method for extreme rainfall analysis in Texas. Journal of Geophysical Research, 118, 263–273, doi:10.1029/2011JD017394.

Hashino, M. (1985). Formulation of the joint return period of two hydrologic variates associated with a poisson process. Journal of Hydroscience and Hydraulic Engineering, 3(2), 73–84.

Joe, H. (1997). Multivariate Models and Multivariate Dependence Concepts. Chapman & Hall/CRC, New York.

Kao, S.-C. and Govindaraju, R. S. (2007). A bivariate frequency analysis of extreme rainfall with implications for design. Water Resources Research, 112, D13119.

Kao, S.-C. and Govindaraju, R. S. (2008). Tivariate statistical analysis of extreme rainfall events via Plackett family of copulas. Water Resources Research, 44, W02415.

Khedun, C. P., Mishra, A. K., Singh, V. P., and Giardino, J. R. (2014). A copula-based precipitation model: investigating the interdecadal modulation of ENSO’s impacts on monthly precipitation. Water Resources Research, 50, 1–20, doi:10.1002/2013WR013763.

Kurothe, R. S., Goel, N. K., and Mathur, B. S. (1997). Derived flood frequency distribution of negatively correlated rainfall intensity and duration. Water Resources Research, 33(9), 2103–2107.

Moazami, S., Golian, S., Kavianpour, M. R., and Hong, Y. (2014). Uncertainty analysis of bias from satellite rainfall estimates using copula method. Atmospheric Research, 137, 145–166

Singh, K. and Singh, V. P. (1991). Derivation of bivariate probability density functions with exponential marginals. Stochastic Hydrology and Hydraulics, 6(1), 47–54.

Vernieuwe, H., Vandenberghe, S., De Bates, B., and Verhoest, N. E. C. (2015). A continuous rainfall model based on vine copulas. Hydrology and Earth System Sciences, 19, 2685–2699. doi:10.5194/hess-19-2685-2015.

Yue, S. (2000a). Joint probability distribution of annual maximum storm peaks and amounts as represented by daily rainfalls. Hydroscience Journal, 45(2), 315–326.

Yue, S. (2000b). The Gumbel logistic model for representing a multivariate storm event. Advances in Water Resources, 24(2), 179–185.

Yue, S. (2000c). The Gumbel mixed model applied to storm frequency analysis. Water Resources Management. 14(5), 377–389.

Zhang, L. and Singh, V. P. (2007a). Bivariate rainfall frequency analysis using Archimedean copulas. Journal of Hydrology, 332, 93–109.

Zhang, L. and Singh, V. P. (2007b). IDF curves using Frank Archimedean copula. Journal of Hydrologic Engineering, 12(6), 651–662.

Zhang, L. and Singh, V. P. (2007c). Gumbel–Houggard copula for trivariate rainfall frequency analysis. Journal of Hydrologic Engineering 12(4), 409–419.

Zhang, Q., Li, J., and Singh, V. P. (2012). Application of Archimedean copulas in the analysis of the precipitation extremes: effects of precipitation change. Theoretical and Applied Climatology, 107(1–2), 255–264.

Zhang, Q., Li, J., Singh, V. P., and Xu, C.-Y. (2013). Copula-based spatio-temporal patterns of precipitation extremes in China. International Journal of Climatology, 33(5), 1140–1152.

Tags: Copulas and their Applications in Water Resources Engineering

Oct 12, 2020 | Posted by drzezo in Water and Sewage | Comments Off

Civil Engineer Key

Fastest Civil Engineer Engine

10 – Rainfall Frequency Analysis

Abstract