1 – Introduction | Civil Engineer Key

Abstract

This chapter briefly reviews the development of the copula theory and its applications in the field of water resources engineering (flood, drought, rainfall, groundwater, etc.). It points out the need for applying the copula theory in hydrology and engineering. The chapter is concluded with an outline of the structure of the book.

1 Introduction

1.1 Need for Copulas

Complex hydrological processes, such as floods, droughts, winds, rainstorms, and snowfall, are characterized by more than one correlated random variable. Hydrologic events emanating from these processes are multivariate and their treatment requires multivariate analysis. Yue (1999, 2000a, 2000b, 2000c), Yue et al. (2001), and Yue and Rasmussen (2002) reviewed some applications of multivariate hydrological analyses using traditional frequency analysis methods with multivariate distributions.

Multivariate frequency distributions have usually been derived using one of three fundamental assumptions (Zhang and Singh, 2006): (1) the random variables each have the same type of marginal probability distribution; (2) the variables are assumed to have a joint normal distribution or are transformed to have a joint normal distribution; or (3) the variables are assumed independent – a trivial case. In reality, the correlated random variables are generally dependent, do not follow the normal distribution, and/or do not have the same type of marginal distributions. In general, multivariate hydrological analyses are mathematically complicated, and the resulting joint distributions may be valid only in a limited solution space.

When deriving multivariate distributions, it has been demonstrated in the last two decades that the aforementioned difficulties can be overcome with the use of copulas because: (1) they separate the dependence function from the marginal distributions of random variables; (2) the dependence function represented by the copula function is the cumulative joint distribution of correlated random variables; and (3) the mutual information (bivariate/multivariate) may be expressed as the negative copula entropy that avoids the complexity of evaluating the uncertainty with the use of entropy theory (information theory). In what follows, we briefly summarize copulas and their applications.

1.2 Introduction of Copulas and Their Application

Copula was first introduced by Sklar (1959). Later on, Joe (1997) and Nelsen (2006) further discussed the dependence structure of multivariate random variables using the copula theory. The copula theory was first developed in the fields of statistics and finance (more specifically econometrics). In this section, we will first briefly introduce the history of development of copulas, followed by a brief introduction of copula properties, parameter estimation, and applications to the field of water resources engineering.

1.2.1 Development and Applications of Copulas in Statistics and Finance

Copula theory has been developed and applied in the fields of statistics and finance. Ali et al. (1978) proposed a bivariate distribution family, i.e., the bivariate logistic distribution by considering the survival odds ratio. They also studied the properties of the bivariate distribution. Now it is named the Ali–Mikhail–Haq (AMH) copula family. It is worth noting that this copula family may not be applicable, unless Kendall’s tau rank correlation coefficient falls in the range of (–1/3 to 1/3).

Cook and Johnson (1981) proposed a simple bivariate distribution family to represent nonelliptical symmetric bivariate random variables. The proposed copula, however, may only be applied to the positively correlated random variables. They also proved that multivariate Pareto, Burr, and logistic distributions were special cases, and that copula is now named the Cook–Johnson (Clayton) Archimedean copula family.

Genest and McKay (1986) described bivariate distributions with uniform marginals on a unit interval. They discussed how bivariate distributions (copula) may be applied for singular components and the geometric interpretation of Kendall’s tau. Genest (1987) studied the Frank family of bivariate distributions and concluded that it was appropriate to apply the Frank family to construct the bivariate distribution with any given marginals and cover all possible dependence structures. He then introduced three nonparametric estimators and one parametric estimator, i.e., the maximum likelihood estimation (MLE) method. Genest and Rivest (1993) studied the Archimedean one-parameter copula. They applied Kendall’s tau for parameter estimation and found that Kendall’s tau may also be applied for selecting the appropriate copula for certain multivariate random variables, and analyzed uranium exploration data to explain how to apply the estimation procedure.

Genest et al. (1995) investigated the properties of another semi-parametric estimation method to estimate copula parameters. This semi-parametric estimation method can be considered as a pseudo-likelihood method that is found to be consistent and asymptotically normal. The performance of the pseudo-likelihood method was investigated by analyzing the bivariate Clayton (Cook–Johnson) copula. Later, Caperaa et al. (1997) proposed a new nonparametric method and examined its asymptotic properties and small sample behavior compared to the estimation method through Kendall’s tau statistic and maximum likelihood method. They found that the proposed method was strongly convergent and asymptotically unbiased.

Genest and Boies (2003) discussed the Kendall plot as a measure of dependence. Similar to chi-plot, the Kendall plot is invariant with respect to the monotone transformation of marginal distributions. They also found that the Kendall plot is easier to interpret than the chi-plot, which may also be extended to multivariate analysis (dimension ≥ 3). Genest et al. (2006, 2007a) investigated the formal goodness-of-fit statistical tests for copulas. Chakak and Koehler (1995) presented a procedure to construct families of multivariate distributions through specified univariate and bivariate margins. Their procedure constructs multivariate distributions through conditional distributions.

Zheng and Klein (1995) proposed a copula-graphic estimator, which is a maximum likelihood estimator. The copula-graphic estimator was applied for the estimation of marginal distributions from the given copula for survival analysis. Simulation was performed using the Monte Carlo method, and the robustness of the method showed that the assumption of completely specifying the copula allowed for estimating the complete joint survival function based only on the competing risk data.

Quesada-Molina and Rodriguez-Lallena (1995a, b) investigated bivariate copulas with quadratic and cubic sections, which were derived from simple univariate real-valued functions on the interval [0, 1]. They applied various positive dependence structures (i.e., quadrant dependence and total positivity), measures of association (i.e., Kendall’s τ and Spearman’s ρ), stochastic ordering, and various notions of symmetry, which were shown to be equivalent to certain simple properties of univariate functions used for constructing bivariate copulas. They applied several examples to illustrate how these copulas can be constructed.

Müller and Scarsini (2001) considered two random vectors X and Y with the component of X dominated in the convex order by the corresponding components of Y. They found that the positive linear combination of the components of X dominated in the convex order by the same positive linear combination of the components of Y had the properties as the two random vectors having the common copula and conditionally increasing.

Frees and Valdez (1997) applied copulas, i.e., the Archimedean copula in an actuarial study, and estimated their parameters by both nonparametric and parametric methods. It was concluded that the Archimedean copula could be used to represent the bivariate distribution in the actuarial study fairly well.

Sancetta and Satchell (2001) analyzed financial multivariate data whose marginals were not normally distributed. Based on the nice Bernstein properties, they applied the Bernstein polynomial approximation to copulas and then investigated the multivariate convergence properties. The portfolio data were applied to investigate statistical properties and applications of Bernstein copulas. Chen and Fan (2002) investigated the issue related to the density forecast by applying a copula. They proposed a parametric test for the correct density forecasts by nesting a series of independently identically distributed random variables from stationary Markov processes. By applying the copula, they found that this test exhibited a large variety of marginal properties. Coupling the same marginals with different copula functions, they found that the test again exhibited numerous dependence properties.

Fang et al. (2002) investigated the joint probability density function of continuous random variables with given marginals by analyzing elliptically contoured distributions, e.g., normal distribution. They named this joint density function as meta-elliptical distribution. The analytical formulation, conditional distribution, and dependence properties of this meta-elliptical density function were discussed. They found that meta-elliptical joint distribution held the same Kendall tau as did the meta-Gaussian joint distribution belonging to the meta-elliptical joint distribution. Brakekers and Veraverbeke (2005) extended the estimator proposed by Rivest and Wells (2001) to the fixed design regression application. In survival analysis, the variables were generally assumed independent, which may be invalid in certain practical applications.

1.2.2 Construction and Parameter Estimation of Copulas

With the development of copula theories in statistics, Nelsen (2006) summarized the four most efficient methods to construct the copulas: (1) inversion method, (2) geometric method, (3) algebraic method, and (4) with specified properties. A detailed discussion of the construction of copulas and their properties will be provided in Chapter 3.

For any given copulas, their parameters may be estimated non-parametrically, parametrically, or semi-parametrically. The nonparametric method estimates the parameters with the rank correlation coefficient, i.e., Kendall’s τ or Spearman’s ρ. This method yields the analytical solution if there is a closed-form solution between rank correlation coefficient and copula parameters (e.g., certain Archimedean copulas that will be discussed in Chapter 4).

The copula parameters may be estimated parametrically with the use of one of the following three methods:

Full MLE, by which the parameters of marginal distributions and copulas are estimated simultaneously.
Two-stage MLE, by which the parameters of marginal distributions and the parameters of copula function are estimated separately using MLE. In this case, the fitted parametric marginal distributions will be applied to estimate the copula parameters through MLE.
The semi-parametric method (also called pseudo-MLE: PMLE), which applies the empirical distribution (computed using probability plotting-position formula or kernel density) to estimate the copula parameters using MLE. Unlike the parametric approach, the semi-parametric method is marginal free.

Details of the estimation methods will be discussed in Chapter 3 and the following chapters.

To assess the goodness-of-fit of the fitted or proposed copula functions, Genest and Boies (2003), Genest et al. (2006), and Genest et al. (2007a) proposed the graphical and numerical assessment tools. These goodness-of-fit measures will be further introduced and applied in the chapters that follow.

1.2.3 Application of Copulas in Water Resources Engineering

With the theoretical development of copula theory and its advancement in statistics and econometrics, copulas have been adopted and applied in the fields of hydrology, water resources, and environmental engineering. These applications are briefly reviewed in the following section.

Copula Applications in Flood Frequency Analysis

Salvadori and De Michele (2004) provided a general theoretical framework exploiting copulas to determine return periods of bivariate hydrological events. They concluded the following: (1) copula may greatly simply the calculations of return period and may even yield an analytical solution; (2) copula may be associated with the return period of specific events; (3) with the use of copula, one may define sub-, super-, and critical events as well as those of primary and secondary return periods; and (4) the copula approach may be easily generalized to multivariate cases. The proposed methodology was further illustrated using flood peak and flood volume in a river basin in southern Taiwan, the spillway design flood of an existing Italian dam, and the annual maximum peak flow at Chute-des-Passes. Using flood variables (i.e., peak discharge, flood volume, and flood duration) observed at Kanawa River as an example, Grimaldi and Serinaldi (2006a) showed that (1) the flood variables were correlated; and (2) the dependence may not be symmetric among the flood variables, depending on the threshold used to identify the flood event. Employing the asymmetric Frank copula, the symmetric Frank copula, and the logistic Gumbel distribution through case studies, they presented the following: (1) the possible improvement obtained using the asymmetric copula and (2) the advantages in using the asymmetric copula.

Zhang and Singh (2006) applied the copula method to derive bivariate distributions of flood peak and volume, and flood volume and duration, such that the mariginals may follow different probability distributions. The conditional return periods for hydrologic design were tested using flood data from Amite River at Denham Springs, Louisiana, and the Ashuapmushuan River at Saguenay, Quebec, Canada. Comparing the derived distributions with the Gumbel mixed distribution and the bivariate Box–Cox transformed normal distribution, the copula-based distributions were found to result in the best agreement with plotting position-based frequency estimates. Genest et al. (2007b) presented how meta-elliptical copulas could be used to model the dependence structure of random vectors when observed differences between their bivariate margins precluded the use of exchangeable copula families, e.g., the Archimedean copula family. A case of peak, volume, and duration of the annual spring flood for the Romaine River was employed to illustrate rank-based estimation and goodness-of-fit techniques for this broad extension of the multivariate normal distribution. Analysis of annual spring flood for the Romaine River suggested that in view of the short length of the series, any of the eight meta-elliptical copula models considered in their studies could be used for prediction purposes. Only with additional evidence could one hope to distinguish between these dependence structures.

Simonovic and Karmakar (2007) focused on the selection of marginal distribution functions for flood characteristics by parametric and nonparametric estimation procedures, and demonstrated how the concept of copula may be used for establishing a joint distribution function with mixed marginal distributions for 70 years of streamflow data of Red River at Grand Forks in North Dakota, United States. Zhang and Singh (2007b) employed the Gumbel–Hougaard copula to model trivariate distributions of flood peak, volume, and duration, and then obtained conditional return periods. The derived distributions were tested using flood data from the Amite River basin in Louisiana. A major advantage of the copula method is that marginal distributions of individual variables can be of any form and the variables can be correlated.

Grimaldi and Serinaldi (2006a) described the fully nested (asymmetric) Archimedean copula properties and the inference procedure, and applied the copulas to multivariate flood frequency analysis of the Kanawha River (Kanawha Falls, West Virginia, drainage area 21,681 km²) recorded from 1877 to 2003, and multivariate sea wave frequency analysis of Rete Ondametrica Nazionale (RON) network off the La Spezia (Liguria region, Italy). They found the following: (1) the inference procedure via copulas was quite easy to perform; and (2) asymmetric Archimedean copulas were useful to describe trivariate structures of dependence of nonexchangeable variables with different mutual degrees of correlation fulfilling the conditions described in Section 5.2.1; and finally, (3) comparison between observed and synthetic samples generated by estimated trivariate distributions confirmed the satisfactory performance of the Chen–Fan–Patton (CFP) test in order to choose the best-fitting copula. But asymmetric Archimedean copulas were not able to describe all mutually different structures of dependence. In addition, since the CFP test is based on Rosenblatt’s transformation, its application becomes difficult when the number of variables increases. Consequently, further studies are needed to find both families of copulas that are capable of describing more complex structures of dependence and goodness-of-fit tests suitable for application to every copula class and high dimensions.

Wang et al. (2009) used a copula-based flood frequency (COFF) approach to estimate the risk of floods at confluence points. The four often-used Archimedean copulas (Ali–Mikhail-Haq, Clayton, Frank, and Gumbel–Hougaard) were applied in a river basin for the joint probability estimation. The Frank copula and Gumbel–Hougaard copula performed the best for the discharge data collected at two United States Geological Survey (USGS) gauge stations located on the Des Moines River at Fort Dodge, Iowa (USGS 05480500; Station A) and the Boone River near Webster City, Iowa (USGS 05471000; Station B), upstream of Des Moines River basin near Stratford, Iowa. It was shown that the copula method for specifying the multivariate distribution function was powerful, because it avoided the requirement that the marginal distributions be of the same type, which is assumed in most studies of empirical multivariate distributions. They also explained that it avoided the complex formulas that arise for many multivariate distribution functions. Zhang and Singh (2014) studied the trivariate flood frequency analysis by allowing different lengths of the records for maximum daily discharge at different locations.

Copula Application to Precipitation and Storm Characteristics Analysis

Salvadori and De Michele (2006) presented a statistical procedure to estimate probability distributions of storm characteristics. They discussed a method to describe the temporal dynamics of rainfall via a reward alternating renewal process that describes wet and dry phases of storms. The dependence among the three variables of interest (I for average rainfall intensity, W for the wet phase, and D for the dry one) was given via a Frank 3-copula. Based on real data collected by the Italian Sea Wave Measurement Network, De Michele et al. (2007) focused on how copulas can be used for the multidimensional frequency analysis of sea storm significant wave height (H), storm duration (D), storm direction (A), and storm interarrival time (I) (i.e., the calm period separating two successive storms). These included the following analyses:

The construction of a bivariate model for the pair (H, D). In turn, this yielded the statistics of the sea storm magnitude M.
Calculation of the return period of multivariate events. This gives the possibility to calculate the probability of occurrence of supercritical events and yielded an estimate of the minimum energetic content of sea storms having an assigned (multivariate) return period.
Construction of a trivariate model for a triplet (H, D, A). This provided useful indications about the relation between sea storm magnitude and direction.
Extension to storm interarrival duration I. This yielded a trivariate model for the triple (D, I, A) that cast new light on the relation between sea storm timing and direction.
The construction of a global model for the vector (H, D, I, A). The overall structure was that of a reward alternating renewal process, whose dynamics develops along a random direction. In turn, this gave the possibility to simulate a sequence of sea storm events, accounting for all the variables of interest and their mutual relations.

These statistical analyses are very important when dealing with coastal dynamics, marine structure reliability, or the planning of operations at sea.

Zhang and Singh (2007a) derived trivariate rainfall frequency distributions using the Gumbel–Hougaard copula, which does not assume the rainfall variables to be independent or normal or have the same type of marginal distributions. The trivariate distribution was then employed to determine joint conditional return periods and was tested using rainfall data from the Amite River basin in Louisiana. Zhang and Singh (2007c) derived bivariate rainfall frequency distributions using the copula method in which four Archimedean copulas (Gumbel–Hougaard, Ali–Mikhail–Haq, Frank, and Cook–Johnson) were examined and compared. Results indicated that the advantage of the copula method is that no assumption is needed for the rainfall variables to be independent or normal or have the same type of marginal distributions. They also used the aforementioned Archimedean copulas to determine joint and conditional return periods, and tested using rainfall data from the Amite River basin in Louisiana, United States. Salvadori and De Michele (2007) summarized a general theoretical framework for studying the return period of hydrological events and presented a trivariate Frank copula model for the temporal structure of the sequence of storms at the Scoffera station, located in the Bisagno River basin (Thyrrhenian Liguria, northwestern Italy). The model includes, simplifies, and generalizes many of the approaches already present in the literature. They also gave an explicit derivation of the storm volume statistics for any suitable copula and marginals and a copula-based procedure for estimating the probability law of antecedent moisture conditions. Results indicated that the copula may have important applications in many fields of water resources and hydrologic systems, as well as in several geophysical areas.

Using three different samples of extreme rainfall criteria, including annual maximum volume (AMV), annual maximum peak intensity (AMI), and annual maximum cumulative probability (AMP), Kao and Govindaraju (2007) characterized extreme rainfall events using hourly precipitation data from Indiana, United States. Results of their study have implications for current hydrologic design in that they provided better estimates of design rainfall. Gebremichael and Krajewski (2007) explored the use of copulas to construct the joint distribution between the sampling error and the corresponding rainfall rate. Taking 15-minute radar-rainfall data for the Mississippi River basin in the central United States as an example, the approach (1) estimated the marginal distribution functions in a parametric way; (2) used these with a number of copula functions in search of the one most appropriate; (3) used the maximum likelihood to estimate the parameters of copulas; and (4) selected the best-fitted parametric copula function as the one that gave the largest likelihood. Results showed that the approach had important implications for the interpretation and propagation of remote sensing precipitation uncertainties.

Based on a non-Archimedean Plackett copula family derived using the theory of constant cross-product ratio, Kao and Govindaraju (2008) showed that the Plackett family not only performed well at the bivariate level, but also allowed trivariate stochastic analysis where the lower-level dependencies between variables can be fully preserved while allowing for specificity at the trivariate level as well. The authors proposed a numerical method to estimate the feasible range of Plackett parameters. The trivariate Plackett family of copulas was then applied to study a total of 53 hourly rain gauges from the Hourly Precipitation Database (TD 3240) of the National Climate Data Center in Indiana. Results of this study suggested that while the constant cross-product ratio theory was conventionally applied to discrete type random variables, it was also applicable to continuous random variables, and that it provided further flexibility for multivariate stochastic analyses of rainfall.

Evin and Favre (2008) proposed a new stochastic point rainfall model (Neyman–Scott cluster process) considering the dependence between cell depth and duration using cubic copula, and explored the properties of this class of copulas and suggested several families of this kind attaining a large range of dependence. They derived first-, second-, and third-order moments of the modified Neyman–Scott rectangular pulses model. Hourly rainfall data from Belgium and America were employed to fit the model by these theoretical moments and obtained successful results for two rainfall series with different climates. Generating long series of synthetic rainfall and the observed rainfall data and under specific cubic families and exponential margins, the model fitting can be improved. Results also indicated that the independent Pareto distribution for cell intensity yielded interesting results, and both hourly and daily annual maxima were adequately reproduced by most of the models. Vandenbreghe et al. (2011) investigated the bivariate frequency of storms using the copula method.

Copula Application to Drought Characteristics Analysis

Shiau (2006) used the run theory to abstract the paired drought duration and severity data from observed drought events in Wushantou (Taiwan), which were defined as the Standardized Precipitation Index (SPI) continuously below 0. The exponential and gamma distributions were then used to model the drought duration and severity, respectively. Several two dimensional copulas, such as Ali–Mikhail–Haq, Clayton, Frank, Galambos, Gumbel–Hougaard, and Plackett copulas, were employed to construct the dependence structure for drought duration and severity, and the joint drought duration and severity distribution. A method of inference function for margins (IFM method), a two-step procedure, was employed to estimate the copula parameters. The Galambos copula (belonging to extreme value family) fitted the observed drought data best for the Wushantou case under consideration. The bivariate probabilistic properties of droughts, such as joint probabilities and bivariate return periods, were also investigated to demonstrate comprehensive drought assessments. Shiau (2006) showed that copulas were easily applied to construct the dependence structure of the bivariate correlated random variables that were often met in hydrology.

Dupuis (2007) discussed the bivariate modeling of extreme tails of correlated hydrological random variables and applied the copula approach to model the dependence structure independently of marginal distributions. Dupuis also applied results from the classical extreme value theory to choose marginal distributions for excesses of high thresholds. Using six copula families (Gumbel, Frank, Normal, Student t, Clayton, and associated Clayton), the author discussed pertinent copula properties and examined the effects of model misspecification and the impact of the chosen estimation method, targeting the estimated quantities frequently used in hydrology. Based on a simulation study, Dupuis showed not only the dangers of improper copula selection but also the possible benefits of using a bivariate approach to estimate univariate quantities. Finally, the author applied copulas to study low-flow events and analyzed two Canadian hydrometric datasets.

Using monthly medians of streamflow of the Yellow River in China as the truncation levels, Shiau et al. (2007) defined hydrological droughts to obtain drought duration and drought severity. Drought duration and drought severity were fitted by the mixture of exponential and gamma distributions. The observed drought duration was highly correlated with the observed drought severity. The Clayton copula was used to construct the bivariate drought distribution from the predetermined marginal distributions of drought duration and drought severity. Results showed that the most severe drought of the Yellow River occurring during the period 1919–2002 was the 1930–1933 drought with the drought duration of 36 months and drought severity of 5264.8 m³ s⁻¹. The return period for this drought event was 105 years. The 1997–1998 drought had a return period of 4.4 years. It suggested that the dramatically reduced streamflow in the downstream Yellow River in 1997 deteriorated due to other factors, such as human activities.

Wong et al. (2007) employed the trivariate Gaussian copula and the Gumbel copula to fit drought data. Results showed that the drought data were best described by the Gumbel copula and three-parameter Weibull marginal distribution. Song and Singh (2009) modeled the joint probability distribution of periodic hydrologic data using meta-elliptical copulas. Monthly precipitation data from a gauging station (410120) in Texas, United States, were used to illustrate parameter estimation and goodness-of-fit for univariate drought distributions using the chi-square test, Kolmogorov–Smirnov test, Cramér–von Mises statistic, Anderson–Darling statistic, modified weighted Watson statistic, and Liao and Shimokawa statistic. Pearson’s classical correlation coefficient r_nrn, Spearman’s ρ_nρn, Kendall’s τ, chi-plots, and K-plots were employed to assess the dependence of drought variables. The meta-elliptical copulas and Gumbel–Hougaard, Ali–Mikhail–Haq, Frank and Clayton copulas were tested to determine the best-fit copula. Based on the root mean square error and the Akaike information criterion, meta-Gaussian and t copulas yielded a better fit. A bootstrap version based on Rosenblatt’s transformation was employed to test the goodness-of-fit for meta-Gaussian and t copulas. It was found that none of meta-Gaussian and t copulas considered could be rejected at the given significance level. The meta-Gaussian copula was then employed to model dependence due to its simplicity for parameter estimation, and results were found satisfactory. Mirabbasi et al. (2012) and Chen et al. (2013) investigated the copula applications for drought characteristics.

Copula Application in Other Fields Related to Water Resources Engineering

Using four copulas (independence/product, Farlie–Gumbel–Morgenstern, Frank, and Clayton), Favre et al. (2004) described the modeling of the combined risk in the framework of frequency analysis of peak flows from the watershed of Peribonka in Québec, Canada, and the joint modeling of peak flows and volumes of the watershed of Rimouski River in Québec, Canada, using three copulas (Independence, Frank, and Clayton). Results showed that the copula approach was promising, since it allowed the researchers to take into account a wide range of correlation that can happen in hydrology. De Michele et al. (2005) proposed a two-copula method to model a bivariate extreme value distribution with generalized extreme value marginals. The peak-volume pair can then be transformed to the corresponding flood hydrograph, representing the river basin response, through a simple linear model. The hydrological safety of dams was considered for checking the adequacy of dam spillway. The reservoir behavior was tested using a long synthetic series of flood hydrographs with application to an existing dam.

Bárdossy (2006) calculated empirical copulas for four water quality parameters, chloride, sulfate, pH, and nitrate, obtained from a large-scale groundwater quality measurement network in Baden-Württemberg (Germany). A Gaussian and a non-Gaussian copula were applied, and results indicated that the spatial dependence structure of the investigated parameters was not Gaussian. According to the bootstrap-based statistical tests using stochastic simulation of multivariate distributions, the Gaussian copula was rejected for most of the parameters, but the non-Gaussian alternative was not rejected in most cases. Grimaldi and Serinaldi (2006b) proposed a procedure to describe the trivariate cumulative distribution function (CDF) of critical depth, peak, and total depth. Seven three-copula functions were estimated with the canonical maximum likelihood (CML) method, and the best one was chosen for analyzing the CDF of copulas.

Bárdossy and Li (2008) used the Gaussian as well as non-Gaussian copulas to depict the dependence structure of the investigated parameters without the influence of marginal distributions. Division of observations into multipoint subsets and subsequent maximization of the corresponding likelihood function were employed to estimate copula parameters. Chloride, nitrate, pH, sulfate, and dissolved oxygen observations of a large-scale groundwater quality measurement network in Baden-Württemberg were used to demonstrate the methodology. Results showed that all five parameters showed non-Gaussian dependence, and the non-Gaussian copulas gave better results than the geostatistical interpolations. Meanwhile, validation of the confidence intervals showed that they were more realistic than the estimation variances obtained by ordinary kriging.

1.3 Theme of the Book

The goal of the book is to discuss for graduate level students and engineers how to appropriately apply the copula method. The book is divided into two parts. Part I introduces the copula theory, including copula properties, methods of construction, copula families, etc. Part II discusses applications of copulas in hydrology and water resources engineering with case studies.

More specifically, Part I includes the following chapters with regard to copula theory. Chapter 2 briefly reviews the preliminaries for univariate and multivariate frequency analysis. Chapter 3 discusses the important properties of copulas. Chapter 4 discusses the bivariate Archimedean copula families and multivariate symmetric Archimedean copula extensions. Chapter 5 discusses the nested (i.e., asymmetric) Archimedean copula and the vine copula through pair copula construction. Chapter 6 discusses the non-Archimedean Plackett copula family. Chapter 7 discusses meta-elliptical non-Archimedean copula families. Chapter 8 discusses the entropic copulas. Chapter 9 discusses the copula application in time series analysis.

Part II provides the following case studies. Chapter 10 discusses the copula application to rainfall analysis. Chapter 11 discusses the copula application to flood analysis. Chapter 12 discusses the copula application to water quality analysis. Chapter 13 discusses the copula application to drought analysis. Chapter 14 discusses the copula application to compound extremes. Chapter 15 discusses the copula application to network design. Chapter 16 discusses the river sediment transport. And Chapter 17 discusses the interbasin transfer.

References

Ali, M. M., Mikhail, N. N., and Haq, M. S. (1978). A class of bivariate distributions including the bivariate logistic. Journal of Multivariate Analysis, 8, 405–412.

Bárdossy, A. (2006). Copula-based geostatistical models for groundwater quality parameters. Water Resources Research, 42, W11416, doi:10.1029/2005WR004754.

Bárdossy, A. and Li, J. (2008). Geostatistical interpolation using copulas. Water Resources Research, 44, W07412, doi:10.1029/2007WR006115.

Braekers, R. and Veraverbeke, N. (2005). A copula-graphic estimator for the conditional survival function under dependent censoring. Technical Report, 0315. Interuniversity Attraction Pole.

Caperaa, P., Fougeres, A. L., and Genest, C. (1997). A nonparametric estimation procedure for bivariate extreme copulas. Biometrika, 84(3), 567–577.

Chakak, A. and Koehler, K. J. (1995). A strategy for constructing multivariate distributions. Communicational Statistics (Simulation), 24(3), 537–550.

Chen, L., Singh, V. P., Guo, S., Mishra, A., and Guo, J. (2013) Drought analysis using copulas. Journal of Hydrologic Engineering, 18(7), 797–808. doi:10.1061/(ASCE)HE.1943–5584.0000697.

Chen, X. and Fan, Y. (2002). Evaluating density forecasts via the copula approach. www.vanderbilt.edu/Econ/wparchive/workpaper/vu02-w25R.pdf.

Cook, R. D. and Johnson, M. E. (1981). A family of distributions for modeling non-ellipitically symmetric multivariate data. Journal of the Royal Statistical Society. Series B. (Methodological), 43(2), 210–218.

De Michele, C., Salvadori, G., Canossi, M., Petaccia, A., and Rosso, R. (2005). Bivariate statistical approach to check adequacy of dam spillway. Journal of Hydrologic Engineering, 10(1), 50–57.

De Michele, C., Salvadori, G., Passoni, G., and Vezzoli, R. (2007). A multivariate model of sea storms using copulas. Coastal Engineering, 54, 734–751.

Dupuis, D. J. (2007). Using copulas in hydrology: benefits, cautions, and issues. Journal of Hydrologic Engineering, 12(4), 381–393.

Evin, G. and Favre, A. C. (2008). A new rainfall model based on the Neyman–Scott process using cubic copulas. Water Resources Research, 44, W03433, doi:10.1029/2007WR006054.

Fang, H., Fang, K.T., and Kotz, S. (2002). The meta-elliptical distributions with given marginals. Journal of Multivariate Analysis, 82, 1–16.

Favre, A. C., Adlouni, S. E., Perreault, L., Thiémonge, N., and Bobeé, B. (2004). Multivariate hydrological frequency analysis using copulas. Water Resources Research, 40(1), W01101, doi:10.1029/2003WR002456.

Frees, E. W. and Valdez, E. A. (1997). Understanding relationships using copulas. North American Acturial Journal, 2(1), 1–37.

Gebremichael, M. and Krajewski, W. F. (2007). Application of copulas to modeling temporal sampling errors in satellite-derived rainfall estimates. Journal of Hydrologic Engineering, 12(4), 404–408.

Genest, C. (1987). Frank’s family of bivariate distribution. Biometrika, 74(3), 549–555.

Genest, C. and Boies, J. C. (2003). Detecting dependence with Kendall plots. American Statistician, 57(4), 275–284.

Genest, C., Favre, A. C., Béliveau, J., and Jacques, C. (2007b). Meta-elliptical copulas and their use in frequency analysis of multivariate hydrological data. Water Resources Research, 43, W09401, doi:10.1029/2006WR005275.

Genest, C. and MacKay, J. (1986). The joy of copulas: bivariate distributions with uniform marginals. American Statistician, 40(4), 280–283.

Genest, C. and Rivest, L.-P. (1993). Statistical inference procedures for bivariate Archimedean copulas. Journal of the American Statistical Association, 88(423), 1034–1043.

Genest, C., Ghoudi, K., and Rivest, L.-P. (1995). A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika, 82(3), 543–552.

Genest, C., Quessy, J.-F., and Rémillard, B. (2006). Goodness-of-fit procedures for copula models based on the integral probability transformation. Scandinavian Journal of Statistics, 33, 337–366.

Genest, C., Rémillard, B., and Beaudoin, D. (2007a). Goodness-of-fit tests for copulas: a review and a power study. Insurance: Mathematics and Economics, doi:10.1016/j.insmatheco.2007.10.005.

Grimaldi, S. and Serinaldi, F. (2006a). Asymmetric copula in multi-variate flood frequency analysis. Advances in Water Resources, 29(8), 1155–1167.

Grimaldi, S. and Serinaldi, F. (2006b). Design hyetograph analysis with 3-copula function. Hydrological Sciences Journal, 51(2), 223–238.

Hosking, J. R. M. (1990). Fortran routines for use with the method of L-moments, Version 2. Research Report RC-17097, IBM Thomas J. Watson Research Center, Yorktown Heights.

Joe, H. (1997). Multivariate Models and Dependence Concept. Chapman & Hall, New York.

Kao, S. C. and Govindaraju, R. S. (2007). A bivariate rainfall frequency analysis of extreme rainfall with implications for design. Journal of Geophysical Research, 112, D13119, doi:10.1029/2007JD008522.

Kao, S. C. and Govindaraju, R. S. (2008). Trivariate statistical analysis of extreme rainfall events via the Plackett family of copulas. Water Resources Research, 44(2), W02415, doi:10.1029/2007WR006261.

Long, D. and Krzysztofowicz, R. (1995). A family of bivariate densities constructed from marginals. Journal of the American Statistical Association, 90(430), 739–746.

Mirabbasi, R., Fakheri-Fard, A., and Dinpashoh, Y. (2012). Bivaraite drought frequency analysis using the copula method. Theoretical Applied Climatology, 108(1–2), 191–206, doi:10.1007/s00704-011-0524-7.

Muller, A. and Scarsini, M. (2001). Stochastic comparison of random vectors with a common copula. Mathematics of Operations Research, 26(4), 723–740.

Nelsen, R. B. (2006). An Introduction to Copulas. Springer, New York.

Quesada-Molina, J. J. and Rodriguez-Lallena, J. A. (1995a). Bivariate copulas with quadratic sections. Nonparametric Statistics, 5, 323–337.

Quesada-Molina, J. J. and Rodriguez-Lallena, J. A. (1995b). Bivariate copulas with cubic sections. Nonparametric Statistics, 7, 205–220.

Rao, A. R. and Hamed, K. H. (2000). Flood Frequency Analysis. CRC Publications, Boca Raton, London, New York, Washington.

Rodriguez-Lallena, J. A. and Úbeda-Flores, M. (2004). A new class of bivariate copulas. Statistics and Probability Letters, 66, 315–325.

Salvadori, G. and Michele, C. D. (2003). A generalized Pareto intensity and duration model of storm rainfall exploiting 2-copulas. Journal of Geophysical Research, 108 (D2), doi:10,1029/2002JD002543.

Salvadori, G. and De Michele, C. (2004). Frequency analysis via copulas: theoretical aspects and applications to hydrological events. Water Resources Research, 40, W12511, doi:10.1029/2004WR003133.

Salvadori, G. and De Michele, C. (2007). On the use of copulas in hydrology: theory and practice. Journal of Hydrologic Engineering, 12(4), 369–380.

Sancetta, A. and Satchell, S. (2001). Berstein Approximations to the Copula Function and Portfolio Optimization. DAE Working Paper 0105, University of Cambridge. www.econ.cam.ac.uk/research-files/repec/cam/pdf/wp0105.pdf.

Shiau, J. T. (2006). Fitting drought duration and severity with two-dimensional copulas. Water Resources Management, 20, 795–815.

Shiau, J. T., Feng, S., and Nadarajah, S. (2007). Assessment of hydrological droughts for the Yellow River, China, using copulas. Hydrological Processes, 21(16), 2157–2163.

Simonovic, S. P. and Karmakar, S. (2007). Flood Frequency Analysis Using Copula with Mixed Marginal Distribution. Report No. 055. www.econ.cam.ac.uk/research-files/repec/cam/pdf/wp0105.pdf.

Singh, V. P. (1988). Hydrologic Systems: Rainfall-Runoff Modeling. Prentice Hall, Englewood Cliffs.

Singh, V. P. (1998). Entropy-Based Parameter Estimation in Hydrology. Kluwer Academic Publishers, Dordrecht, Boston, London.

Singh, V. P., Jain, S. K., and Tyagi, A. (2007). Risk and Reliability Analysis. ASCE Press, Reston.

Sklar, A. (1959). Fonctions de repartition à n dimensionls et leurs marges. Publications de l’Institut de Statistique de l’Université de Paris, Paris. 8, 229–231.

Song, S. B. and Singh, V. P. (2009). Meta-elliptical copulas for drought frequency analysis of periodic hydrologic data. Stochastic Environmental Research and Risk Assessment, doi:10.1007/s00477–009–0331–1.

Vandenberghe, S., Verhoest, N. E. C., Onof, C., and De Baets, B. (2011). A comparative Copula-based bivariate frequency analysis of observed and simulated storm events: a case study on Bartlett-Lewis modeled rainfall. Water Resources Research, 47. doi:10.1029/2009wr008388.

Wang, C., Chang, N. B., and Yeh, G. T. (2009). Copula-based flood frequency (COFF) analysis at the confluences of river systems. Hydrological Processes, 23, 1471–1486.

Wong, G., Lambert, M. F., and Metcalfe, A. V. (2007). Trivariate copulas for characterisation of droughts. ANZIAM Journal, 49, C306–C323.

Yue, S. (1999). Applying bivariate normal distribution to flood frequency analysis. Water International, 24(3), 248–254.

Yue, S. (2000a). Joint probability distribution of annual maximum storm peaks and amounts as represented by daily rainfalls. Hydrologic Science Journal, 45(2), 315–326.

Yue, S. (2000b). The Gumbel logistic model for representing a multivariate storm event. Advances in Water Resources, 24 (2), 179–185.

Yue, S. (2000c). The Gumbel mixed model applied to storm frequency analysis. Water Resources Management, 14(5), 377–389.

Yue, S., Ouarda, T. B. M. J., Bobée, B., Legendre, P., and Bruneau, P. (1999). The Gumbel mixed model for flood frequency analysis. Journal of Hydrology, 226, 88–100.

Yue, S., Ouarda, T. B. M. J., and Bobée B (2001). A review of bivariate gamma distributions for hydrological application. Journal of Hydrology, 246, 1–18.

Yue, S. and Rasmussen, P. (2002). Bivariate frequency analysis: discussion of some useful concepts for hydrological application. Hydrological Processes, 16(14), 811–819.

Zheng, M. and Klein, J. P. (1995). Estimates of marginal survival for dependent competing risk based on assumed copula. Biometrika, 82(1), 127–138.

Zhang, L. and Singh, V. P. (2006). Bivariate flood frequency analysis using the copula method. Journal of Hydrologic Engineering, 11(2), 150–164.

Zhang, L. and Singh, V. P. (2007a). Gumbel-Hougaard copula for trivariate rainfall frequency analysis. Journal of Hydrologic Engineering, 12(4), 409–419.

Zhang, L. and Singh, V. P. (2007b). Trivariate flood frequency analysis using the Gumbel–Hougaard copula. Journal of Hydrologic Engineering, 12(4), 431–439.

Zhang, L. and Singh, V. P. (2007c). Bivariate rainfall frequency distributions using Archimedean copulas. Journal of Hydrology, 332, 93–109.

Additional Reading

Adamson, P. T., Metcalfe, A. V., and Parmentier B. (1999). Bivariate extreme value distributions: an application of the Gibbs sampler to the analysis of floods. Water Resources Research, 35(9), 2825–2832.

Ashkar, F. (1980). Partial duration series models for flood analysis. PhD thesis. Ecole Polytechnique of Montreal, Montreal, Canada.

Ashkar, F., El Jabi, N., and Issa, M. (1998). A bivariate analysis of the volume and duration of low-flow events. Stochastic Hydrology and Hydraulics, 12, 97–116.

Bacchi, B., Becciu, G,. and Kottegoda, N. T. (1994). Bivariate exponential model applied to intensities and durations of extreme rainfall. Journal of Hydrology, 155, 225–236.

Choulakian, V., El Jabi, N., and Moussi, J. (1990). On the distribution of flood volume in partial duration series analysis of flood phenomena. Stochastic Hydrology and Hydraulics, 4, 217–226.

Correia, F. N. (1987). Multivariate partial duration series in flood risk analysis. In: Singh, V. P. (Ed) Hydrologic Frequency Modeling. Reidel, Dordrecht, 541–554.

Cunnane, C. (1987). Review of statistical models for flood frequency estimation. In: Singh, V. P. (Ed) Hydrologic Frequency Modeling, Reidel, Dordrecht, 49–95.

Durrans, S. R. (1998). Total probability methods for problems in flood frequency estimation. In: Parent, E., Hubert, P., Bobee, B., and Miquel, J. (Eds) Statistical and Bayesian Methods in Hydrological Science. International Hydrological Programme, Nairobi, Jakarta, Venice, Cairo, and Montevideo. Technical Documents in Hydrology, No. 20UNESCO, Paris, 299–326.

Futter, M. R., Mawdsley, J. A., and Metcalfe, A. V. (1991). Short-term flood risk prediction: a comparison of the Cox regression model and a conditional distribution model. Water Resources Research, 27(7), 1649–1656.

Goel, N. K., Seth, S. M., and Chandra, S. (1998). Multivariate modeling of flood flows. Journal of Hydraulic Engineering, 124(2), 146–155.

Goel, N. K., Kurothe, R. S., Mathur, B. S., and Vogel, R. M. (2000). A derived flood frequency distribution for correlated rainfall intensity and duration. Journal of Hydrology, 228, 56–67.

Grimaldi, S., Serinaldi, R., Napolitano, F., and Ubertini, L. (2005). A 3-copula function application or design hyetograph analysis. Proceedings of Symposium S2, Held during the Seventh IAHS Scientific Assembly at Foz do Iguacu, Brazil, April 2005. IAHS publ. 293. International Association of Hydrological Sciences (IAHS), London. https://iahs.info/uploads/dms/13113.33%20203-211%20s2-10%20Grimaldi%20et%20al%2066.pdf.

Haimes, Y. Y., Lambert, J. H., and Li, D. (1992). Risk of extreme events in a multiobjective framework. Water Resources Bulletin, 28(1), 201–209.

Hashino, M. (1985). Formulation of the joint return period of two hydrologic variates associated with a Poisson process. Journal of Hydroscience and Hydraulic Engineering, 3(2), 73–84.

Hosking, J. R. M. and Wallis, J. R. (1997). Regional Frequency Analysis. Cambridge University Press. Cambridge.

Kelly, K. S. and Krzysztofowicz, R. (1997). A bivariate meta-Gaussian density for use in hydrology. Stochastic Hydrology and Hydraulics, 11, 17–31.

Kite, G. W. (1978). Frequency and Risk Analysis in Hydrology. Water Resource Publications, Fort Collins.

Kurothe, R. S., Goel, N. K., and Mathur, B. S. (1997). Derived flood frequency distribution for negatively correlated rainfall intensity and duration. Water Resources Research, 33, 2103–2107.

Krstanovic, P. F. and Singh, V. P. (1987). A multivariate stochastic flood analysis using entropy. In: Singh, V. P. (Ed) Hydrologic Frequency Modeling. Reidel, Dordrecht, 515–539.

Lall, U. and Bosworth, K. (1994). Multivariate kernel estimation of functions of space and time. In: Hipel K. V., Mcleod, A. I., Panu, U. S., Singh, V. P. (Eds) Time Series Analysis in Hydrology and Environmental Engineering. Kluwer Academic Publications, Dordrecht, 301–315.

Loganathan, G. V., Kuo, C. Y., and Yannaccone, J. (1987). Joint probability distribution of streamflows and tides in estuaries. Nordic Hydrology, 18, 237–246.

Long, D. and Krzysztofowicz, R. (1996). Geometry of a correlation coefficient under a copula. Communications in Statistics: Theory and Methods, 25(6), 1397–1404.

Nachtnebel, H. P. and Konecny, F. (1987). Risk analysis and time-dependent flood models. Journal of Hydrology, 91, 295–318.

Renard, B. and Lang, M. (2007). Use of a Gaussian copula for multivariate extreme value analysis: some case studies in hydrology. Advances in Water Resources, 30, 897– 912.

Rényi, A. (1974). On measure of dependence. Acta Mathematica Academiae Scientiarum Hungarica, 10, 441–451.

Rivest, L.-P. and Wells, M. T. (2001). A martingale approach to the Copula-graphic estimator for the survival function under dependent censoring. Journal of Multivariate Analysis, 79, 138–155.

Sackl, B. and Bergmann, H. (1987). A bivariate flood model and its application. In: Singh, V. P. (Ed) Hydrologic Frequency Modeling. Reidel, Dordrecht, 571–582.

Salvadori, G. and De Michele, C. (2006). Statistical characterization of temporal structure of storms. Advances in Water Resources, 29(6), 827–842.

Schweizer, B. and Wolff, E. F. (1981). On nonparametric measures of dependence for random variables. Annals of Statistics, 9, 879–885.

Schweizer, B. (1991). Thirty years of copula. In: Dall’Aglio, G., Kotz, S., and Salinetti, G. (Eds) Advances in Probability Distributions with Given Marginals: Beyond the Copulas. Mathematics and Its Applications, 67, Kluwer Academic Publishers, Dordrecht, 13–50.

Serinaldi, F. and Grimaldi, S. (2007). Fully nested 3-copula: procedure and application on hydrological data. Journal of Hydrologic Engineering, 12(4), 420–430.

Singh, K. and Singh, V. P. (1991). Derivation of bivariate probability density functions with exponential marginals. Journal of Stochastic Hydrology and Hydraulics, 5, 55–68.

Wilks, D. S. (1998). Multisite generalization of a daily stochastic precipitation generation model. Journal of Hydrology, 210, 178–191.

Wolff, E. F. (1977). Measures of Dependence Derived from Copulas. PhD thesis, University of Massachusetts, Amherst.

Zhang, L. and Singh, V. P. (2014). Trivariate flood frequency analysis using discharge time series with possible different lengths: Cuyahoga River case study. Journal of Hydrologic Engineering. doi:10.1061/(ASCE)HR.1943-5584.0001003.

Tags: Copulas and their Applications in Water Resources Engineering

Oct 12, 2020 | Posted by drzezo in Water and Sewage | Comments Off

Civil Engineer Key

Fastest Civil Engineer Engine

1 – Introduction