5 – Asymmetric Copulas | Civil Engineer Key

Abstract

Much of the literature on copulas, discussed in the previous chapters, is limited to the bivariate cases. The Gaussian and student copulas have been commonly applied to model the dependence in higher dimensions (Genest and Favre, 2007; Genest et al., 2007a). In Chapter 4, we discussed the extension of symmetric bivariate Archimedean copulas as well as their major restrictions to model high-dimensional dependence (i.e., d ≥ 3)d≥3). Through the extension of the bivariate Archimedean copula, the multivariate Archimedean copula is symmetric and denoted as exchangeable Archimedean copula (EAC). EAC allows for the specification of only one generating function and only one set of parameters θ. In other words, random variates by pair share the same degree of dependence. Using the trivariate random variable {X1, X2, X3} as an example, {X1, X2}, {X2, X3}, and {X1, X3} should have the same degree of dependence. However, this assumption is rarely valid. This chapter discusses the following two approaches of constructing asymmetric multivariate copulas: nested Archimedean copula construction (NAC) and the vine copulas through pair-copula construction (PCC).

5 Asymmetric Copulas High Dimension

5.1 Construction of Higher-Dimensional Copulas

In general, there are dd−12 pairs of variables for a given d-dimensional multivariate problem. The NAC approach constitutes a significant improvement over EAC; however, it is still not rich enough to model all possible mutual dependencies among the d dimensional random variables (Berg and Aas, 2007). Based on the multivariate probability density function decomposition (Joe, 1997), the PCC approach allows for the free specification of dd−12 copulas that are hierarchical in nature. Further, it allows for selecting copulas from different families to model the dependence structure (Berg and Aas, 2007; Aas et al., 2009). Hence, the NAC approach is introduced first, followed by the PCC approach.

5.2 Nested Archimedean Copulas (NAC)

Representing one type of multivariate extension, NAC constitutes a significant improvement over EAC. We first review the fully nested Archimedean construction (FNAC) and the partially nested Archimedean construction (PNAC), and then turn to the general nested Archimedean copula.

5.2.1 Fully Nested Archimedean Copulas (FNAC)

For d-dimensional random variables modeled with FNAC, there are d – 1 bivariate copula functions, which result in dependence structure with partial exchangeability (Joe, 1997; Embrechts et al., 2003; Whelan, 2004; McNeil, 2007; Savu and Trede, 2010; among others). Figure 5.1 presents an example of a four-dimensional FNAC structure. The bivariate copula is the building block for FNAC. The FNAC structure is constructed, based on the degree of dependence between the pair variables, with the following procedures:

i. Choose the variables with the highest degree of dependence (rank-based) as the first two variables (1 and 2).
ii. Compute the empirical copula using variables 1 and 2.
iii. Evaluate the degree of dependence (rank-based) between empirical copula from step ii with the remaining variables.
iv. Choose variable 3, i.e., yielding the highest degree of dependence (rank-based) with the empirical copula built with variables 1 and 2.
v. Continue the process until the last variable is considered.

Figure 5.1 Four-dimensional FNAC structure.

From Figure 5.1, it is seen that three bivariate copulas are needed to represent the dependence for the four-dimensional random variables through FNAC as follows. First, random variables u₁u1 and u₂u2 are coupled through copula C₃C3. Second, random variable u₃u3 is coupled with C₃(u₁, u₂)C3u1u2 through copula C₂C2. Third, random variable u₄u4 is coupled with C₂(u₃, C₃(u₁, u₂))C2u3C3u1u2 through copula C₁C1. Hence, a four-dimensional copula requires three bivariate copulas C₁, C₂C1,C2, and C₃C3, with generators ϕ₁ϕ1, ϕ₂ϕ2, and ϕ₃ϕ3 and may be written as follows:

C(u1,u2,u3,u4)=C1(u4,C2(u3,C3(u1,u2)))=ϕ1−1(ϕ1(u4)+ϕ1(ϕ2−1(ϕ2(u3)+ϕ2(ϕ3−1(ϕ3(u1)+ϕ3(u2))))))=ϕ1−1(ϕ1(u4)+ϕ1∘ϕ2−1(ϕ2(u3)+ϕ2∘ϕ3−1(ϕ3(u1)+ϕ3(u2))))

(5.1)

where _○ represents the composition of functions.

Similarly, the FNAC for d-dimensional random variables (e.g., Joe, 1997; Embrechts et al., 2003; Whelan, 2004; Nelsen, 2006) may be generated as follows:

C(u1,…,ud)=ϕ1−1(ϕ1(ud)+ϕ1∘ϕ2−1(ϕ2(ud−1)+ϕ2∘…∘ϕd−1−1(ϕd−1(u1)+ϕd−1(u2))))

(5.2)

It is worth noting that Equation (4.1) in Chapter 4, i.e., the exchangeable symmetric Archimedean copula, is a special case of Equation (5.2) if ϕ(θ₁) = ϕ₂(θ₂) = … = ϕ_d − 1(θ_d − 1) = ϕ(θ), θ₁ = θ₂ = … = θ_d − 1ϕθ1=ϕ2θ2=…=ϕd−1θd−1=ϕθ,θ1=θ2=…=θd−1. For the d-dimensional FNAC, the bivariate margins themselves are also Archimedean copulas that allow for free specification of d – 1 copulas with the remaining identified implicitly through FNAC (Whelan, 2004; Berg and Aas, 2007). Using Equation (5.1) (Figure 5.1) as an example, this statement may be expressed as follows: (i) there are three Archimedean copulas of free specification, i.e., C₃C3 with parameter θ₃θ3 for variables u₁, u₂u1,u2; C₂C2 with parameter θ₂θ2 for variables {u₃, C₃(u₁, u₂; θ₃}{u3,C3(u1,u2;θ3}; and C₁C1 with parameter θ₁θ1 for variables {u₄, C₂(u₃, C₃(u₁, u₂; θ₃); θ₂}{u4,C2(u3,C3u1u2θ3;θ2}; (ii) pairs (u₁, u₃), (u₂, u₃)u1u3,u2u3 have copula C₂C2 with parameter θ₂θ2; and (iii) pairs (u₁, u₄), (u₂, u₄), (u₃, u₄)u1u4,u2u4,u3u4 have copula C₃C3 with parameter θ₁θ1. The decreasing degree of dependence for the increasing levels of nesting (i.e., θ₁ ≤ θ₂ ≤ … ≤ θ_d − 1θ1≤θ2≤…≤θd−1 with θ₁θ1 and θ_d − 1θd−1 representing the parameters for the highest and lowest levels, respectively) is another technical condition for proper construction of the d-dimensional fully nested asymmetric Archimedean copula.

It should also be pointed out that the following conditions need to be satisfied for the nested generating functions:

ϕ1−1,ϕ2−1,…,ϕd−1−1 must satisfy the necessary conditions for being completely monotonic.
According to Embrechts et al. (2003), the coupling of functions wk=ϕk∘ϕk+1−1 belongs to a class of functions ℒ∞∗ defined as follows:

ℒ∞∗=ω:0∞→0∞ω0=0ω∞=∞−1k−1dkωtdt≥0k=12…∞(5.3)

Based on Equation (5.2), the simplest three-dimensional FNAC (shown in Figure 5.2) can be written as follows:

Cu1u2u3=ϕ1−1(ϕ1u3+ϕ1∘ϕ2−1ϕ2u1+ϕ2u2

(5.4)

Figure 5.2 Three-dimensional FNAC structure.

In accordance with Equation (5.4), we outline here the derivation of five three-dimensional asymmetric Archimedean copulas that are commonly applied.

M3 (Joe, 1997):

C2u1u2=−1θ2ln1−1−e−θ2u11−e−θ2u21−e−θ2

Let t = C₂(u₁, u₂)t=C2u1u2. Then we have C1u3t=−1θ1ln1−1−e−θ1u3(1−e−θ1t1−e−θ1

Cu1u2u3=C1u3C2u1u2=C1u3t=−1θ1ln1−1−e−θ1u31−1−e−θ2u11−e−θ2u21−e−θ21−e−θ1

(5.5)

θ₂ ≥ θ₁ ∈ [0, ∞), τ₁₂, τ₁₃, τ₂₃ ∈ [0, 1]θ2≥θ1∈0∞,τ12,τ13,τ23∈01 for positive dependent trivariate variables.

The M3 copula may be also called the asymmetric trivariate Frank copula.

We now use the following specific examples to illustrate these marginal distributions.

Example 5.1 Derive the M3 copula for θ₁ = 2.0 and θ₂ = 3.0θ1=2.0andθ2=3.0 by setting u₃ = 0.6u3=0.6. Assuming u₁~F₁(x₁) : X₁~gamma(2, 4); u₂~F₂(x₂) : X₂~normal(1, 3²); u₃~F₃(x₃) : X₃~EV1(10, 7)u1~F1x1:X1~gamma24;u2~F2x2:X2~normal132;u3~F3x3:X3~EV1107, and {X₁, X₂}X1X2 has a higher pairwise dependence.

Solution: With {X₁, X₂}X1X2 having higher pairwise dependence, we first couple X₁and X₂X1andX2 and build the copula function from the marginals as follows:

u1=F1x1=1Γ2γ4×1γ:incompletegammafunction

u2=F2x2=Φx2−13,Φ:Standardnormaldistribution

u3=F3x3=exp−exp−x3−107

Since we already set u₃ = 0.6u3=0.6, then we have x₃≈9.388x3≈9.388 from the EV1 population.

Finally, we can write the fully nested copula using the M3 copula as follows:

C2u1u23=−13.0ln1−1−e−3.0u11−e−3.0u21−e−3.0=−13.0ln1−(1−e−3.0(1Γ2γ4×1))1−e−3.0Φx2−131−e−3.0

Cu1u20.632=C0.6C2u1u232=−12.0ln1−1−e−2.00.61−1−e−3.0u11−e−3.0u21−e−3.01−e−2.0=−12.0ln1−1−e−2.00.61−(1−e−3.0(1Γ2γ4×1))1−e−3.0Φx2−131−e−3.01−e−2.0

Figure 5.3(a) plots the corresponding joint CDF for the derived M3 copula with u₃ = 0.6u3=0.6. M4 (Joe, 1997):

C2u1u2=u1−θ2+u2−θ2−1−1θ2

Let t = C₂(u₁, u₂)t=C2u1u2. Then we have C1u3t=u3−θ1+t−θ1−1−1θ1

Cu1u2u3=C1u3C2u1u2=u1−θ2+u2−θ2−1−θ1θ2+u3−θ1−1−1θ1

(5.6)

θ₂ ≥ θ₁ ∈ [0, ∞), τ₁₂, τ₁₃, τ₂₃ ∈ [0, 1]θ2≥θ1∈0∞,τ12,τ13,τ23∈01 for positive dependent trivariate variables. The M4 copula may also be called the trivariate asymmetric Clayton copula.

Figure 5.3 Joint CDF for derived FNACs: (a) M3 copula, (b) M4 copula, (c) M5 copula, (d) M6 copula, and (e) M12 copula.

Example 5.2 Derive the M4 copula using information given in Example 5.1.

Solution: In Example 5.1, we have θ₁ = 2.0, θ₂ = 3.0θ1=2.0,θ2=3.0 by setting u₃ = 0.6u3=0.6. Thus, we have the following:

C2u1u23=u1−3.0+u2−3.0−1−13.0=1Γ2γ4×1−3.0+Φx2−13−3.0−1−13.0

Cu1u20.632=C1C2u1u230.6=u1−3.0+u2−3.0−123+0.6−2.0−1−12.0=1Γ2γ4×1−3.0+Φx2−13−3.0−123+0.6−2.0−1−12.0

Figure 5.3(b) plots the corresponding joint CDF for the derived M4 copula with u₃ = 0.6u3=0.6.

M5 (Joe, 1997):

C2u1u2=1−1−u1θ2+1−u2θ2−1−u1θ21−u2θ21θ2

Let t=C2u1u2,1−t=1−u1θ2+1−u2θ2−1−u1θ21−u2θ21θ2. Then we have the following:

Cu1u2u3=C1u3C2u1u2=1−1−u1θ21−1−u2θ2+1−u2θ2θ1θ21−1−u3θ1+1−u3θ11θ1

(5.7)

θ₂ ≥ θ₁ ∈ [1, ∞), τ₁₂, τ₁₃, τ₂₃ ∈ [0, 1]θ2≥θ1∈1∞,τ12,τ13,τ23∈01. The M5 copula may also be called the trivariate asymmetric Joe copula.

Example 5.3 Derive M5 copula using the information given in Example 5.1.

Solution: In Example 5.1, we have θ₁ = 2.0, θ₂ = 3.0θ1=2.0,θ2=3.0 by setting u₃ = 0.6u3=0.6. Thus we have the following:

C2u1u23.0=1−1−u13.0+1−u23.0−1−u13.01−u23.013.0

=1−(1−1Γ2γ4×13.0+1−Φx2−133.0−1−1Γ2γ4×11−Φx2−133.0)13.0

Cu1u20.632=1−(1−u13.01−1−u23.0+1−u23.02.03.01−0.42.0+0.42.0)12.0

Figure 5.3(c) plots the corresponding joint CDF for the derived M5 copula with u₃ = 0.6u3=0.6.

M6 (Joe, 1997; Embrechts, 2003):

Let C2u1u2=e−−lnu1θ2+−lnu2θ21θ2,and

t=C2u1u2,−lnt=(−lnu1)θ2+−lnu2θ21θ2.Thenwehave

Cu1u2u3=C1(u3,C2u1u2=e−−lnu1θ2+−lnu2θ2θ1θ2+−lnu3θ11θ1

(5.8)

θ₂ ≥ θ₁ ∈ [1, ∞), τ₁₂, τ₁₃, τ₂₃ ∈ [0, 1]θ2≥θ1∈1∞,τ12,τ13,τ23∈01 for positive dependent trivariate variables. The M6 copula may also be called the trivariate asymmetric Gumbel–Hougaard copula.

Example 5.4 Derive the M6 copula using the information given in Example 5.1.

Solution: In Example 5.1, we have θ₁ = 2.0, θ₂ = 3.0θ1=2.0,θ2=3.0 by setting u₃ = 0.6u3=0.6. Thus we have the following:

C2u1u23=e−−lnu13.0+−lnu23.013.0

Cu1u20.632=e−−lnu13.0+−lnu23.023+−ln0.62.012.0

Figure 5.3(d) plots the corresponding joint CDF for the derived M6 copula with u₃ = 0.6u3=0.6.

M12 (Embrechts, 2003):

C2u1u2=11+1u1−1θ2+1u2−1θ21θ2

Let t=C2u1u2,1t−1=1u1−1θ2+1u2−1θ21θ2. Then we have

Cu1u2u3=11+1u1−1θ2+1u2−1θ2θ1θ2+1u3−1θ11/θ1

(5.9)

θ2≥θ1∈1∞,τ12,τ13,τ23∈131.

Example 5.5 Derive the M12 copula using the information given in Example 5.1.

Solution:

C2u1u23=11+1u1−13.0+1u2−13.013.0

Cu1u20.632=11+1u1−13.0+1u2−13.023+10.6−12.01/2.0

Figure 5.3(e) plots the joint CDF for the derived M12 copula with u₃ = 0.6u3=0.6.

Example 5.6 Derive a four-dimensional FNAC copula function based on the bivariate Frank copula.

Solution: From Figure 5.1, we have the following:

C3u1u2θ3=−1θ3ln1−1−e−θ3u11−e−θ3u21−e−θ3

C₃C3 and u₃u3 are coupled as copula C₂(C₃, u₃)C2C3u3 with parameter θ₂θ2, which can be written as follows:

C2u1u2u3=C2C3u3θ2=−1θ2ln1−1−1−1−e−θ3u11−e−θ3u21−e−θ3θ2θ31−e−θ2u31−e−θ2

Finally, C₂C2 and u₄u4 are defined as copula C₁(C₂, u₄)C1C2u4 with parameter θ₁θ1, which results in C₁(C₂, u₄; θ₁) = C(u₁, u₂, u₃, u₄; θ₁, θ₂, θ₃)C1C2u4θ1=Cu1u2u3u4θ1θ2θ3 as follows:

Cu1u2u3u4θ1θ2θ3=−1θ1ln1−1−e−θ1u41−e−θ11−1−1−1−1−e−θ3u11−e−θ3u21−e−θ3θ2θ31−e−θ2u31−e−θ2θ1θ2

In the same way as for the previous examples, for the four-dimensional random variables {X_i, i = 1, …, 4}Xii=1…4, the random variable X_iXi may follow different marginal distributions as follows:

u₁ = F₁(x₁); u₂ = F₂(x₂); u₃ = F₃(x₃); u₄ = F₄(x₄)

u1=F1x1;u2=F2x2;u3=F3x3;u4=F4x4.Asanillustration,wecansay,

X₁~ exp (λ₁) ⇒ u₁ = F₁(x₁) = 1 − exp (−λ₁x₁);

X1~expλ1⇒u1=F1x1=1−exp−λ1×1;

X2~gammaαβ⇒u2=F2x2=1Γαγβx2;

X3~logisticab⇒u3=F3x3=11+expx−ab;

X4~PearsonIIIcαβ⇒u4=F4x4=1Γαγβx−c.

5.2.2 Partially Nested Archimedean Copulas (PNAC)

Originally, Joe (1997) proposed the structure of PNAC as an alternative approach for FNAC. PNAC may be considered a composite of EAC and FNAC (Berg and Aas, 2007).Similar to FNAC, PNAC also has d – 1 bivariate copulas that are partially exchangeable. As a simple example, Figure 5.4 illustrates the PNAC structure for four-dimensional random variables: (1) couple the two pairs (u₁, u₂)u1u2 and (u₃, u₄)u3u4 with copula C₃C3 with parameter θ₃θ3 and C₂C2 with parameter θ₂,θ2, respectively, at the first level; and (2) the third copula C₁C1 with parameter θ₁θ1 will be applied to couple C₂C2 and C₃C3 at the second level (Berg and Aas, 2007). Figure 5.4 also shows (1) exchangeability between u₁u1 and u₂u2, as well as between u₃u3 and u₄u4; and (2) four pairs (u₁, u₃), (u₁, u₄), (u₂, u₃)u1u3,u1u4,u2u3, and (u₂, u₄)u2u4 all have copula C₁C1. Furthermore, the same constraints on parameters for FNAC are required to be satisfied for PNAC (Berg and Aas, 2007), i.e., (i) PNAC may be used to model the positively dependent variables, and (ii) the dependence decreases with the increase of nesting levels (i.e., the parameters of a higher level are smaller than those of a lower level).

Figure 5.4 Partially nested Archimedean construction.

Example 5.7 Using the bivariate Frank copula as the building block to derive a four-dimensional PNAC function for the structure given in Figure 5.4.

Solution: As shown in Figure 5.4, (u₁, u₂)u1u2 and (u₃, u₄)u3u4 can be represented through the Frank copula as follows:

C3u1u2θ3=−1θ3ln1−1−e−θ3u11−e−θ3u2e−θ3

C4u3u4θ2=−1θ2ln1−1−e−θ2u31−e−θ2u4e−θ2

Then C₁C1 can be represented through C₃, C₂C3,C2 as follows:

Cu1u2u3u4θ1θ2θ3=C1C3C2θ1=−1θ1ln1−1−e−θ1C31−e−θ1C21−e−θ1=−1θ1ln1−1−1−1−e−θ3u11−e−θ3u21−e−θ3θ1θ31−1−1−e−θ2u31−e−θ2u41−e−θ2θ1θ21−e−θ1

with the parameters: 0 ≤ θ₁ ≤ θ₂, θ₃0≤θ1≤θ2,θ3.

In the same manner for FNAC, random variables {X₁ : i = 1, 2, 3, 4}X1:i=1234 may follow different marginal distributions as u_i = F_i(x_i)ui=Fixi.

5.2.3 General Case

Originating in Joe (1997), the general nested Archimedean copula (GNAC) construction was further developed by Whelan (2004) and Savu and Trede (2006). Savu and Trede (2006) first introduced the notation for arbitrary nesting and the procedure for calculating the d-dimensional probability density function in general. To build a hierarchy of Archimedean copulas, they also applied the notation for the hierarchical Archimedean copula for GNAC. The main idea of the generally nested Archimedean construction is presented in this section (Berg and Aas, 2007).

For the GNAC with L levels, there are n_lnl distinct objects (an object is either a copula or a variable) at each level l. At level l = 1l=1, variables u₁, …, u_du1,…,ud are grouped into n₁n1 exchangeable multivariate Archimedean copulas. These copulas are, in turn, coupled with n₂n2 copula at level l = 2l=2, and so on. Berg and Aas (2007) presented an example of a nine-dimensional copula to explain this structure (Figure 5.5).

Figure 5.5 Hierarchically nested Archimedean copula construction.

Following Figure 5.5, the nine-dimensional copula can be written as

C(u₁, …, u₉) = C₁₁(C₂₁(C₃₁(C₄₁(u₁, u₂), u₃, u₄), u₅, u₆), C₃₂(u₇, C₄₂(u₈, u₉)))

Cu1…u9=C11C21C31C41u1u2u3u4u5u6C32u7C42u8u9.(5.10)

At the first level, there are two two-dimensional EACs, i.e., C₄₁(u₁, u₂)C41u1u2 with parameter θ₄₁θ41 and C₄₂(u₈, u₉)C42u8u9 with parameter θ₄₂θ42. There are one three-dimensional and one two-dimensional EACs at the second level, i.e., C₃₁(C₄₁, u₃, u₄)C31C41u3u4 with parameter θ₃₁θ31 and C₃₂(u₇, C₄₂)C32u7C42 with parameter θ₃₂θ32. At the third level, there is only one copula, C₂₁(C₃₁, u₅, u₆)C21C31u5u6 with parameter θ₂₁θ21. At the top (fourth) level, the copula C₁₁C11, with parameter θ₁₁θ11, is applied to model the dependence between C₂₁C21 and C₃₂C32.

To ensure that GNAC is a valid Archimedean copula, there are a number of conditions that need to be satisfied (Savu and Trede, 2006; Berg and Aas, 2007):

a. The number of copulas must decrease with the increasing level of nesting. The top level may contain only one copula, and the inverse of the generating functions (ϕ⁻¹ϕ−1) must be completely monotonic.
b. The dependence of GNAC must decrease with the increasing level of nesting. For example, in Figure 5.5, parameters must be stratified following the condition θ₄₁ ≥ θ₃₂ ≥ θ₂₁ ≥ θ₁₁θ41≥θ32≥θ21≥θ11 and θ₄₂ ≥ θ₃₂ ≥ θ₁₁θ42≥θ32≥θ11. However, when mixing copula generators that belong to different Archimedean copula families, this requirement might not be sufficient. Two Archimedean copulas from different families (i.e., Fam1 and Fam2) can only be nested if the derivative of the product ϕ1∘ϕ2−1 is completely monotonic. Joe (1997) presented details about copula families that can be mixed and explored structures where all the generators are from the same family are explored, and the other structures are still not fully explored.

5.2.4 Parameter Estimation for Nested Copulas

For NAC with an explicit density expression, the maximum likelihood estimation method is commonly applied to estimate the copula parameters; however, the NAC density function may not be straightforwardly derived. Savu and Trede (2006) proposed a recursive approach to derive the density function for general NAC. With this approach, the number of computational steps for evaluating the density increases rapidly with the copula complexity, and parameter estimation becomes very time consuming in higher dimensions (Savu and Trede, 2006; Berg and Aas, 2007).

The density function of NAC can be derived using the chain rule as discussed by Savu and Trede (2006). We will use the following examples to illustrate the general procedure on how to apply the chain rule. Furthermore, we derive the density functions for the M3, M4, M5, M6, and M12 copulas (Joe, 1997) in the appendix as specific examples.

Example 5.8 Derive the density function for three-dimensional FNAC (Equation (5.4) corresponding to Figure 5.2).

Solution: Equation (5.4) may be rewritten as follows:

C(u₁, u₂, u₃) = C₁(C₂(u₁, u₂), u₃)Cu1u2u3=C1C2u1u2u3 and its density, i.e., c(u₁, u₂, u₃)cu1u2u3, may be derived as follows:

∂Cu1u2u3∂u1=∂C1C2u1u2u3∂u1=∂C1∂C2∂C2∂u1;∂C2u1u2.u3∂u1∂u2=∂2C1∂C22∂C2∂u2∂C2∂u1+∂2C1∂C2∂2C2∂u1∂u2

Finally, we have the following:

cu1u2u3=∂3Cu1u2u3∂u1∂u2∂u3=∂3C1∂C22∂u3∂C2∂u2∂C2∂u1+∂2C1∂C2∂u3∂2C2∂u1∂u2

Example 5.9 Derive the density function for four-dimensional FNAC (i.e., Equation (5.1) corresponding to Figure 5.1).

Solution: Following Equation (5.1) and Figure 5.1, we have the following:

C(u₁, u₂, u₃, u₄) = C₁(u₄, C₂) = C₁(u₄, C₂(u₃, C₃(u₁, u₂)))Cu1u2u3u4=C1u4C2=C1u4C2u3C3u1u2 and its density c(u₁, u₂, u₃, u₄)cu1u2u3u4 may be derived as follows:

∂Cu1u2u3u4∂u1=∂C1u4C2u3C3u1u2∂u1=∂C1∂C2∂C2∂C3∂C3∂u1∂2Cu1u2u3u4∂u1∂u2=∂2C1∂C22∂C2∂C32∂C3∂u1∂C3∂u2+∂C1∂C2∂2C2∂C32∂C3∂u1∂C3∂u2+∂C1∂C2∂C2∂C3∂2C3∂u1∂u2∂3Cu1u2u3u4∂u1∂u2∂u3=∂3C1∂C22∂C2∂C32∂C2∂u3∂C3∂u1∂C3∂u2+2∂2C1∂C22∂C2∂C3∂2C2∂C3∂u3∂C3∂u1∂C3∂u2=∂2C1∂C22∂C2∂u3∂2C2∂C32∂C3∂u1∂C3∂u2=∂C1∂C2∂3C2∂C32∂u3∂C3∂u1∂C3∂u2+∂2C1∂C22∂C2∂u3∂C2∂C3∂2C3∂u1∂u2+∂C1∂C2∂2C2∂C3∂u3∂2C3∂u1∂u2

Finally, we have the following:

cu1u2u3u4=∂4Cu1u2u3u4∂u1∂u2∂u3∂u4=∂4C1∂C23∂u4∂C2∂C32∂C2∂u3∂C3∂u1∂C3∂u2+2∂3C1∂C22∂u4∂C2∂C3∂2C2∂C3∂u3∂C3∂u1∂C3∂u2+∂3C1∂C22∂u4∂C2∂u3∂2C2∂C32∂C3∂u1∂C3∂u2+∂2C1∂C2∂u4∂3C2∂C32∂u3∂C3∂u1∂C3∂u2+∂3C1∂C22∂u4∂C2∂u3∂C2∂C3∂2C3∂u1∂u2+∂2C1∂C2∂u4∂2C2∂C3∂u3∂2C3∂u1∂u2

Example 5.10 Derive the density function for the copula function represented by Figure 5.4.

Solution: According to Figure 5.4, we have the following: C(u₁, u₂, u₃, u₄) = C₁(C₃(u₁, u₂), C₂(u₃, u₄)).Cu1u2u3u4=C1C3u1u2C2u3u4. Then its density function c(u₁, u₂, u₃, u₄)cu1u2u3u4 may be expressed as follows:

∂Cu1u2u3u4=∂C1∂C3∂C3∂u1

∂2Cu1u2u3u4∂u1∂u2=∂2C1∂C32∂C3∂u2∂C3∂u1+∂C1∂C3∂2C3∂u1∂u2

∂3Cu1u2u3u4∂u1∂u2∂u3=∂3C1∂C32∂C2∂C2∂u3∂C3∂u2∂C3∂u1+∂2C1∂C3∂C2∂C2∂u3∂2C3∂u1∂u2

Finally, we have the following:

cu1u2u3u4=∂4Cu1u2u3u4∂u1∂u2∂u3∂u4=∂4C1∂C32∂C22∂C2∂u4∂C2∂u3∂C3∂u2∂C3∂u1+∂3C1∂C32∂C2∂2C2∂u3∂u4∂C3∂u2∂C3∂u1+∂3C1∂C3∂C22∂C2∂u4∂C2∂u3∂2C3∂u1∂u2+∂2C1∂C3∂C2∂2C2∂u3∂u4∂2C3∂u1∂u2

With the copula density function derived, we can then apply MLE to estimate parameters simultaneously with the constraints of parameters at a lower level being larger than those at a higher level. However, the copula parameters may also be estimated sequentially with the use of MLE as follows:

i. Estimate the copula parameter at the lowest level.
ii. Estimate the copula parameter for the second-lowest level by fixing the parameters estimated for the lowest level.
iii. Repeat the preceding steps until we reach the top level of the NAC structure.

5.2.5 Simulation for Nested Copulas

In the previous chapters, we have shown that EAC may be simulated with several methods, such as Laplace transform (LT) and CPI Rosenblatt’s transform, and through its unique generating function ϕϕ with a simple algorithm. Frees and Valdez (1998) showed how to use the LT method to simulate NACs for the generators taken from either the Gumbel– Hougaard or the Clayton copula family. However, Berg and Aas (2007) have pointed out that the LT method is limited to the copulas such that we can find a distribution that equals the LT of the inverse generating function and from which we can easily sample. In most cases, the LT method needs to obtain the d – 1 first derivatives of the copula function, which usually yield extremely complex expressions under higher-order derivatives. The limitation of LT method may cause the simulation to become inefficient for high dimensions (Berg and Aas, 2007).

Compared to the LT method, the CPI Rosenblatt transform method is more universal and will be introduced to simulate from NAC. Let X = {X₁, X₂, …, X_d}X=X1X2…Xd be a d-dimensional random vector with marginal distributions F(x_i)Fxi and conditional distributions F(x_i| x₁, …, x_i − 1), i = 1, …, dFxix1…xi−1,i=1,…,d. The CPI Rosenblatt’s transform of X is defined as T(X) = {T(X₁), …, T(X_d)}TX=TX1…TXd:

T(X₁) = F₁(x₁), T(X₂) = F_2 ∣ 1(x₂| x₁), …, T(X_d) = F_{d ∣ 1, 2, …, d − 1}(x_d| x₁, x₂, …, x_d − 1)

TX1=F1x1,TX2=F2∣1x2x1,…,TXd=Fd∣1,2,…,d−1xdx1x2…xd−1.(5.11)

With the use of CPI method, random variables are simulated with the following procedure:

i. Generate W = {w₁, w₂, …, w_d}W=w1w2…wd independent random variables following the uniform distribution [0, 1].
ii. Set x₁ = w₁x1=w1.
iii. Set w₂ = T(X₂) = F_2 ∣ 1(x₂| x₁)w2=TX2=F2∣1x2x1 to obtain x2=F2∣1−1w2x1.
iv. Set w₃ = T(X₃) = F_{3 ∣ 1, 2}(w₃| x₁, x₂)w3=TX3=F3∣1,2w3x1x2 to obtain x3=F3∣1,2−1w3x1x2.

…

Set w_d = T(X_d) = F_{d ∣ 1, 2, …d − 1}(w_d| x₁, x₂, …, x_d)wd=TXd=Fd∣1,2,…d−1wdx1x2…xd.

Example 5.11 Assuming the pseudo-observations given in Table 5.1 may be modeled with the M6 copula, (1) estimate the copula parameters both simultaneously and sequentially using MLE; and (2) simulate the random variables with a sample size of 50.

Table 5.1. Trivariate pseudo-observations.

	u₁u1	u₂u2	u₃u3
1	0.241	0.138	0.103
2	0.241	0.172	0.172
3	0.241	0.241	0.276
4	0.241	0.586	0.655
5	0.793	0.828	0.897
6	0.483	0.345	0.379
7	0.931	0.914	0.621
8	0.724	0.759	0.724
9	0.414	0.621	0.586
10	0.759	0.414	0.310
11	0.862	0.793	0.793
12	0.655	0.517	0.448
13	0.414	0.379	0.552
14	0.569	0.448	0.414
15	0.569	0.690	0.690
16	0.414	0.310	0.241
17	0.241	0.552	0.862
18	0.069	0.035	0.035
19	0.241	0.276	0.345
20	0.069	0.069	0.069
21	0.897	0.914	0.931
22	0.655	0.655	0.483
23	0.069	0.103	0.138
24	0.241	0.207	0.207
25	0.655	0.724	0.759
26	0.517	0.483	0.517
27	0.828	0.862	0.828
28	0.966	0.966	0.966

Solution: Estimate the copula parameters.

To estimate the parameters for the fitted M6 copula, we use Figure 5.2 as the FNAC scheme.

Estimate the copula parameters simultaneously.

To estimate the copula parameters simultaneously, the copula density function (i.e., Equation (M6–3) in the appendix) is applied to write the log-likelihood function as follows:

logL=∑1u1u2u3−lnu1θ2−1−lnu2θ2−1−lnu3θ3−1e−w1θ1(G2θ1θ2−2w3θ1−3+2θ1−2w2θ1−2+θ2−θ1Gθ1θ2−2w2θ1−2+θ1−12θ1−1G2θ1θ2−2w1θ1−3+θ1−1G2θ1θ2−2w2θ1−3+θ1−1θ2−θ1Gθ1θ2−2w1θ1−2)
where G =−lnu1θ2+−lnu2θ2;w=−lnu3θ1+[−lnu1θ2+−lnu2)θ1θ1θ2.

The parameter constraint is given as 1 ≤ θ₁ ≤ θ₂1≤θ1≤θ2, where θ₂θ2 corresponds to the parameters for the first level.

Maximizing the log-likelihood function numerically (e.g., using genetic algorithm ga function in MATLAB), the parameters are estimated as follows:

θ₂ = 4.4158; θ₁ = 3.3532
θ2=4.4158;θ1=3.3532.

It is worth noting that to properly estimate the parameters simultaneously, the linear constraint needs to be applied with vector A = [–1,1] B = 0, which represents –θ₂ + θ₁ ≤ 0–θ2+θ1≤0.
Estimate the copula parameters sequentially.

To estimate the copula parameters sequentially, the density function for the bivariate Gumbel–Hougaard copula is applied (Chapter 4).
- Step 1: Maximizing the log-likelihood function for (u₁, u₂)u1u2, we have θ₂ = 4.4682θ2=4.4682.
- Step 2: Compute C(u₁, u₂; θ₂ = 4.4682C(u1,u2;θ2=4.4682) and estimate the parameter for
- (u₃, C(u₁, u₂; θ₂ = 4.4682))u3Cu1u2θ2=4.4682. Again using MLE, we have θ₁ = 3.2088θ1=3.2088. It is worth noting that to estimate the parameter (i.e., the Gumbel–Hougaard copula) for the top level, the lower and upper bounds are [1, θ₂]1θ2.
- Finally, for both simultaneous and sequential estimation, the parameters estimated are coded as follows:
- param = [param(1), param(2)] = [θ₂, θ₁]param=param1param2=θ2θ1; param(1) and param(2) represents bottom and top levels, respectively.
Simulation from the fitted M6 copula.

As discussed previously, the random variates are simulated using the CPI Rosenblatt transform, as shown in Figure 5.6(a).

In addition, we have discussed previously that [u₁, u₃]u1u3 and [u₂, u₃]u2u3 may be modeled with the Gumbel–Hougaard copula with parameter θ₁θ1. Figure 5.6(b) compares the simulation as well as the box plot of simulated and sample Kendall’s tau (100 simulations with a sample size of 28).

Figure 5.6 (a) Comparison of pseudo-observations with those simulated from M6 copula; (b) simulation comparison from the Gumbel–Hougaard copula with parameter θ₁θ1 for (u₁, u₃), (u₂, u₃)u1u3,u2u3 directly; (c) comparison of sample Kendall’s tau with simulated Kendall’s tau from Gumbel–Hougaard copula with parameter θ = 2.8816θ=2.8816.

Example 5.12 Assuming the Gumbel–Hougaard copula may be applied as a biviarate building block, and using the scheme shown in Figure 5.4 and the pseudo-observations listed in Table 5.2, (1) estimate the copula parameters; and (2) simulate random variates with fitted copula for a sample size of 100.

Table 5.2. Pseudo-observations for Example 5.12.

	u₁u1	u₂u2	u₃u3	u₄u4
1	0.194	0.338	0.421	0.545
2	0.819	0.901	0.743	0.705
3	0.614	0.639	0.615	0.662
4	0.235	0.208	0.298	0.292
5	0.792	0.755	0.865	0.894
6	0.433	0.517	0.559	0.480
7	0.130	0.197	0.095	0.087
8	0.570	0.583	0.802	0.680
9	0.128	0.274	0.256	0.137
10	0.218	0.116	0.262	0.481
11	0.468	0.367	0.367	0.439
12	0.490	0.434	0.391	0.515
13	0.194	0.083	0.019	0.042
14	0.120	0.227	0.178	0.289
15	0.676	0.601	0.759	0.673
16	0.990	0.990	0.991	0.993
17	0.657	0.777	0.942	0.950
18	0.226	0.174	0.284	0.134
19	0.828	0.857	0.836	0.916
20	0.373	0.367	0.151	0.249
21	0.698	0.656	0.727	0.584
22	0.645	0.738	0.641	0.787
23	0.025	0.051	0.034	0.199
24	0.298	0.300	0.470	0.394
25	0.906	0.936	0.955	0.950
26	0.658	0.476	0.556	0.647
27	0.302	0.158	0.224	0.105
28	0.581	0.393	0.733	0.779
29	0.371	0.433	0.179	0.145
30	0.169	0.537	0.213	0.344
31	0.041	0.083	0.009	0.059
32	0.982	0.978	0.928	0.935
33	0.585	0.162	0.326	0.312
34	0.618	0.753	0.661	0.633
35	0.280	0.622	0.400	0.574
36	0.902	0.969	0.879	0.904
37	0.440	0.648	0.587	0.811
38	0.243	0.147	0.281	0.524
39	0.044	0.081	0.177	0.052
40	0.122	0.149	0.229	0.180
41	0.497	0.645	0.528	0.545
42	0.701	0.644	0.745	0.599
43	0.323	0.538	0.806	0.796
44	0.013	0.044	0.063	0.041
45	0.651	0.721	0.774	0.646
46	0.190	0.298	0.773	0.841
47	0.520	0.772	0.636	0.542
48	0.926	0.943	0.900	0.812
49	0.468	0.447	0.518	0.633
50	0.868	0.894	0.893	0.905
51	0.422	0.710	0.727	0.560
52	0.888	0.835	0.868	0.823
53	0.372	0.590	0.734	0.792
54	0.132	0.116	0.095	0.041
55	0.429	0.288	0.219	0.125
56	0.390	0.366	0.375	0.172
57	0.983	0.986	0.991	0.990
58	0.980	0.988	0.976	0.974
59	0.308	0.318	0.147	0.193
60	0.932	0.913	0.943	0.933

Solution:

1. Estimate the copula parameters.

According to Figure 5.4, let us use θ₁₂, θ₃₄θ12,θ34 to represent the copula parameters of [u₁, u₂], [u₃, u₄]u1u2,u3u4 at the bottom level and θθ to represent the copula parameter at the top level.
- Estimate the parameters simultaneously.
  
  Given the Gumbel–Hougaard copula as a bivariate building block, the copula density function for the four-dimensional PNAC Gumbel–Hougaard copula may be derived based on the chain rule following the procedure given in Example 5.10. With the parameter constraints 1 ≤ θ ≤ θ₁₂, θ₃₄1≤θ≤θ12,θ34, i.e.,
  
  θ−θ12≤0θ−θ34≤0, the inequality vector is then given as A=−1,0,10,−1,1,B=00, with the parameter set as param = [θ₁₂, θ₃₄, θ]param=θ12θ34θ.
  
  The parameters can be estimated numerically by maximizing the log-likelihood function with the preceding linear constraint as follows:
  
  θ₁₂ = 3.6949, θ₃₄ = 4.5035, θ = 2.8816
  θ12=3.6949,θ34=4.5035,θ=2.8816.
- Estimate the parameters sequentially.
  
  With the same estimation procedures shown in Example 5.11:
  
  The parameter for (u₁, u₂)u1u2 is estimated as θ₁₂ = 3.8545θ12=3.8545.
  
  The parameter for (u₃, u₄)u3u4 is estimated as θ₃₄ = 4.3949θ34=4.3949.
  
  The parameter for {C₃(u₁, u₂; θ₁₂), C₂(u₃, u₄; θ₃₄)}C3u1u2θ12C2u3u4θ34 is estimated by fixing θ₁₂, θ₃₄θ12,θ34 as θ = 3.3297θ=3.3297.
2. Simulate random variates.

Using the CPI Rosenblatt transform, Figure 5.7(a) compares the pseudo-observations in Table 5.2 with those simulated from the fitted PNAC Gumbel–Hougaard copula function.

Figure 5.7 (a) Comparison of pseudo-observations with those simulated with the parameters estimated simultaneously (θ₁₂ = 3.6949, θ₃₄ = 4.5035, θ = 2.8816θ12=3.6949,θ34=4.5035,θ=2.8816); (b) comparison of observed variables with simulated variables with θ = 2.8816θ=2.8816; (c) comparison of sample Kendall’s tau with the simulated Kendall’s taus.

As discussed previously for the PNAC structure, we know (u₁, u₃), (u₁, u₄), (u₂, u₃), (u₂, u₄)u1u3,u1u4,u2u3,u2u4 should have the same joint distribution that may be modeled using the Gumbel–Hougaard copula with parameter at the top level, i.e., θ = 2.8816θ=2.8816 with the comparison of simulated random variable and Kendall’s tau as shown in Figure 5.7(b) and 5.7(c). Figure 5.7(b) and 5.7(c) indicates that the preceding four pairs may be modeled using the same Gumbel–Hougaard copula.

5.3 Pair-Copula Construction (PCC)

PCCs are also hierarchical in nature. Compared to EAC and NAC, a large improvement is made in PCCs that allows for the free specification of dd−12 copulas. The modeling scheme of PCCs is based on the decomposition of a multivariate density function. The d-dimensional probability density function may be decomposed to dd−12 bivariate density functions, where the first d − 1d−1 density functions are unconditional and the rest are conditional (Berg and Aas, 2007). First proposed by Joe (1997), there are two main types of PCCs, canonical (C)-vines and D-vines, in the literature (e.g., Bedford and Cooke, 2001, 2002; Kurowicka and Cooke, 2004, 2006; Aas et al., 2009).

5.3.1 Principle of Pair-Copula Decomposition of General Multivariate Distribution

Following Aas et al. (2009), we introduce the pair-copula decomposition of general multivariate distributions.

Let X = (X₁, X₂, …, X_d)X=X1X2…Xd be a vector of random variables with a joint density function f(x₁, …, x_d)fx1…xd. According to the conditional probability theory, the joint density function can be defined as follows:

f(x₁, x₂, …, x_d) = f(x₁)f(x₂| x₁)⋯f(x_d| x₁, …, x_d − 1)

fx1x2…xd=fx1fx2x1⋯fxdx1…xd−1(5.12)

In Chapters 3 and 4, the multivariate distribution F with marginals F₁(x₁), …, F_d(x_d)F1x1,…,Fdxd is defined using Sklar’s theorem as follows:

Fx1…xd=CF1x1…FdxdorCu1…ud=FF1−1×1…Fd−1xd

(5.13)

where u_i = F_i(x_i)ui=Fixi; Fi−1ui is the inverse distribution of marginal F_i(x_i)Fixi.

Then, for an absolutely continuous F with strictly increasing, continuous marginal probability densities f₁(x₁), …, f_d(x_d)f1x1,…,fdxd, applying ∂d∂x1…∂xd to Equation (5.13), we have

fx1…xd=∂d∂F1x1…∂FdxdCF1x1…Fdxd∂F1x1∂x1…∂Fdxd∂xd

(5.14a)

fx1…xd=c1,2,…,dF1x1…Fdxd∏i=1dfixi

(5.14b)

where c_{1, 2, …, d}(⋅)c1,2,…,d⋅ stands for the d-dimensional copula density function.

In the bivariate case, Equation (5.14b) can be simplified to

f(x₁, x₂) = c₁₂(F₁(x₁), F₂(x₂))f₁(x₁)f₂(x₂)

fx1x2=c12F1x1F2x2f1x1f2x2(5.15)

where c₁₂(⋅)c12⋅ is the appropriate pair-copula density.

Using the conditional probability in Equation (5.12), the conditional probability density function can be easily written as

fx1x2=fx1x2f2x2=c12F1x1F2x2f1x1f2x2f2x2=c12F1x1F2x2f1x1

(5.16)

Likewise, we have

f(x_d − 1| x_d) = c_{d − 1, d}(F_d − 1(x_d − 1), F_d(x_d))f_d − 1(x_d − 1)

fxd−1xd=cd−1,dFd−1xd−1Fdxdfd−1xd−1.(5.17)

Similarly, in the trivariate case, we can obtain the conditional probability density function:

fx1x2x3=fx1x2x3fx2x3=fx3fx1x2x3fx3fx2x3=fx1x2x3fx2x3

(5.18)

According to the definition of conditional copula, we have

fx1x2x3=∂2Fx1x2x3∂x1∂x2=∂2∂x1∂x2C12∣3F1∣3x1x3F2∣3x2x3=∂2C12∣3F1∣3x1x3F2∣3x2x3∂F1∣3x1x3∂F2∣3x2x3∂F1∣3x1x3∂x1∂F2∣3x2x3∂x2=c12∣3F1∣3x1x3F2∣3x2x3f1∣3x1x3f2∣3x2x3

(5.19)

Thus,

fx1x2x3=fx1x2x3fx2x3=c12∣3F1∣3x1x3F2∣3x2x3fx1x3fx2x3fx2x3=c12∣3F1∣3F2∣3f1∣3

(5.20)

Alternatively, f(x₁| x₂, x₃)fx1x2x3 may be also written as follows:

f(x₁| x₂, x₃) = c_13 ∣ 2(F_1 ∣ 2, F_3 ∣ 2)f_1 ∣ 2

fx1x2x3=c13∣2F1∣2F3∣2f1∣2(5.21)

Equations (5.20) and (5.21) can be further decomposed as follows:

f(x₁| x₂, x₃) = c_13 ∣ 2(F_1 ∣ 3, F_2 ∣ 3)c₁₃(F₁, F₃)f(x₁)

fx1x2x3=c13∣2F1∣3F2∣3c13F1F3fx1(5.22a)

f(x₁| x₂, x₃) = c_13 ∣ 2(F_1 ∣ 2, F_3 ∣ 2)c₁₂(F₁, F₂)f(x₁)

fx1x2x3=c13∣2F1∣2F3∣2c12F1F2fx1(5.22b)

From the expression of the appropriate pair-copula, a conditional marginal density function can be expressed in a general form as follows:

f(x| v) = c_{xv_j ∣ v_−j}(F(x| v_−j), F(v_j| v_−j))f(x| v_−j)

fxv=cxvj∣v−jFxv−jFvjv−jfxv−j(5.23)

where v is a d-dimensional vector; v_jvj is one arbitrarily chosen component of v; and v_−jv−j denotes the v vector except v_jvj, i.e., v_−j = v\v_jv−j=v\v_j.

Under appropriate conditions, a multivariate probability density function may be expressed through the product of pair-copulas, acting on several different conditional probability distributions (Aas et al., 2009).

Joe (1997) showed a conditional marginal distribution for the appropriate pair-copula for every j as

Fxv=∂Cx,vj∣v−jFxv−jFvjv−j∂Fvjv−j

(5.24)

where C_{x, v_j ∣ v_−j}Cx,vj∣v−j is a bivariate copula function with the conditional marginals. For the special case where v is univariate, Equation (5.24) can be rewritten as follows:

Fxv=∂Cx,vFXxFVv∂FVv

(5.25)

In Equation (5.25), when xx and vv are copula random variables (i.e., the margins following the uniform [0,1] as f(x) = f(v) = 1, F_X(x) = x, F_V(v) = vfx=fv=1,FXx=x,FVv=v), Equation (5.25) can be rewritten as follows:

hxvΘ=Fxv=∂Cx,vxvΘ∂v

(5.26)

where the second variable of h(⋅)h⋅ function represents the conditional variable, and ΘΘ denotes the set of copula parameters to model the joint distribution function of xx and vv. Letting u = xu=x, Equation (5.26) is essentially the conditional copula function of C(u| V = v; Θ)CuV=vΘ.

Example 5.13 Derive the hh function for the bivariate Gumbel–Hougaard copula.

Solution: As seen in the previous chapters, the bivariate Gumbel–Hougaard copula can be written as follows:

Cu1u2θ=e−(−lnu1θ+−lnu2)θ1θ

Then the hh function, i.e., h(u₁, u₂, θ)hu1u2θ, can be expressed as follows:

hu1u2θ=Fu1U2=u2θ=∂Cu1u2θ∂u2=Cu1u2u2−lnu2θ−1−lnu1θ+−lnu2θ^ˆ1θ−1=e−−lnu1θ+−lnu2θ1θ−lnu2θ−1−lnu1θ+−lnu2θ1θ−1u2

5.3.2 Vines

High-dimensional distributions have a significant number of possible pair-copula constructions. The regular vine, introduced by Bedford and Cooke (2001, 2002), is used to organize the general structure and embrace a large number of possible pair-copula decompositions. Two special types of regular vines, the C-vine and the D-vine (Kurowicka and Cooke, 2004), are given in the form of a nested set of trees and are used to decompose the multivariate density function. Figure 5.8 shows one sample specification corresponding to a five-dimensional D-vine that can be explained with Table 5.3.

Figure 5.8 A D-vine with five variables, four trees, and 10 edges.

Table 5.3. Five-dimensional D-vine.

Tree T_jTj	Nodes	Edges
T₁T1	1, 2, 3, 4, 5	12, 23, 34, 45
T₂T2	12, 23, 34, 45	13\|2, 24\|3, 35\|4
T₃T3	13\|2, 24\|3, 35\|4	14\|23, 25\|34
T₄T4	14\|23, 25\|34	15\|234

In Figure 5.8 and Table 5.3, each edge represents a pair-copula density, and the edge label corresponds to the subscript of the pair-copula density. For example, 14|23 corresponds to the copula density c_14 ∣ 23(C_13 ∣ 2, C_24 ∣ 3)c14∣23C13∣2C24∣3. The entire decomposition is defined by dd−12=55−12=10 edges as well as the density functions of random variables.

The density function of random variable X = {X₁, X₂, …, X_d}X=X1X2…Xd with a D-vine copula can be written as

fx1…xd=∏k=1dfxk∏j=1d−1∏i=1d−jci,i+j∣j+1,…,i+j−1Fxixi+1…xi+j−1Fxi+jxi+1…xi+j−1

(5.27)

where index jj identifies the trees, and ii identifies the edges in each tree.

A sample of C-vine with five variables is given in Figure 5.9. The meanings of symbols are the same as in Figure 5.8. We can see that each tree T_jTj has a unique node connecting to d − jd−j edges in tree T_jTj. For example, node 1 of tree T₁T1 is connected to nodes 2, 3, 4, and 5 and forms the edges 12, 13, 14, and 15. Similarly, node 12 of T₂T2 is connected to nodes 13, 14, and 15 and forms the edges 23|1, 24|1 and 25|1.

Figure 5.9 A C-vine with five variables, four trees, and 10 edges.

In general, the d-dimensional density function corresponding to a C-vine is defined as

fx1…xd=∏k=1dfxk∏j=1d−1∏i=1d−jcj,i+j∣1,…,j−1Fxjx1…xj−1Fxi+jx1…xj−1

(5.28)

Looking at Figures 5.8 and 5.9, it is seen that the D-vine is more flexible than the C-vine. However, the C-vine might be advantageous if a particular variable is known to be the key variable governing interactions among the variables. In such a situation, one may decide to locate this variable at the root of the C-vine.

Following Aas et al. (2009), we present several typical pair-copulas.

Three Variables

For three-dimensional variables, there should be a total of six different pair-copula decompositions, including three D-vines and three C-vines. However, for three-dimensional variables, the D-Vine and C-vine are exactly the same, i.e., there are three different decompositions whose structures are both canonical vine and D-vine, as shown in Figure 5.10.

Figure 5.10 Decomposition schemes for three-dimensional variables using vines.

According to the decomposition schemes in Figure 5.10 and using Figure 5.10(a) as an example, the probability density function for both C-vine and D-vine structures can be written for three-dimensional random variables as

fx1x2x3=∏i=13fixic12F1x1F2x2c23F2x2F3x3C13∣2F1∣2x1x2F3∣2x3x2

(5.29)

where f₁, f₂, f₃f1,f2,f3 and F₁, F₂, F₃F1,F2,F3 represent the univariate PDF and CDF for variables x₁, x₂, x₃x1,x2,x3, respectively.

Four Variables

For four-dimensional variables, we can construct a total of 24 different pair-copula decompositions, including 12 D-vines and 12 C-vines, as shown in Figure 5.11 (examples for one D-vine and one C-vine construction). Following the scheme, one may easily construct the rest D-vine and C-vine structures for four-dimensional variables.

Figure 5.11 Vines for four-dimensional variables: (a) D-vine; (b) C-vine).

According to Figure 5.11(a), the four-dimensional D-vine structure can be expressed as

fx1x2x3x4=∏i=14fixi⋅c12F1x1F2x2⋅c23F2x2F3x3⋅c34F3x3F4x4⋅c13∣2F1∣2x1x2F3∣2x3x2⋅c24∣3F2∣3x2x3F4∣3x4x3⋅c14∣23F1∣23x1x2x3F4∣23x4x2x3

(5.30)

and according to Figure 5.11(b), the four-dimensional C-vine structure can be expressed as follows:

fx1x2x3x4=∏i=14fixi⋅c12F1x1F2x2⋅c13F1x1F3x3⋅c14F1x1F4x4⋅c23∣1F2∣1x2x1F3∣1x3x1⋅c24∣1F2∣1x2x1F4∣1x4x1⋅c34∣12F3∣12x3x1x2F4∣12x4x1x2

(5.31)

Five Variables

For five-dimensional variables, there are 240 different possible pair-copula decompositions, including 60 C-vines (Figure 5.8, for example), 60 D-vines (Figure 5.9 is an example), and 120 other regular vine decompositions (Aas et al., 2009; shown in Figure 5.12 with two examples)

Figure 5.12 Two regular-vine examples for five-dimensional variables.

According to Figure 5.8, the general expression for the five-dimensional D-vine structure can be given as follows:

fx1x2x3x4x5=f1x1f2x2f3x3f4f4f5x5c12F1x1F2x2⋅c23F2x2F3x3c34F3x3F4x4c45F4x4F5x5⋅c13∣2F1∣2x1x2F3∣2x3x2c24∣3F2∣3x2x3F4∣3x4x3⋅c35∣4F3∣4x3x4F5∣4x5x4c14∣23F1∣23x1x2x3F4∣23x4x2x3⋅c25∣34F2∣34x2x3x4F5∣34x5x3x4⋅c15∣234F1∣234x1x2x3x4F5∣234x5x2x3x4

(5.32)

According to Figure 5.9, the general expression for the five-dimensional C-vine structure can be given as

fx1x2x3x4x5=f1x1f2x2f3x3f4x4f5x5c12F1x1F2x2c13F1x1F3x3⋅c14F1x1F4x4c15F1x1F5x5c23∣1F2∣1x2x1F3∣1x3x1⋅c24∣1F2∣1x2x1F4∣1x4x1c25∣1F2∣1x2x1F5∣1x5x1⋅c34∣12F3∣12x3x1x2F4∣12x4x1x2c35∣12F3∣12x3x1x2F5∣12x5x1x2⋅c45∣123F4∣123x4x1x2x3F5∣123x5x1x2x3

(5.33)

According to Figure 5.12(a), the density function for a five-dimensional regular vine structure can be expressed as follows:

fx1x2x3x4x5=f1x1f2x2f3x3f4x4f5x5c12F1x1F2x2c25F2x2F5x5⋅c23F2x2F3x3c34F3x3F4x4,c15∣2F1∣2x1x2F5∣2x5x2⋅c13∣2F1∣2x1x2F3∣2x3x2c24∣3F2∣3x2x3F4∣3x4x3⋅c35∣12F3∣12x3x1x2F5∣12x5x1x2c14∣23F1∣23x1x2x3F4∣23x4x2x3⋅c45∣123F4∣123x4x1x2x3F5∣123x5x1x2x3

(5.34a)

According to Figure 5.12(b), the density function for the five-dimensional regular vine can be expressed as follows:

fx1x2x3x4x5=f1x1f2x2f3x3f4x4f5x5c12F1x1F2x2c23F2x2F3x3⋅c24F2x2F4x3c45F4x4F5x5,c13∣2F1∣2x1x2F3∣2x3x2⋅c14∣2F1∣2x3x2F4∣2x4x2c25∣4F2∣4x2x4F5∣4x5x4⋅c34∣12F3∣12x3x1x2F4∣12x5x1x2c15∣24F1∣24x1x2x4F5∣24x5x2x4⋅c35∣124F3∣124x3x1x2x4F5∣124x5x1x2x4

(5.34b)

d-Dimensional Variables

For a d-dimensional D-vine, Aas et al. (2009) concluded that there are d!d! possible ways of ordering the variables in tree T₁T1. But only d ! /2d!/2 are different trees on the first level. Given such a tree T₁T1, trees T₁, T₂, …, T_d − 1T1,T2,…,Td−1 are completely determined. This implies that the number of distinct D-vines on d nodes is given by d ! /2d!/2. For a d-dimensional C-vine, there are also d ! /2d!/2 distinctive vine structures.

5.3.3 Conditional Independence and the Pair-Copula Decomposition

First, let us consider the three-dimensional case in Equation (5.29). If X₁X1 and X₃X3 are independent, conditioned on random variable X₂X2, i.e., c_13 ∣ 2(F_1 ∣ 2(x₁| x₂), F_3 ∣ 2(x₃| x₂)) = 1c13∣2F1∣2x1x2F3∣2x3x2=1, the density function in Equation (5.29) can be simplified as

f(x₁, x₂, x₃) = f₁(x₁)f₂(x₂)f₃(x₃)c₁₂(F₁(x₁), F₂(x₂))c₂₃(F₂(x₂), F₃(x₃))

fx1x2x3=f1x1f2x2f3x3c12F1x1F2x2c23F2x2F3x3(5.35)

Equation (5.35) indicates that the number of levels reduces to one with the assumption of conditional independence imposed for the three-dimensional variable.

Similarly, if X and Y are independent conditioned on any vector v, we have the following:

c_xy ∣ v(F(x| v), F(y| v)) = 1

cxy∣vFxvFyv=1(5.36)

5.3.4 Simulation from Vine Copulas

As discussed previously in Section 5.2.5, the CPI Rosenblatt transformation is commonly applied for the simulation (or sampling) from vine copulas. The conditional probability of the jth variable conditioned on the previous j–1 variables, i.e., F(x_j| x₁, …, x_j − 1)Fxjx1…xj−1, can be written using Equations (5.37) and (5.38) for C-vine and D-vine copulas, respectively, as follows.

For the C-vine copula, the conditional probability is

Fxjx1…xj−1=∂Cj,j−1∣1,…,j−2Fxjx1…xj−2Fxj−1×1…xj−2∂Fxj−1×1…xj−2

(5.37)

For the D-vine copula structure, we use

Fxjx1…xj−1=∂Cj,1∣2,…,j−1(Fxjx2…xj−1,Fx1x2…xj−1)∂Fx1x2…xj−1

(5.38)

Here, we give the simulation procedure of the C-vine and D-vine copulas (Aas et al., 2009). In these algorithms, we first define that x = {x₁…, x_d}x=x1…xd are pseudo-observations (i.e., the maringal CDF: copula variables); we also define the parameters as T₁ : θ₁₁, …, θ_1(d − 1)T1:θ11,…,θ1d−1, T₂ : θ₂₁, …, θ_2(d − 2)T2:θ21,…,θ2d−2,…, T_d − 1 : θ_(d − 1)1Td−1:θd−11.

Simulation from a C-Vine Copula

The procedure for sampling from a C-vine copula can be described as algorithm 1 in Aas et al. (2009). This algorithm applies the margins (i.e., marginal CDF) as variable xx and variable 1 as the center variable. In other words, the algorithm simulates the pseudorandom variables rather than the random variables in a real domain. Algorithm 1 involves the following steps:

i. Generate d independent random numbers W = {w₁, …, w_d}W=w1…wd from uniform [0, 1] distribution. And we have x₁ = u₁ = w₁x1=u1=w1 and w_i = F(x_i| x₁, …, x_i − 1), i = 2, .., dwi=Fxix1…xi−1,i=2,…,d.
ii. Simulate x₂ = u₂x2=u2 from u₁u1 and w₂w2 as x₂ = u₂ = h⁻¹(w₂, u₁; θ₁₁)x2=u2=h−1w2u1θ11.
iii. Simulate x₃ = u₃x3=u3 from u₁, u₂u1,u2 and w₃w3, where w₃ = C(u₃| u₁, u₂)w3=Cu3u1u2 as follows:
- Simulating C(u₃| u₁)Cu3u1:
  
  w3=Cu3u1u2=∂C2,3∣1Cu3u1θ12Cu2u1θ11θ21∂Cu2u1=hC3∣1C2∣1θ21
  
  C_3 ∣ 1(u₃| u₁; θ₁₂) = h⁻¹(w₃, C_2 ∣ 1; θ₂₁) = h⁻¹(w₃, w₂; θ₂₁)
  C3∣1u3u1θ12=h−1w3C2∣1θ21=h−1w3w2θ21
- Simulating u₃u3 using C_3 ∣ 1C3∣1, which we just simulated, as follows:
  
  u₃ = h⁻¹(C_3 ∣ 1, u₁; θ₁₂)
  u3=h−1C3∣1u1θ12
iv. Simulate x₄ = u₄x4=u4 from u₁, u₂, u₃,u1,u2,u3, and w₄w4 with the following procedures:
- Simulating C(u₄| u₁, u₂)Cu4u1u2:
  
  w4=Cu4u1u2u3θ31=∂C34∣12Cu4u1u2θ22Cu3u1u2θ21θ31∂Cu3u1u2θ21=∂C34∣12Cu4u1u2θ22w3θ31∂w3C4∣12u4u1u2=h−1w4w3θ31
- Simulating u₄u4 using u₁u1 and C_2 ∣ 1 = w₂C2∣1=w2 as follows:
  
  C4∣12=∂C2,4∣1C4∣1C2∣1θ22∂C2∣1=∂C2,4∣1C4∣1w2θ22∂w2
  
  C_4 ∣ 1 = h⁻¹(h⁻¹(w₄, w₃; θ₃₁), w₂; θ₂₂) ⇒ u₄ = h⁻¹(C_4 ∣ 1, u₁; θ₁₃)
  C4∣1=h−1h−1w4w3θ31w2θ22⇒u4=h−1C4∣1u1θ13
  …

Carry on the logic for simulation until we reach the dimension d. And one may refer to Aas et al. (2009) for the exact algorithm.

Simulating the Random Variables for a D-Vine Copula

Algorithm 2 in Aas et al. (2009) provided the simulation procedure for the D-vine copula. As stated in Aas et al. (2009), algorithm 2 is less efficient than that for the C-vine copula. To simulate a d-dimensional D-vine copula, we will need to compute (d − 2)²d−22 conditional copulas, while we only need to compute(d − 2)(d − 1)/2d−2d−1/2 for a C-vine. Again, as with algorithm 1, algorithm 2 simulates the pseudorandom variables and includes the following steps:

i. Generate d-independent random numbers W = {w₁, …, w_d}W=w1…wd from uniform [0, 1] distribution. And we have x₁ = u₁ = w₁x1=u1=w1 and w_i = F(x_i| x₁, …, x_i − 1), i = 2, .., dwi=Fxix1…xi−1,i=2,…,d;
ii. Simulate x₂ = u₂x2=u2 from u₁u1 and w₂w2 as x₂ = u₂ = h⁻¹(w₂, u₁; θ₁₁)x2=u2=h−1w2u1θ11.
iii. Simulate x₃ = u₃x3=u3 from u₁, u₂u1,u2 and w₃w3 where w₃ = C(u₃| u₁, u₂)w3=Cu3u1u2 as follows:
- Compute the conditional copula C1∣2=∂C12u1u2θ11∂u2
- Simulate C(u₃| u₂)Cu3u2:
  
  w3=Cu3u1u2=∂C1,3∣2Cu3u2θ12Cu1u2θ11θ21∂Cu1u2=hC3∣2C1∣2θ21
  
  C_3 ∣ 2(u₃| u₂; θ₁₂) = h⁻¹(w₃, C_1 ∣ 2; θ₂₁) = h⁻¹(w₃, C_1 ∣ 2; θ₂₁)
  C3∣2u3u2θ12=h−1w3C1∣2θ21=h−1w3C1∣2θ21
- Simulate u₃u3 using C_3 ∣ 1C3∣2, which we just simulated, as follows:
  
  u₃ = h⁻¹(C_3 ∣ 2, u₂; θ₁₂)
  u3=h−1C3∣2u2θ12
iv. Simulate x₄ = u₄x4=u4 from u₁, u₂, u₃,u1,u2,u3, and w₄w4 with the following procedures:
- Compute the conditional copula C_1 ∣ 23C1∣23:
  
  C1∣23=∂C13∣2C1∣2C3∣2θ21∂C3∣2
- Simulate C(u₄| u₂, u₃)Cu4u2u3:
  
  w4=Cu4u1u2u3θ31=∂C14∣23Cu4u2u3θ22Cu1u2u3θ21θ31∂Cu1u2u3θ21=∂C14∣23Cu4u2u3θ22C1∣23θ31∂C1∣23C4∣23u4u2u3=h−1w4C1∣23θ31
- Compute C_2 ∣ 3C2∣3:
  
  C2∣3=∂C23u2u3θ12∂u3
- Simulate u₄u4 using u₃u3 and C_2 ∣ 3C2∣3 as follows:
  
  C4∣23=∂C2,4∣3C4∣3C2∣3θ22∂C2∣3⇒C4∣3=h−1C4∣23C2∣3θ22⇒u4=h−1C4∣3u3θ13
  …

Carry on the computation until we reach the d-dimension using Equation (5.38). Refer to Aas et al. (2009) for the exact algorithm.

Example 5.14 Simulate the random variables for the Clayton–Clayton C-vine copula with the following information: Θ = (θ₁₁, θ₁₂, θ₂₁) = (2.0, 5.0, 2.0)Θ=θ11θ12θ21=2.05.02.0 and the independent variables of (x₁, F(x₂| x₁), F(x₃| x₁, x₂)) = (w₁, w₂, w₃) = (0.1858, 0.1930, 0.3416)x1Fx2x1Fx3x1x2=w1w2w3=0.18580.19300.3416, where {x₁, x₂, x₃} ∈ uniform[0, 1]x1x2x3∈uniform01.

Solution: According to the sampling procedure discussed, we can simulate the random variables from the vine copula using Figure 5.8(b) in what follows.

As shown in Chapter 4, the bivariate Clayton copula is given as follows:

Cuvθ=u−θ+v−θ−1−1θ

a. Set x₁ = w₁ = 0.1858x1=w1=0.1858
b. From w2=Fx2x1=hx2x1θ11=∂Cx1x2θ11∂x1, we have the following:

w2=∂Cx1−θ11+x2−θ11−1−1θ11∂x1=x1−θ11−1×1−θ11+x2−θ11−1−1−1θ11

⇒x2=h−1w2x1θ11=1+x1−θ11w2−θ111+θ11−x1−θ11−1θ11

Substituting x₁ = 0.1858, w₂ = 0.1930, θ₁₁ = 2.0x1=0.1858,w2=0.1930,θ11=2.0 into the preceding equation, we have the following:

x₂ = 0.1304
x2=0.1304.
c. Set w₃ = F(x₃| x₁, x₂) = h{h(x₃, x₁; θ₁₂), h(x₂, x₁; θ₁₁); θ₂₁}w3=Fx3x1x2=hhx3x1θ12hx2x1θ11θ21, where

hx3x1θ12=t2=x1−θ12−1×1−θ12+x3−θ12−1−1−1θ12;

hx2x1θ11=t1=x1−θ11−1×1−θ11+x2−θ11−1−1−1θ11;

hhx3x1θ12hx2x1θ11θ21=t1−θ21−1t1−θ21+t2−θ21−1−1−1θ21

Substitute x₁ = 0.1858, x₂ = 0.1304, w₃ = 0.3416, θ₁₁ = 2.0, θ₁₂ = 5.0, θ₂₁ = 2.0x1=0.1858,x2=0.1304,w3=0.3416,θ11=2.0,θ12=5.0,θ21=2.0 to solve the nonlinear equation

x₃ = h⁻¹{h⁻¹(0.3416, h(0.1304, 0.1858; 2.0); 2.0), 0.1858; 5.0}x3=h−1h−10.3416h0.13040.18582.02.00.18585.0, and we have the following:

x₃ = 0.1484
x3=0.1484.

Finally, we get the following:

(x₁, x₂, x₃) = (0.1858, 0.1304, 0.1484)
x1x2x3=0.18580.13040.1484.

5.3.5 Parameter Estimation for a Specified Pair-Copula Decomposition

Parameter estimation for specified pair-copula decomposition can be obtained using the log-likelihood method for the C-vine copula using the density function given by Equation (5.28) or D-vine copula with the density function given by Equation (5.27).

Parameter Estimation for a C-Vine Copula

From Equation (5.28), the log-likelihood expression of the C-vine copula is given as

LogLxvΘ=∑j=1d−1∑i=1d−j∑t=1Tlncj,j+i∣1,…,j−1Fxj,tx1,t…xj−1,tFxj+i,tx1,t…xj−1,t

(5.39)

The log-likelihood in Equation (5.39) must be numerically maximized over all parameters using the algorithm 3 (Aas et al., 2009). As discussed earlier, for the d-dimensional Vine copula, we have T = {T_i : i = 1, …d − 1}T=Ti:i=1…d−1 levels. Within each level T_iTi, we have Edge_{T_i} = {E_j : j = 1, …, d − i}.EdgeTi=Ej:j=1…d−i. In other words, we have d − id−i bivariate unconditional/conditional copulas for each level T_iTi. There are two loops in algorithm 3. The outer loop identifies the tree level, while the inner loop identifies the edges (i.e., the bivariate copulas) of each level. Using variable 1 as the center variable, the algorithm can be explained as follows:

Setting

x₀ = [x₁, …, x_d] = [u₁, …, u_d], θ = [θ₁₁, θ₁₂, …θ_(d − 1)1]x0=x1…xd=u1…ud,θ=θ11θ12…θd−11 and LL=0

Outer Loop: i = 1 to d − 1 (for level T)

    Inner Loop: j = 1 to d − i (edges for each level)

        c = copulapdf(x_{i − 1, 1}, x_{i − 1, j + 1}, θ_ij);

        LL = LL +  ∑  ln (c);

        x_ij = h(x_{i − 1, j + 1}, x_{i − 1, 1}; θ_ij)

    End Inner Loop

End Outer Loop

Parameter Estimation for a D-Vine Copula

For the D-vine copula, the log-likelihood function is given by

LogLxvΘ=∑j=1d−1∑i=1d−1∑t=1Tlnci,i+j∣1,…,j−1Fxi,txi+1,t…xi+j−1,tFxi+j,txi+1,t…xi+j−1,t

(5.40)

Let Θ_{j, i}Θj,i be the set of parameters of the copula density C_{i, i + j ∣ i + 1, …, i + j − 1}(⋅, ⋅)Ci,i+j∣i+1,…,i+j−1⋅⋅. Algorithm 4 (Aas et al., 2009) evaluates the likelihood, which can be explained as follows:

Setting

s₀ = [s₀₁, s₀₂, …, s_0d] = [x₁, …, x_d] = [u₁, …, u_d], θ = [θ₁₁, θ₁₂, …θ_(d − 1)1]

s0=s01s02…s0d=x1…xd=u1…ud,θ=θ11θ12…θd−11andLL=0

Compute the log-likelihood (LL) for T₁ and start the computation of conditional copulas:

for i = 1 to d − 1

fori=1tod−1

c = c(x_i, x_i + 1; θ_1i), LL = LL + ∑ (lnc)

c=cxixi+1θ1i,LL=LL+∑lnc

end

end

s₁₁ = h(s₀₁, s₀₂; θ₁₁)

s11=hs01s02θ11

Prepare the conditional probability for a higher level:

for i = 1 to d − 3

fori=1tod−3

s_1(2i) = h(s_0(i + 2), s_0(i + 1); θ_1(i + 1)), s_1(2i + 1) = h(s_0(i + 1), s_0(k + 2); θ_1(i + 1))

s12i=hs0i+2s0i+1θ1i+1,s12i+1=hs0i+1s0k+2θ1i+1

end

end

s_{1(2d − 4)} = h(s_0d, s_0(d − 1); θ_1(d − 1))

s12d−4=hs0ds0d−1θ1d−1

Update the log-likelihood as well as the conditional probability for a higher level:

for i = 2 to d − 1

fori=2tod−1

for j = 1 to d − i

forj=1tod−i

c = copulapdf(s_{(i − 1)(2j − 1)}, s_{(i − 1)(2j)}; θ_ij)

c=copulapdfsi−12j−1si−12jθij

LL = LL + ∑ (lnc)

LL=LL+∑lnc

end

end

stop the loop if i = d − 1i=d−1; otherwise, we will continue the loop

s_i1 = h(s_(i − 1)1, s_(i − 1)2; θ_i1)

si1=hsi−11si−12θi1

again stop the loop if d ≤ 4d≤4; otherwise we will continue on

for j = 1 to d − i − 2

forj=1tod−i−2

s_{i, 2j} = h(s_{(i − 1)(2j + 2)}, s_{(i − 1)(2j + 1)}; θ_i(j + 1)),

si,2j=hsi−12j+2si−12j+1θij+1,

s_i(2j + 1) = h(s_{(i − 1)(2j + 1)}, s_{(i − 1)(2j + 2)}; θ_i(j + 1))

si2j+1=hsi−12j+1si−12j+2θij+1

end

end

s_{i(2d − 2i − 2)} = h(s_{(i − 1)(2d − 2i)}, s_{(i − 1)(2d − 2i − 1)}; θ_i(n − i))

si2d−2i−2=hsi−12d−2isi−12d−2i−1θin−i

end

end

To apply algorithms 3 and 4 to optimize the parameters, the initial values of the parameters are needed, which may be determined as follows (Aas et al., 2009):

a. Estimate parameters of the copulas in T₁ from the original data.
b. Compute observations (i.e., conditional distribution functions) for T₂ using the copula parameters from T₁ and the corresponding h-function.
c. Estimate parameters of the copulas in T₂ using the results computed from step b.
d. Compute observations for T₃ using the copula parameters at T₂ and the corresponding h-function.
e. Estimate the parameters of copulas in T₃ using the results computed from step d.

…
f. Repeat the previous steps sequentially until we teach the top level of the vine tree, i.e., T_d–1.

Parameter Estimation for Basic Three-Variable Model

For a three-dimensional special case (i.e., Figure 5.10(a)), the log-likelihood in Equation (5.39) and Equation (5.40) can be simply written as

LogLxvΘ=∑i=1nlnc12(x1,ix2,iΘ11+lnc23x2,ix3,iΘ12+lnc13∣2v1,iv2,iΘ21)

(5.41)

where v_{1, i} = F(x_{1, i}| x_{2, i}) = h(x_{1, i}, x_{2, i}; Θ₁₁)v1,i=Fx1,ix2,i=hx1,ix2,iΘ11 and v_{2, i} = F(x_{3, i}| x_{2, i}) = h(x_{3, i}, x_{2, i}; Θ₁₂)v2,i=Fx3,ix2,i=hx3,ix2,iΘ12; Θ_jiΘji are the set of parameters of the corresponding copula density c_{j, j + i ∣ 1, …, j − 1}(⋅| ⋅)cj,j+i∣1,…,j−1⋅⋅. Here we give some common h-functions.

For the Gumbel–Hougaard copula, the h-function can be given as

hu1u2θ=∂Cu1u2θ∂u1=Cu1u2θu2−lnu2θ−1−lnu1θ+−lnu2θ1θ−1

(5.42)

where Cu1u2θ=e−−lnu1θ+−lnu2θ1θ. For the Clayton copula, the h-function can be expressed as

hu1u2θ=∂Cu1u2θ∂u2=u2−θ−1u1−θ+u2−θ−1−1−1θ

(5.43)

For the Frank copula, the h-function can be written as

hu1u2θ=∂Cu1u2θ∂u2=e−θu21−e−θe−θu1+e−θu2−1

(5.44)

For the Ali–Mikhail–Haq copula, the h-function can be cast as

hu1u2θ=∂Cu1u2θ∂u2=u2+θu2−1+u2−1+θ−1+u1−1+u22

(5.45)

For the Gaussian copula, the h-function can be written as

hu1u2ρ12=∂Cu1u2ρ12∂u2=ΦΦ−1u1−ρ12Φ−1u21−ρ122

(5.46)

In Equation (5.46), ρ₁₂ρ12 is the parameter of copula, i.e., the correlation coefficient for the bivariate random variables after meta-Gaussian transformation, and Φ⁻¹()Φ−1 is the inverse of the standard univariate Gaussian distribution function.

For the Student t copula, the h-function can be given as

hu1u2ρ12ν12=∂Cu1u2ρ12ν12∂u2=Tν12+1Tν12−1u1−ρ12Tν12−1u2ν12+Tν12−1u221−ρ122ν12+1

(5.47)

In Equation (5.47), ρ₁₂ρ12 and ν₁₂ν12 are the parameters of Student t copula, i.e., the correlation coefficient and degree of freedom for the transformed variables using Student distribution with degree of freedom (d.f.) of ν₁₂ν12; and Tν12−1⋅ is the inverse of Student T distribution with d.f. of ν₁₂ν12, expectation 0, and variance ν12ν12−2.

Example 5.15 Assuming that the trivariate random variable given in Table 5.4 may be modeled by the Clayton–Clayton–Frank vine copula with the vine scheme shown in Figure 5.10(a), (1) estimate the parameters using the sequential MLE; and (2) simulate 50 samples from the fitted vine-copula function.

Table 5.4. Data and results for Example 5.14.

u₁u1	u₂u2	u₃u3	h(u₁, u₂; θ₁₁)hu1u2θ11	h(u₃, u₂; θ₁₂)hu3u2θ12
0.241	0.138	0.103	0.892	0.061
0.241	0.172	0.172	0.762	0.460
0.241	0.241	0.276	0.424	0.729
0.241	0.586	0.655	0.010	0.696
0.793	0.828	0.897	0.503	0.741
0.483	0.345	0.379	0.771	0.660
0.931	0.914	0.621	0.767	0.026
0.724	0.759	0.724	0.452	0.379
0.414	0.621	0.586	0.102	0.344
0.759	0.414	0.310	0.936	0.061
0.862	0.793	0.793	0.705	0.500
0.655	0.517	0.448	0.716	0.195
0.414	0.379	0.552	0.526	0.954
0.569	0.448	0.414	0.699	0.297
0.569	0.690	0.690	0.254	0.472
0.414	0.310	0.241	0.727	0.083
0.241	0.552	0.862	0.013	0.981
0.069	0.035	0.035	0.935	0.460
0.241	0.276	0.345	0.287	0.852
0.069	0.069	0.069	0.424	0.460
0.897	0.914	0.931	0.661	0.694
0.655	0.655	0.483	0.473	0.053
0.069	0.103	0.138	0.100	0.908
0.241	0.207	0.207	0.593	0.460
0.655	0.724	0.759	0.364	0.587
0.517	0.483	0.517	0.517	0.609
0.828	0.862	0.828	0.539	0.431
0.966	0.966	0.966	0.854	0.776

Solution:

1. Estimate the parameters. For the bivariate Clayton copula C(u, v; θ)Cuvθ, its copula density function can be given as follows:

cuvθ=1+θuθ+1vθ+1u−θ+v−θ−12+1θ(5.48)
For the bivariate Frank copula, its copula density function can be given as follows:

cuvθ=θe−θu+ve−θu−1e−θv−1e−θ−12s12−θe−θu+ve−θ−1s1;s1=e−θu−1e−θv−1e−θ−1+1(5.49)
1. a. Estimate the parameters for T_1.Using the maximum likelihood estimation for the Clayton copula, the copula parameters estimated for T₁ can be estimated as follows:θ₁₁ = 4.1728; θ₁₂ = 8.3834θ11=4.1728;θ12=8.3834 for (u₁, u₂)u1u2 and (u₂, u₃)u2u3, respectively.
2. b. Compute the conditional distribution functions for T₂ using the copula parameters estimated from T₁. Using the h-function for the Clayton copula (Equation (5.43)) and parameters estimated for T₁, we have the following:
  
  hu1u2θ11=u2−5.1728u1−4.1728+u2−4.1728−1−1−14.1728
  
  hu3u2θ12=u2−9.3834u2−8.3834+u3−8.3834−1−1−18.3834
  Table 5.4 lists the original datasets with the fourth and fifth columns as the computed conditional probabilities.
3. c. Estimate the parameter for T₂ using the computed conditional probabilities from step b.

Similar to step a, using the maximum likelihood estimation for the Frank copula, the parameter estimated for T₂ is estimated as θ₂₁ = − 3.8431θ21=−3.8431.

2. Simulate 50 samples from the fitted vine-copula function:

Based on the algorithm 2 for sampling from the D-vine copula, we can simulate the samples from the fitted vine-copula as follows:
1. a. Generate independently uniform random variables {w₁, w₂, w₃}.w1w2w3.
2. b. Set u₁ = w₁.u1=w1.
3. c. Use w₂ = C(u₂| u₁) = h₁₂(u₂, u₁; 4.1728)w2=Cu2u1=h12u2u14.1728 to compute u2=h12−1w2u14.1728 using the h-function of the Clayton copula (Equation (5.43)).
4. d. Compute u₃u3 with the following procedure:
  
  Cu3u1u2=∂C13∣2C1∣2u1u2C3∣2u3u2θ21∂C1∣2u1u2=h13∣2h23u3u28.3834h12u1u24.1728−3.8431
  
  u3=h23−1h13∣2−1w3h12u1u24.1728− 3.8431u28.3834
  where h₁₂, h₂₃h12,h23 are h-functions for the Clayton copula at T₁; h_13 ∣ 2h13∣2 is the h-function for the Frank copula (Equation (5.44)) at T₂.

Using the simulated samples and pseudo-observations, Figure 5.13 evaluated the performance of the fitted vine copula. it is seen that the pair-wise dependence is well preserved

Figure 5.13 Comparison of observed variables with those simulated from vine copula.

Example 5.16 Using the four-dimensional pseudo-observations in Example 5.12 to (1) estimate the copula parameters using sequential MLE if D-vine copula (Figure 5.11(a)) with the specified copula (i.e., the Gumbel– Hougaard copula for T1 and the Frank copula for T2 and T3) and C-vine copula (Figure 5.11(b)) with specified copula (i.e., the Gumbel– Hougaard copula for T1, T2, and T3); and (2) simulate the random variates for the sample size of 100 from the fitted copulas.

Solution:

I. D-Vine Copula
1. 1. Estimate the copula parameters:
  
  The density function of the biviariate Gumbel–Hougaard and Frank copulas are given in Chapter 4 as follows:
  
  Gumbel–Hougaard copula:
  
  cuvθ=lnulnvθ−1e−S11θS12θ−2−1−θS11θ−2uv;S1=−lnuθ+−lnvθ(5.50)
  
  Frank copula: The same as the previous example, its copula density is given as Equation (5.49).
  1. a. Estimate the parameters for the D-vine copula.
    
    Estimation of copula parameters (the Gumbel–Hougaard copula) for T₁:
    
    For T₁, applying the MLE, we have: θ₁₁ = 3.8545, L₁₁ = 59.783θ11=3.8545,L11=59.783 for (u₁, u₂)u1u2; θ₁₂ = 3.0942, L₁₂ = 49.653θ12=3.0942,L12=49.653 for (u₂, u₃)u2u3; θ₁₃ = 4.3949, L₁₃ = 71.727θ13=4.3949,L13=71.727 for (u₃, u₄)u3u4.
    
    Estimation of copula parameters (Frank copula) for T₂:
    1. i. Compute the conditional distribution C_1 ∣ 2(u₁| U₂ = u₂; θ₁₁ = 3.8545)C1∣2u1U2=u2θ11=3.8545, C_3 ∣ 2(u₃| U₂ = u₂; θ₁₂ = 3.0942); C_2 ∣ 3(u₂| U₃ = u₃; θ₁₂ = 3.0942);C3∣2u3U2=u2θ12=3.0942;C2∣3u2U3=u3θ12=3.0942; andC_4 ∣ 3(u₄| U₃ = u₃; θ₁₃ = 4.3949)C4∣3u4U3=u3θ13=4.3949.
    2. ii. Apply the MLE to estimate the parameters for T₂ as follows: θ₂₁ = 1.9708, L₂₁ = 3.032θ21=1.9708,L21=3.032 for (C_1 ∣ 2, C_3 ∣ 2)C1∣2C3∣2; θ₂₂ = 0.7916, L₂₂ = 0.565θ22=0.7916,L22=0.565 for (C_2 ∣ 3, C_4 ∣ 3)C2∣3C4∣3.
      
      Estimation of copula parameters (the Frank copula) for T₃:
      
      According to Figure 5.11(a), the copula function for T₃ is given as follows: C_14 ∣ 23(F(u₁| u₂, u₃), F(u₄| u₂, u₃))C14∣23Fu1u2u3Fu4u2u3
      
      From Equation (5.24), we have the following:
      
      Fu1u2u3=∂C13∣2Fu1u2Fu3u2∂Fu3u2;Fu4u2u3=∂C24∣3Fu2u3Fu4u3∂Fu2u3

Using the parameters estimated for T₁ and T₂, we can easily calculate the conditional probability distribution needed for parameter estimation in T₃. Maximizing the log-likelihood for the specified Frank copula, we have θ₃₁ = − 0.4281, L₃₁ = 0.173θ31=−0.4281,L31=0.173.

Finally, we have the following:

T₁: θ₁₁ = 3.8545; θ₁₂ = 3.0942; θ₁₃ = 4.3949θ11=3.8545;θ12=3.0942;θ13=4.3949
T₂: θ₂₁ = 1.9708; θ₂₂ = 0.7916θ21=1.9708;θ22=0.7916
T₃: θ₃₁ = − 0.4281θ31=−0.4281

The overall log-likelihood is computed as the sum of all LL s: L = 184.933L=184.933. Table 5.5 lists the conditional probability distributions computed for T₂ and T₃ using the fitted copula of the previous level.

Table 5.5. Conditional probability distributions computed for T₂ and T₃ for fitted D-Vine copula

T₂				T₃
C_{u₂ ∣ u₁}Cu2∣u1	C_{u₂ ∣ u₃}Cu2∣u3	C_{u₃ ∣ u₂}Cu3∣u2	C_{u₄ ∣ u₃}Cu4∣u3	C_{u₁ ∣ u_2,u₃}Cu1∣u2,u3	C_{u₄ ∣ u₂, u₃}Cu4∣u2,u3
0.143	0.327	0.654	0.830	0.099	0.851
0.134	0.971	0.089	0.387	0.237	0.302
0.470	0.613	0.499	0.703	0.469	0.687
0.524	0.258	0.638	0.456	0.458	0.504
0.722	0.200	0.910	0.799	0.554	0.837
0.307	0.445	0.625	0.291	0.246	0.298
0.220	0.665	0.149	0.346	0.340	0.316
0.500	0.106	0.949	0.118	0.292	0.152
0.102	0.487	0.409	0.119	0.107	0.118
0.736	0.122	0.749	0.929	0.648	0.948
0.742	0.486	0.487	0.701	0.761	0.705
0.654	0.588	0.414	0.821	0.702	0.814
0.773	0.651	0.058	0.575	0.899	0.546
0.143	0.535	0.307	0.780	0.179	0.777
0.760	0.196	0.888	0.220	0.615	0.261
0.601	0.529	0.713	0.712	0.505	0.710
0.177	0.040	0.991	0.725	0.068	0.794
0.593	0.209	0.674	0.084	0.516	0.101
0.401	0.697	0.497	0.965	0.395	0.960
0.507	0.843	0.092	0.756	0.698	0.706
0.690	0.376	0.749	0.125	0.592	0.134
0.254	0.802	0.313	0.945	0.313	0.933
0.143	0.386	0.193	0.943	0.214	0.949
0.467	0.191	0.796	0.296	0.326	0.347
0.265	0.376	0.821	0.482	0.151	0.506
0.908	0.361	0.693	0.806	0.884	0.825
0.809	0.261	0.575	0.096	0.801	0.112
0.891	0.052	0.969	0.761	0.786	0.821
0.345	0.880	0.081	0.305	0.534	0.243
0.015	0.934	0.055	0.817	0.031	0.763
0.145	0.754	0.020	0.833	0.281	0.806
0.762	0.985	0.071	0.683	0.890	0.597
0.989	0.140	0.771	0.440	0.984	0.510
0.164	0.801	0.319	0.443	0.202	0.384
0.026	0.887	0.144	0.902	0.045	0.873
0.032	0.991	0.048	0.792	0.065	0.724
0.101	0.693	0.409	0.985	0.106	0.983
0.713	0.159	0.722	0.948	0.632	0.961
0.163	0.137	0.659	0.042	0.113	0.054
0.309	0.231	0.609	0.286	0.254	0.329
0.177	0.784	0.293	0.579	0.226	0.524
0.731	0.301	0.812	0.113	0.613	0.128
0.099	0.071	0.968	0.508	0.037	0.592
0.072	0.211	0.399	0.209	0.077	0.246
0.320	0.413	0.737	0.126	0.218	0.132
0.190	0.015	0.991	0.874	0.075	0.912
0.047	0.873	0.224	0.245	0.069	0.193
0.363	0.899	0.250	0.086	0.472	0.063
0.566	0.375	0.664	0.843	0.492	0.857
0.383	0.614	0.602	0.692	0.328	0.675
0.041	0.525	0.625	0.095	0.028	0.091
0.863	0.437	0.750	0.267	0.813	0.274
0.096	0.225	0.862	0.805	0.044	0.840
0.461	0.424	0.290	0.108	0.560	0.112
0.805	0.595	0.297	0.149	0.871	0.137
0.555	0.470	0.505	0.058	0.557	0.058
0.414	0.312	0.873	0.486	0.248	0.523
0.191	0.929	0.210	0.510	0.274	0.425
0.449	0.783	0.124	0.602	0.628	0.549
0.785	0.327	0.852	0.403	0.666	0.435

Note: C_{u₁ ∣ u₂, u₃} = ∂C_13 ∣ 2(C_{u₁ ∣ u₂}, C_{u₃ ∣ u₂})/∂C_{u₃ ∣ u₂}Cu1∣u2,u3=∂C13∣2Cu1∣u2Cu3∣u2/∂Cu3∣u2; C_{u₄ ∣ u₂, u₃} = ∂C_24 ∣ 3(C_{u₄ ∣ u₃}, C_{u₂ ∣ u₃})/∂C_{u₂ ∣ u₃}Cu4∣u2,u3=∂C24∣3Cu4∣u3Cu2∣u3/∂Cu2∣u3.

II: C-Vine Copula
1. a. Estimation of copula parameters (the Gumbel–Hougaard copula) for T₁:
  
  According to Figure 5.11(b), we have the parameters estimated for T₁ as follows: θ₁₁ = 3.8545, L₁₁ = 59.783θ11=3.8545,L11=59.783 for (u₁, u₂)u1u2; θ₁₂ = 3.0834, L₁₂ = 47.245θ12=3.0834,L12=47.245 for (u₁, u₃)u1u3; θ₁₃ = 2.5704, L₁₃ = 38.08θ13=2.5704,L13=38.08 for (u₁, u₄)u1u4.
2. b. Estimation of copula parameters (the Gumbel–Hougaard copula) for T₂:
  
  From Figure 5.11(b), we need to compute the conditional distribution using the parameter estimated from T₁ first, and then we will be able to estimate the copula parameters for T₂ as follows:
  1. i. Compute the conditional distribution C_2 ∣ 1(u₂ ∣ U₁ = u₁; θ₁₁ = 3.8545C2∣1(u2∣U1=u1;θ11=3.8545), C_3 ∣ 1(u₃| U₁ = u₁; θ₁₂ = 3.0834)C3∣1u3U1=u1θ12=3.0834 and C_4 ∣ 1(u₄| U₁ = u₁; θ₁₃ = 2.5704)C4∣1u4U1=u1θ13=2.5704.
  2. ii. Apply the MLE to estimate the parameters for T2 as follows:
    - θ₂₁ = 1.2618, L₂₁ = 4.265θ21=1.2618,L21=4.265 for (C_2 ∣ 1, C_3 ∣ 1)C2∣1C3∣1;θ₂₂ = 1.267, L₂₂ = 4.356θ22=1.267,L22=4.356 for (C_2 ∣ 1, C_4 ∣ 1)C2∣1C4∣1.
3. c. Estimation of copula parameters (the Gumbel–Hougaard copula) for T₃:
  
  According to Figure 5.11(b), the copula function for T₃ is given as C_34 ∣ 12(F(u₃| u₁, u₂), F(u₄| u₁, u₂))C34∣12Fu3u1u2Fu4u1u2.
  
  From Equation (5.24), we have the following:
  
  Fu3u1u2=∂C23∣1Fu3u1Fu2u1∂Fu2u1;Fu4u1u2=∂C24∣1Fu4u1Fu2u1∂Fu2u1
  
  Using the parameters estimated for T₁ and T₂, we will first compute the conditional probability needed for parameter estimation in T₃. Maximizing the log-likelihood for the specified Frank copula, we have θ₃₁ = 1.959, L₃₁ = 27.687θ31=1.959,L31=27.687.
  
  Finally, we have the following:
  - T1: θ₁₁ = 3.8545; θ₁₂ = 3.0834; θ₁₃ = 2.5704θ11=3.8545;θ12=3.0834;θ13=2.5704
  - T2: θ₂₁ = 1.2618; θ₂₂ = 1.2672θ21=1.2618;θ22=1.2672
  - T3: θ₃₁ = 1.959θ31=1.959
  - The overall log-likelihood is computed as L = 181.416L=181.416. Table 5.6 lists the conditional probability distributions computed for T₂ and T₃.

Table 5.6. Conditional probability distributions computed for T₂ and T₃ of a fitted C-Vine copula.

T₂			T₃
C_{u₂ ∣ u₁}Cu2∣u1	C_{u₃ ∣ u₁}Cu3∣u1	C_{u₄ ∣ u₁}Cu4∣u1	C_{u₃ ∣ u₁, u₂}Cu3∣u1,u2	C_{u₄ ∣ u₁, u₂}Cu4∣u1,u2
0.804	0.853	0.910	0.805	0.887
0.939	0.323	0.303	0.157	0.143
0.620	0.556	0.663	0.539	0.658
0.368	0.585	0.536	0.654	0.603
0.406	0.855	0.905	0.905	0.946
0.722	0.762	0.591	0.731	0.530
0.637	0.258	0.225	0.224	0.191
0.574	0.954	0.762	0.972	0.784
0.819	0.712	0.370	0.609	0.260
0.145	0.535	0.841	0.667	0.922
0.263	0.307	0.464	0.378	0.557
0.372	0.316	0.572	0.357	0.640
0.101	0.014	0.056	0.025	0.094
0.744	0.548	0.717	0.472	0.664
0.322	0.790	0.573	0.862	0.656
0.594	0.710	0.781	0.721	0.800
0.894	0.998	0.997	0.998	0.996
0.287	0.569	0.213	0.660	0.260
0.743	0.634	0.913	0.568	0.911
0.477	0.089	0.259	0.086	0.264
0.412	0.656	0.346	0.716	0.379
0.830	0.550	0.859	0.420	0.797
0.530	0.319	0.789	0.314	0.825
0.477	0.797	0.631	0.843	0.672
0.863	0.938	0.898	0.904	0.834
0.127	0.321	0.551	0.441	0.691
0.122	0.286	0.098	0.400	0.152
0.129	0.869	0.900	0.940	0.959
0.655	0.125	0.109	0.099	0.085
0.979	0.520	0.727	0.213	0.352
0.594	0.050	0.330	0.040	0.307
0.413	0.046	0.115	0.048	0.121
0.008	0.108	0.137	0.232	0.283
0.898	0.661	0.596	0.471	0.402
0.977	0.709	0.882	0.355	0.558
0.992	0.455	0.646	0.138	0.224
0.925	0.801	0.973	0.599	0.936
0.180	0.529	0.863	0.650	0.933
0.567	0.775	0.293	0.801	0.276
0.503	0.671	0.495	0.707	0.516
0.867	0.594	0.617	0.435	0.456
0.365	0.702	0.367	0.773	0.417
0.906	0.994	0.983	0.991	0.970
0.622	0.625	0.406	0.615	0.376
0.774	0.859	0.563	0.830	0.470
0.736	0.996	0.996	0.998	0.998
0.975	0.778	0.576	0.429	0.256
0.793	0.397	0.173	0.297	0.112
0.459	0.627	0.801	0.672	0.851
0.766	0.745	0.794	0.686	0.748
0.973	0.958	0.743	0.779	0.393
0.251	0.488	0.360	0.586	0.444
0.918	0.973	0.976	0.942	0.950
0.335	0.252	0.090	0.293	0.104
0.178	0.125	0.062	0.175	0.089
0.434	0.461	0.130	0.500	0.134
0.762	0.928	0.888	0.925	0.873
0.918	0.518	0.527	0.313	0.317
0.503	0.133	0.235	0.128	0.233
0.370	0.731	0.645	0.800	0.717

(2) Simulate random variates from fitted copulas:
- Simulation from a fitted D-vine copula

According to algorithm 2, we can simulate the random variates from the fitted D-vine copula as follows:

Step 1: Generate independent uniformly distributed random variables: {w₁, w₂, w₃, w₄}.w1w2w3w4.
Step 2: Simulate u₁u1 by setting u₁ = v₁₁ = w₁.u1=v11=w1.
Step 3: Simulate u₂u2 by setting u₂ = v₂₁ = h⁻¹(w₂, u₁; 3.8545),u2=v21=h−1w2u13.8545, where hh is the conditional probability distribution for the Gumbel–Hougaard copula.
Step 4: Simulate u₃u3:
- Calculate v₂₂ = h(v₁₁, v₂₁; 3.8545) = h(u₁, u₂; 3.8545).v22=hv11v213.8545=hu1u23.8545.
- Simulate u₃u3 in the same way as in Example 5.14:
  
  u₃ = v₃₁ = h⁻¹(h⁻¹(w₃, v₂₂; θ₂₁), v₂₁; θ₁₂)
  u3=v31=h−1h−1w3v22θ21v21θ12
  
  = h⁻¹{h⁻¹[w₃, h(u₁, u₂; 3.8545); 1.9708], u₂; 3.0942}
  =h−1h−1w3hu1u23.85451.9708u23.0942
- Simulate u₄u4 using the following procedure:
  - ✓ Calculate v₃₂v32, v₃₃v33, and v₃₄v34 using
    
    v₃₂ = h(v₂₁, v₃₁; θ₁₂) = h(u₂, u₃; 3.0942)
    v32=hv21v31θ12=hu2u33.0942
    
    v₃₃ = h(v₃₁, v₂₁; θ₁₂) = h(u₃, u₂; 3.0942)
    v33=hv31v21θ12=hu3u23.0942
    
    v₃₄ = h(v₂₂, v₃₃; θ₂₁) = h{h(u₁, u₂; 3.8545), h(u₃, u₂; 3.0942); 1.9708}
    v34=hv22v33θ21=hhu1u23.8545hu3u23.09421.9708
  - ✓ Finally simulate u₄u4 using:
    
    temp1 = h⁻¹(w₄, v₃₄; θ₃₁) = h⁻¹(w₄, v₃₄; −0.4281)
    temp1=h−1w4v34θ31=h−1w4v34−0.4281
    
    temp2 = h⁻¹(temp1, v₃₂; θ₂₂) = h⁻¹(temp1, v₃₂; 0.7916)
    temp2=h−1temp1v32θ22=h−1temp1v320.7916
    
    u₄ = v₄₁ = h⁻¹(temp2, u₃; θ₁₃) = h⁻¹(temp2, u₃; 4.3949)
    u4=v41=h−1temp2u3θ13=h−1temp2u34.3949
  To this end, we simulate random variates from the fitted D-vine copula. As discussed earlier, for every hh function (i.e., the conditional copula function of the corresponding bivariate copula functions: the Gumbel–Hougaard copula for T1 and T2, and the Frank copula for T3), the second variable is the conditioning variable. Figure 5.14(a) compares the pseudo-observations with those simulated from the D-vine copula.
- Simulation from a fitted C-vine copula
  - To simulate random variates from the fitted C-vine copula, algorithm 1 is applied. By generating independent uniformly distributed random variables {w₁, w₂, w₃, w₄}w1w2w3w4, we can simulate u₁ = v₁₁u1=v11 and u₂ = v₂₁u2=v21 using the exact same procedure as that for simulation from the fitted D-vine copula. In what follows, we will discuss how to generate u₃u3 and u₄u4 using algorithm 1 in detail:

i. Simulateu₃u3:
- ✓ Calculate v₂₂v22, i.e., C_2 ∣ 1C2∣1:
  
  v₂₂ = h(v₂₁, v₁₁; θ₁₁) = h(u₂, u₁; 3.8545)
  v22=hv21v11θ11=hu2u13.8545
- ✓ Simulate u₃u3 by computing temp = C_3 ∣ 1temp=C3∣1 first:
  
  From w3=C3∣1,2=∂C23∣1C3∣1C2∣1∂C2∣1=hC3∣1C2∣1θ21=hC3∣1v22θ21, we have the following:C_3 ∣ 1 = temp = h⁻¹(w₃, v₂₂; θ₂₁) = h⁻¹(w₃, v₂₂; 1.2618)C3∣1=temp=h−1w3v22θ21=h−1w3v221.2618, and
  
  u₃ = v₃₁ = h⁻¹(temp, v₁₁; θ₁₂) = h⁻¹(temp, u₁; 3.0834)
  u3=v31=h−1tempv11θ12=h−1tempu13.0834
ii. Similarly, we can simulate u₄u4 as follows:
- ✓ Calculate v₃₂v32 and v₃₃v33:
  
  v₃₂ = h(v₃₁, v₁₁; θ₁₂) = h(u₃, u₁; 3.0834)
  v32=hv31v11θ12=hu3u13.0834
  
  v₃₃ = h(v₃₂, v₂₂; θ₂₁) = h(v₃₂, v₂₂; 1.2618)
  v33=hv32v22θ21=hv32v221.2618
- ✓ Simulate u₄u4:
  
  temp1 = h⁻¹(w₄, v₃₃; θ₃₁) = h⁻¹(w₄, v₃₃; 1.9590)
  temp1=h−1w4v33θ31=h−1w4v331.9590
  
  temp2 = h⁻¹(temp1, v₂₂; θ₂₂) = h⁻¹(temp1, v₂₂; 1.2672)
  temp2=h−1temp1v22θ22=h−1temp1v221.2672
  
  u₄ = v₄₁ = h⁻¹(temp2, v₁₁; θ₁₃) = h⁻¹(temp2, u₁; 2.5704)
  u4=v41=h−1temp2v11θ13=h−1temp2u12.5704

Figure 5.14 (a) Comparison of pseudo-observations with those simulated from the fitted D-vine copula; (b) comparison of pseudo-observations with those simulated from the fitted C-vine copula.

Figure 5.14(b) compares the pseudo-observations with those simulated from the fitted C-vine copula.

For the simulation of random variates, the inverse of the hh function is evaluated numerically for both D-vine and C-vine copulas.

Based on the overall log-likelihood computed in this example, we see that the log-likelihood value for the D-vine copula is slightly higher than that for the C-vine copula. Simulation plots show similar results between the fitted D-vine and C-vine copulas.

5.3.6 Selection of Vine Copula Structure

Previously, we have discussed how to estimate the parameters for the specified vine copula structure. Following Aas et al. (2009), for the estimation of pair-copula decomposition, we should consider (i) the selection of pair-copula decompositions; (ii) the selection of pair-copula types; and (iii) the estimation of copula parameters. In principle, we may use all the possible decompositions to estimate the copula parameters and to choose the best-fitted vine copula structure for a given d-dimensional variable. However, in reality with higher dimensions (i.e., d ≥ 3)d≥3), the number of possible decompositions increases significantly as d ! /2d!/2 (i.e., 3 C-Vine (D-Vine) copulas for three-dimensional variables, 12 D-vine and 12 C-vine copulas for four-dimensional variables, 60 D-vine and 60 C-vine copulas for five-dimensional variables, etc.). To avoid the evaluations for all possible decompositions, we may first look at the rank-based correlation structure, starting from T_1, to achieve the proper vine decomposition.

Similar to the discussion in Section 5.3.5, with the proper study of rank-based correlation structure, we can modify the model selection using sequential MLE (Aas et al., 2009) for decomposition with the tree levels {T₁, T₂, …, T_d − 1}T1T2…Td−1 in what follows:

1. Select the copula family and estimate the parameters for T₁T1 using the original data: (a) the parameters may be estimated using MLE; (b) the best-fitted copula can be selected by minimizing AIC or BIC and assessed with the goodness-of-fit study that will be discussed in Section 5.3.7.
2. Transform observations required in T₂T2 with the use of the copula fitted in T₁T1 and its corresponding h(⋅)h⋅ function.
3. Select the copula family and estimate the parameters for T₂T2. The best-fitted copula in T₂T2 is selected in the same way as in T₁T1.
4. Repeat steps 2 and 3 until we reach T_d − 1Td−1.

Based on the previously discussed model selection, we know the copulas selected do not need to belong to the same copula families (D-vine copula in Example 5.15, as an example). In addition, we should note that the sequential MLE may not result in a globally optimal solution. To avoid this problem, we may estimate all the parameters simultaneously using algorithm 3 for C-vine (algorithm 4 for D-vine) copulas for the selected vine structure with the parameters estimated using the sequential MLE as the initial estimates. Here, we will show how to estimate the parameters simultaneously.

Example 5.17 Re-work Example 5.16: (1) estimate the copula parameters simultaneously using the same decomposition and copula families as Example 5.16; and (2) simulate the random variates for the sample size of 100 from the fitted copula functions.

Solution:

Estimate the copula parameters simultaneously.

Estimate the parameters for D-vine copula.

In Example 5.16, we have estimated the copula parameters sequentially for the D-vine copula as follows:
- T₁ : θ₁₁ = 3.8545; θ₁₂ = 3.0942; θ₁₃ = 4.3949T1:θ11=3.8545;θ12=3.0942;θ13=4.3949 (the Gumbel–Hougaard copula family)
- T₂ : θ₂₁ = 1.9708; θ₂₂ = 0.7916T2:θ21=1.9708;θ22=0.7916 (the Frank copula family)
- T₃ : θ₃₁ = − 0.4281T3:θ31=−0.4281 (the Frank copula family)
To estimate the parameters simultaneously, we apply algorithm 4 (Equation (5.41)) to write the log-likelihood function for the D-vine copula as follows:

L1=∑i=1nlnc12u1iu2iθ11+lnc23u2iu3iθ12+lnc34u3iu4iθ13

v₁₁ = h(u₁, u₂; θ₁₁); v₁₂ = h(u₃, u₂; θ₁₂); v₁₃ = h(u₂, u₃; θ₁₂); v₁₄ = h(u₄, u₃; θ₁₃)
v11=hu1u2θ11;v12=hu3u2θ12;v13=hu2u3θ12;v14=hu4u3θ13

L2=∑i=1nlnc13∣2v11iv12iθ21+lnc34∣2v13iv14iθ22

v₂₁ = h(v₁₁, v₁₂; θ₂₁); v₂₂ = h(v₁₄, v₁₃; θ₂₂)
v21=hv11v12θ21;v22=hv14v13θ22

L3=∑i=1nlnc14∣23v21iv22iθ31
Finally, we have the overall log-likelihood as L = L₁ + L₂ + L₃L=L1+L2+L3, where n is the sample size.

Using the parameters estimated sequentially as initial estimates, we obtain the parameters simultaneously by maximizing the final LL (or equivalently minimizing –L):

θ₁₁ = 3.7723, θ₁₂ = 3.1705, θ₁₃ = 4.3913, θ₂₁ = 1.9931, θ₂₂ = 0.7811, θ₃₁ =  − 0.4325
θ11=3.7723,θ12=3.1705,θ13=4.3913,θ21=1.9931,θ22=0.7811,θ31=−0.4325

Overall log-likelihood is L = L₁ + L₂ + L₃ = 184.988L=L1+L2+L3=184.988

AIC =  − 2L + 2length(Θ) =  − 2(184.988) + 2(6) =  − 357.976
AIC=−2L+2lengthΘ=−2184.988+26=−357.976

BIC =  − 2L +  ln (n)length(Θ) =  − 2(184.988) +  ln (60)(6) =  − 345.409
BIC=−2L+lnnlengthΘ=−2184.988+ln606=−345.409
Estimate the parameters for C-vine copula.

In Example 5.16, we have estimated the copula parameters sequentially for the C-vine copula as follows:
- T₁ : θ₁₁ = 3.8545; θ₁₂ = 3.0834; θ₁₃ = 2.5704T1:θ11=3.8545;θ12=3.0834;θ13=2.5704 (the Gumbel–Hougaard copula family)
- T₂ : θ₂₁ = 1.2618; θ₂₂ = 1.2672T2:θ21=1.2618;θ22=1.2672 (the Gumbel–Hougaard copula family)
- T₃ : θ₃₁ = 1.9590T3:θ31=1.9590 (the Gumbel–Hougaard copula family)
To estimate parameters simultaneously using the C-vine copula, we apply algorithm 3 (Equation (5.40)) to write the likelihood function for the C-vine copula as follows:

L1=∑i=1nlnc12u1iu2iθ11+lnc13u1iu3iθ12+lnc14u1iu4iθ13

v₁₁ = h(u₂, u₁; θ₁₁); v₁₂ = h(u₃, u₁, θ₁₂); v₁₃ = h(u₄, u₁; θ₁₃)
v11=hu2u1θ11;v12=hu3u1θ12;v13=hu4u1θ13

L2=∑i=1nlnc23∣1v11iv12iθ21+lnc24∣1v11v13θ22

v₂₁ = h(v₁₂, v₁₁; θ₂₁); v₂₂ = h(v₁₃, v₁₁; θ₂₂)
v21=hv12v11θ21;v22=hv13v11θ22

L₃ = ln (c_34 ∣ 12(v₂₁, v₂₂; θ₃₁))
L3=lnc34∣12v21v22θ31
Finally, we have the overall log-likelihood as L = L₁ + L₂ + L₃L=L1+L2+L3.

Again, using the parameters estimated sequentially as initial estimates from Example 5.16, we can estimate the parameters simultaneously by maximizing LL (or minimizing –L) as follows:

θ₁₁ = 3.9280, θ₁₂ = 2.9592, θ₁₃ = 2.5509, θ₂₁ = 1.2463, θ₂₂ = 1.2285, θ₃₁ = 2.0333
θ11=3.9280,θ12=2.9592,θ13=2.5509,θ21=1.2463,θ22=1.2285,θ31=2.0333

The log-likelihood is evaluated as follows: L = 181.673, AIC = − 351.346, BIC = − 338.780L=181.673,AIC=−351.346,BIC=−338.780.

From the log-likelihood value, we see that the log-likelihood value obtained from the D-vine copula is slightly higher than that obtained from the C-vine copula. The AIC and BIC values (D-vine) are slightly smaller than those for the C-vine copula.
Simulate random variates

Using the same procedure as in Example 5.16, Figures 5.15(a) and 5.15(b), compare pseudo-observations with those simulated from the D-vine and C-vine copulas, respectively. The simulation plots show a similar comparison between the fitted D-vine and C-vine copulas.

Figure 5.15 (a) Comparison of pseudo-observations with those simulated from the fitted D-vine copula; (b) comparison of pseudo-observations with those simulated from the fitted C-vine copula.

Comparing with Example 5.16, there are minimal differences for the log-likelihood value, AIC and BIC obtained for D-vine and C-vine copulas. In addition, the sequential estimation method is more direct and easier to apply than is the simultaneous estimation method.

5.3.7 Goodness-of-Fit Test

Aas et al. (2009) proposed to use the probability integral transform (PIT, i.e., Rosenblatt’s transform) to test the goodness-of-fit for the pair-copula decomposition. Previously, in Section 5.2.5 we have discussed Rosenblatt’s transform. In what follows, we will illustrate the PIT algorithm for the C-vine and D-vine copulas (Aas et al., 2009). For a d-dimensional random variable x = {x₁, x₂, …x_i…, , x_n}, x_i = {x_{i, 1}, …, x_{i, d}}x=x1x2…xi…xn,xi=xi,1…xi,d, the PIT is defined as follows:

Z1=F(X1≤x1),Z2=F(X2≤x2|X1=x1)…Zi=F(Xi≤xi|X1=x1,…,Xi−1=xi−1)

(5.51)

For a C-vine copula, the conditional distribution is computed using Equation (5.37) as given in algorithm 5 (Aas et al., 2009) that may be explained with dd-dimensional copula variable of sample size nn, as follows:

1. Set z_{1, 1} = x_{1, 1} = u_{1, 1}z1,1=x1,1=u1,1. Here the first subscript represents the dimension, and second represents the sample considered.
2. Use loops to compute z_i, i = 2, …, dzi,i=2,…,d.

for i = 2 to d
fori=2tod

z_{i, 1} = x_{i, 1}
zi,1=xi,1

for j = 1 to d − 1
forj=1tod−1

z_{i, 1} = h(temp, z_{j, 1}; θ_{j, i − j})
zi,1=htempzj,1θj,i−j

end
end

end
end
3. Repeat steps 1 and 2 n times.

The D-vine copula applies Equation (5.38) to compute the conditional distributions for PIT, which is given as algorithm 6 in Aas et al. (2009). It again may be explained for a dd-dimensional D-vine copula variables of sample size nn using x₁ = [x₁₁, x₂₁, …, x_d1]x1=x11x21…xd1 as follows:

1. Set z₁₁ = x₁₁ = u₁₁z11=x11=u11. The subscripts are defined exactly same as those in algorithm 5.
2. Compute the conditional distribution of z_{2, 1} = C_2 ∣ 1z2,1=C2∣1 and C_1 ∣ 2C1∣2:

z₂₁ = h(x₂₁, x₁₁; θ₁₁);
z21=hx21x11θ11;

setting s₂₁ = x₂₁;
settings21=x21;

computings₂₂ = h(x₁₁, x₂₁; θ₁₁)
computing s22=hx11x21θ11
3. Compute the conditional distribution for x₃₁|x₁₁, x₂₁, …x_d1|x_11,. . x_(d − 1)1x31x11x21…xd1x11,..xd−11:
- for i = 3 to dfori=3tod
- z_i1 = h(x_i1, x_(i − 1)1; θ_1(i − 1))zi1=hxi1xi−11θ1i−1 % temporary: representing C_{i ∣ i − 1}Ci∣i−1.
- for j = 2 to i − 1forj=2toi−1
- z_i1 = h(z_i1, s_{i − 1, 2(j − 1)}; θ_{j, i − j})zi1=hzi1si−1,2j−1θj,i−j
- endend
- stop if i = di=d. Otherwise we need to continue the loop
- set s_i1 = x_i1; s_i2 = h(s_(i − 1)1, s_i1; θ_1(i − 1)); s_i3 = h(s_i1, s_(i − 1)1; θ_1(i − 1))si1=xi1;si2=hsi−11si1θ1i−1;si3=hsi1si−11θ1i−1
- for j = 1 to i − 3forj=1toi−3
- s_i(2j + 2) = h(s_{(i − 1)2j}, s_i(2j + 1); θ_{(j + 1)(i − j − 1)});si2j+2=hsi−12jsi2j+1θj+1i−j−1;
- s_i(2j + 3) = h(s_i(2j + 1), s_{(i − 1)2j}; θ_{(j + 1)(i − j − 1)})si2j+3=hsi2j+1si−12jθj+1i−j−1
- end
- s_{i(2i − 2)} = h(s_{(i − 1)(2i − 4)}, s_{i(2i − 3)}; θ_(i − 1)1)si2i−2=hsi−12i−4si2i−3θi−11
- end
4. Repeat steps 1–3 n times.

With the use of the PIT, the goodness-of-fit test may be performed in two ways: by applying the Anderson–Darling test and by applying the new procedure based on PIT proposed by Genest et al. (2007b).

Applying the Anderson–Darling Test

Compared to the new procedure proposed by Genest et al. (2007b), the Anderson–Darling test has inferior performance. However, we are still going to introduce this formal test here. Using the variables after PIT, we define the following:

χ2=χi2=∑j=1dΦ−1Zij2i=12…n,

(5.52a)

where χ²χ2 follows the chi-square distribution with the degree of freedom (d.f.= d; i.e., the dimension of the multivariate random variable). The nonparametric CDF of χ2χ2 computed from Equation (5.52a) may then be estimated as follows:

Gnt=1n+1∑i=1n1χ2≤t,t>0

(5.52b)

Under the null hypothesis of Zs being independent and uniformly distributed, the Anderson–Darling test statistic is given as (Genest et al., 2007a):

Ak=−n−1n∑i=1n2i−1lnGχi2+ln1−Gχn+1−i2,

(5.53)

where χ12≤…≤χn2 are the order statistics corresponding to χ12,…χn2.

To avoid the misidentification of the limiting probability distribution, the P-value is estimated using the parametric bootstrap method for large integer N. Repeat the following steps for every k ∈ {1, …, N}k∈1…N:

a. Generate a random sample X1,k∗,…,Xn,k∗ from the vine copula C_{θ_n}Cθn and compute their associated rank vectors: R1,k∗,…,Rn,k∗.
b. Compute Ui,k∗=Ri,k∗/n+1 for i ∈ {1, …, n}i∈1…n.
c. Reestimate parameters (i.e., θn,k∗) for the vine copula using U1,k∗…Un,k∗ and compute χk∗={χ1,k2∗,…,χn,k2∗}, where χi,k2∗=∑j=1dΦ−1Zij,k∗2 with Zij,k∗ is determined from algorithm 5 or 6 (or simply using Equation (5.11)).
d. Compute the Anderson–Darling test statistics Ak∗ using χk∗ from Equation (5.53).

The approximate P-value for the test is then given by ∑k=1N1Ak∗>A/N.

Applying the New Procedure Based on PIT Proposed by Genest et al. (2007b)

As discussed in Section 4.7.1, the null hypothesis is ZZ (after Rosenblatt’s transform), being close to C_⊥C⊥, where Z = {Z₁, …Z_i, …, Z_n}, Z_i = {Z₁, Z₂, …, Z_d}Z=Z1…Zi…Zn,Zi=Z1Z2…Zd as follows:

1. Compute D_nDn and test statistics SnB using the fitted copula model as follows:

Dnu=1n∑i=1nZi≤u,u∈01d(5.54)

SnB=n∫01dDnu−C⊥u2du=n3d−12d−1∑i=1n∏k=1d1−Zik2+1n∑i=1n∑j=1n∏k=1d1−Zik∨Zjk(5.55)

where Z_ik ∨ Z_jk = max (Z_ik, Z_jk)Zik∨Zjk=maxZikZjk.
2. For some large integer N, repeat the following steps for k = {1, 2, …, N}k=12…N:
1. a. Generate a random sample X1,k∗,…,Xn,k∗ from the vine copula C_{θ_n}Cθn and compute their associated rank vectors: R1,k∗,…,Rn,k∗.
2. b. Compute Ui,k∗=Ri,k∗/n+1 for i ∈ {1, …, n}.i∈1…n.
3. c. Reestimate parameters (i.e., θn,k∗) for the vine copula using U1,k∗…Un,k∗ and compute Z1,k∗,…,Zn,k∗ using an appropriate algorithm (algorithm 5 or 6) or simply using Equation (5.11).
4. d. Compute Dn,k∗ and Sn,kB∗ using Equations (5.54) and (5.55) with reestimated parameter θn,k∗.

The appropriate P-value for the test is then given as follows: ∑k=1N1Sn,kB∗>SnB/N.

Example 5.18 Assess the GoF for the C- or D-vine copula constructed in Example 5.15 for trivariate analysis with both the Anderson–Darling test and the new procedure based on PIT proposed by Genest et al. (2007b) discussed in the preceding section.

Solution: Previously, we have shown that in the case of trivariate random variables, it is indifferent between C- and D-vine copulas. From Example 5.15, we have estimated the parameters for the Clayton–Clayton–Frank copula sequentially as follows:

T₁: Clayton copula— θ₁₁ = 4.1728 (u₁, u₂); θ₁₂ = 8.3834 (u₂, u₃)θ11=4.1728u1u2;θ12=8.3834u2u3
T₂: Frank copula— θ₂₁ = − 3.8431 (C_1 ∣ 2, C_3 ∣ 2)θ21=−3.8431C1∣2C3∣2

Based on the Rosenblatt transform, Equation (5.53) may be rewritten for the three-dimensional C- or D-vine copulas as follows:

Z1=u1Z2=Cu2U1=u1Z3=Cu3U1=u1U2=u2=∂C13∣2C3∣2C1∣2∂C1∣2

(5.56)

With the estimated parameters using the sequential MLE and Equation (5.56), Table 5.7 lists Zs along with test statistics.

Table 5.7. Computed Zs and corresponding test statistics for three-dimensional C- or D-vine copulas.

	Observed
u₁	u₂	u₃	Z₁	Z₂	Z₃
0.241	0.138	0.103	0.241	0.049	0.152
0.241	0.172	0.172	0.241	0.134	0.690
0.241	0.241	0.276	0.241	0.424	0.723
0.241	0.586	0.655	0.241	0.973	0.304
0.793	0.828	0.897	0.793	0.627	0.793
0.483	0.345	0.379	0.483	0.135	0.869
0.931	0.914	0.621	0.931	0.696	0.042
0.724	0.759	0.724	0.724	0.575	0.306
0.414	0.621	0.586	0.414	0.833	0.087
0.759	0.414	0.310	0.759	0.041	0.176
0.862	0.793	0.793	0.862	0.458	0.687
0.655	0.517	0.448	0.655	0.211	0.281
0.414	0.379	0.552	0.414	0.336	0.974
0.569	0.448	0.414	0.569	0.204	0.419
0.569	0.690	0.690	0.569	0.687	0.252
0.414	0.310	0.241	0.414	0.164	0.120
0.241	0.552	0.862	0.241	0.965	0.933
0.069	0.034	0.034	0.069	0.026	0.813
0.241	0.276	0.345	0.241	0.572	0.790
0.069	0.069	0.069	0.069	0.423	0.378
0.897	0.914	0.931	0.897	0.730	0.841
0.655	0.655	0.483	0.655	0.473	0.029
0.069	0.103	0.138	0.069	0.811	0.771
0.241	0.207	0.207	0.241	0.267	0.538
0.655	0.724	0.759	0.655	0.611	0.482
0.517	0.483	0.517	0.517	0.362	0.653
0.828	0.862	0.828	0.828	0.665	0.448
0.966	0.966	0.966	0.966	0.854	0.949

Notes:

Anderson–Darling test statistic: An = 0.3572, P = 0.878 (with N = 1,000).

Rosenblatt (SnB) test statistic: SnB = 0.0417, P = 0.532 (with N = 1,000).

The formal GoF results using the Anderson–Darling and SnB tests show that with 1,000 parametric bootstrap simulations, the fitted Clayton–Clayton–Frank copula may properly model the dependence of the studied trivariate random variables.

Example 5.19 Assess the GoF for the D- and C-vine copulas constructed in Example 5.16 with both of the two GoF approaches previously discussed.

Solution:

1. D-vine copula

For the four-dimensional random variable, the parameters were estimated sequentially for the D-vine copula in Example 5.16 as follows:
- T₁:T1: Gumbel–Hougaard copula
- θ₁₁ = 3.8545 (u₁, u₂), θ₁₂ = 3.0942 (u₂, u₃), θ₁₃ = 4.3949 (u₃, u₄);θ11=3.8545u1u2,θ12=3.0942u2u3,θ13=4.3949u3u4;
- T₂:T2: Frank copula
- θ₂₁ = 1.9708 (C_1 ∣ 2, C_3 ∣ 2), θ₂₂ = 0.7916 (C_2 ∣ 3, C_4 ∣ 3);θ21=1.9708C1∣2C3∣2,θ22=0.7916C2∣3C4∣3;
- T₃:T3: Frank copula
- θ₃₁ = − 0.4281 (C_1 ∣ 23, C_4 ∣ 23)θ31=−0.4281C1∣23C4∣23.
Now based on the PIT, Equation (5.53) can be rewritten for the four-dimensional D-vine copula as follows:

Z1=u1Z2=Cu2U1=u1=∂Cu1u2θ11∂u1Z3=Cu3U1=u1U2=u2=∂C13∣2C3∣2C1∣2∂C1∣2Z4=Cu4U1=u1U2=u2U3=u3=∂C14∣23C4∣23C1∣23∂C1∣23(5.57)

Table 5.8 lists the computed values from PIT using Equation (5.57) with the fitted four-dimensional D-vine copula.

Table 5.8. Computed Zs and corresponding test statistics for the D-vine copula.

Z₁Z1	Z₂Z2	Z₃Z3	Z₄Z4
0.194	0.804	0.801	0.828
0.819	0.939	0.150	0.278
0.614	0.620	0.513	0.685
0.235	0.368	0.638	0.499
0.792	0.406	0.882	0.841
0.433	0.722	0.717	0.275
0.130	0.638	0.214	0.301
0.570	0.574	0.956	0.141
0.128	0.819	0.596	0.101
0.218	0.145	0.670	0.951
0.468	0.263	0.370	0.729
0.490	0.373	0.337	0.827
0.194	0.102	0.030	0.588
0.120	0.744	0.458	0.754
0.676	0.322	0.843	0.271
0.990	0.589	0.683	0.713
0.657	0.894	0.996	0.762
0.226	0.287	0.646	0.101
0.828	0.743	0.546	0.959
0.373	0.477	0.081	0.724
0.698	0.412	0.690	0.138
0.645	0.830	0.410	0.928
0.025	0.531	0.304	0.943
0.298	0.477	0.820	0.330
0.906	0.863	0.890	0.470
0.658	0.127	0.519	0.848
0.302	0.122	0.430	0.125
0.581	0.129	0.943	0.839
0.371	0.655	0.095	0.245
0.169	0.980	0.115	0.726
0.041	0.594	0.034	0.791
0.982	0.410	0.039	0.637
0.585	0.008	0.584	0.562
0.618	0.898	0.462	0.354
0.280	0.977	0.276	0.851
0.902	0.992	0.098	0.686
0.440	0.925	0.596	0.981
0.243	0.180	0.647	0.963
0.044	0.567	0.798	0.046
0.122	0.503	0.701	0.306
0.497	0.867	0.423	0.495
0.701	0.365	0.752	0.133
0.323	0.906	0.987	0.544
0.013	0.622	0.599	0.214
0.651	0.774	0.812	0.118
0.190	0.736	0.996	0.897
0.520	0.975	0.393	0.165
0.926	0.793	0.287	0.062
0.468	0.459	0.646	0.858
0.868	0.766	0.663	0.660
0.422	0.973	0.811	0.076
0.888	0.251	0.613	0.301
0.372	0.918	0.940	0.812
0.132	0.335	0.293	0.114
0.429	0.178	0.179	0.156
0.390	0.434	0.478	0.059
0.983	0.765	0.902	0.495
0.980	0.917	0.310	0.401
0.308	0.503	0.122	0.562
0.932	0.370	0.786	0.452

Notes:

An (Equation 4.55): An = 0.7411, P-value = 0.261.

SnB (Equation 4.56): SnB = 0.0362, P-value = 0.08.

To approximate the P-value using the parametric bootstrap simulation method, we will use N = 1,000 as an example. It is known that the larger the N value, the closer to the true P-value for the GoF study.

2. C-vine copula

For the four-dimensional random variable, parameters were estimated sequentially for the C-vine copula in Example 5.15 as follows:

T₁ (Gumbel): θ₁₁ = 3.8545, (u₁, u₂)θ11=3.8545,u1u2; θ₁₂ = 3.0834, (u₁, u₃)θ12=3.0834,u1u3; θ₁₃ = 2.5704, (u₁, u₄)θ13=2.5704,u1u4.
T2 (Gumbel): θ₂₁ = 1.2618 (C_2 ∣ 1, C_3 ∣ 1); θ₂₂ = 1.2672 (C_2 ∣ 1, C_4 ∣ 1)θ21=1.2618C2∣1C3∣1;θ22=1.2672C2∣1C4∣1
T3 (Gumbel): θ₃₁ = 1.9590 (C_3 ∣ 12, C_4 ∣ 12)θ31=1.9590C3∣12C4∣12

According to the C-vine structure, the PIT of Equation (5.57) is rewritten as follows:

Z1=u1Z2=Cu2U1=u1=∂Cu1u2∂u1Z3=Cu3U1=u1U2=u2=∂CC3∣1C2∣1∂C2∣1Z4=Cu4U1=u1U2=u2U3=u3=∂CC4∣21C3∣21∂C3∣21

(5.58)

Table 5.9 lists the computed Zs and corresponding test statistics for the fitted C-vine copula.

Table 5.9. Computed Zs and the corresponding test statistics for the fitted C-vine copula.

Z₁Z1	Z₂Z2	Z₃Z3	Z₄Z4
0.194	0.804	0.805	0.846
0.819	0.939	0.157	0.296
0.614	0.620	0.539	0.727
0.235	0.368	0.654	0.511
0.792	0.406	0.905	0.859
0.433	0.722	0.731	0.306
0.130	0.638	0.223	0.319
0.570	0.574	0.972	0.102
0.128	0.819	0.609	0.140
0.218	0.145	0.667	0.971
0.468	0.263	0.378	0.724
0.490	0.373	0.357	0.832
0.194	0.102	0.025	0.409
0.120	0.744	0.472	0.786
0.676	0.322	0.862	0.257
0.990	0.589	0.719	0.771
0.657	0.894	0.998	0.461
0.226	0.287	0.660	0.114
0.828	0.743	0.568	0.978
0.373	0.477	0.086	0.617
0.698	0.412	0.717	0.169
0.645	0.830	0.420	0.936
0.025	0.531	0.314	0.969
0.298	0.477	0.843	0.314
0.906	0.863	0.904	0.448
0.658	0.127	0.441	0.838
0.302	0.122	0.401	0.137
0.581	0.129	0.940	0.816
0.371	0.655	0.098	0.231
0.169	0.980	0.213	0.596
0.041	0.594	0.041	0.749
0.982	0.410	0.048	0.416
0.585	0.008	0.232	0.469
0.618	0.898	0.471	0.416
0.280	0.977	0.354	0.743
0.902	0.992	0.138	0.476
0.440	0.925	0.599	0.987
0.243	0.180	0.650	0.981
0.044	0.567	0.801	0.062
0.122	0.503	0.707	0.318
0.497	0.867	0.435	0.531
0.701	0.365	0.774	0.154
0.323	0.906	0.991	0.289
0.013	0.622	0.614	0.249
0.651	0.774	0.830	0.140
0.190	0.736	0.998	0.728
0.520	0.975	0.429	0.250
0.926	0.793	0.297	0.135
0.468	0.459	0.673	0.895
0.868	0.766	0.686	0.717
0.422	0.973	0.779	0.133
0.888	0.251	0.586	0.360
0.372	0.918	0.942	0.742
0.132	0.335	0.293	0.125
0.429	0.178	0.175	0.168
0.390	0.434	0.500	0.081
0.983	0.765	0.925	0.468
0.980	0.917	0.314	0.442
0.308	0.503	0.128	0.506
0.932	0.370	0.800	0.474

Notes:

An (Equation 4.53): An = 0.7365, P-value = 0.276 (with N = 1,000).

SnB (Equation 4.54): SnB = 0.03, P-value = 0.415 (with N = 1,000).

5.3.8 JCDF for d-Dimensional Vine Copulas

Let X = {X₁, …, X_d}X=X1…Xd be a random vector with marginal distributions F_i(x_i) = u_i = P(X_i ≤ x_i)Fixi=ui=PXi≤xi and conditional distributions F(x_i| x₁, …, x_i − 1) = P(X_i ≤ x_i| X₁ ≤ x₁, …, X_i − 1 ≤ x_i − 1)Fxix1…xi−1=PXi≤xiX1≤x1…Xi−1≤xi−1. From the probability theory, the joint probability distribution F(x₁, x₂, …, x_d)Fx1x2…xd can be expressed as follows:

F(x₁, …, x_d) = P(X₁ ≤ x₁, …, X_d ≤ x_d) = C(U₁ ≤ u₁, …, U_d ≤ u_d)

Fx1…xd=PX1≤x1…Xd≤xd=CU1≤u1…Ud≤ud

Then, with the given vine-copula structure, the joint probability distribution may be evaluated starting from the top-level T_d − 1Td−1 of the given pair-copula decomposition. In what follows, we illustrate how to derive the JCDF for C-vine and D-vine copulas using three-dimensional, four-dimensional, and five-dimensional random variables as examples.

JCDF for Three-Dimensional Variables

Using Figure 5.10(a) as an example and applying the total probability theory, we have the following:

F(x₁, x₂, x₃) = P(X₁ ≤ x₁, X₃ ≤ x₃| X₂ ≤ x₂)P₂(x₂)

Fx1x2x3=PX1≤x1X3≤x3X2≤x2P2x2(5.59a)

Let u₁ = F₁(x₁), u₂ = F₂(x₂), u₃ = F₃(x₃)u1=F1x1,u2=F2x2,u3=F3x3 and θ₁₁, θ₁₂, θ₂₁θ11,θ12,θ21 represent the copula parameters for (u₁, u₂); (u₂, u₃); and (u₁|u₂, u₃|u₂)u1u2;u2u3;andu1u2u3u2, respectively. Then, we have the following:

P(X₁ ≤ x₁, X₃ ≤ 3₃| X₂ ≤ x₂) = C_{1, 3 ∣ 2}(C_1 ∣ 2(U₁ ≤ u₁| U₂ ≤ u₂), C_3 ∣ 2(U₃ ≤ u₃| U₂ ≤ u₂); θ₂₁)

PX1≤x1X3≤x3X2≤x2=C1,3∣2C1∣2U1≤u1U2≤u2C3∣2U3≤u3U2≤u2θ21(5.59b)

C1∣2U1≤u1U2≤u2=Cu1u2θ11u2;C3∣2U3≤u3U2≤u2=Cu2u3θ12u2

(5.59c)

JCDF for Four-Dimensional D-Vine Variables

Using Figure 5.11(a) as an example, we have the following:

F(x₁, x₂, x₃, x₄) = P(X₁ ≤ x₁, X₄ ≤ x₄| X₂ ≤ x₂, X₃ ≤ x₃)C(u₂, u₃)

Fx1x2x3x4=PX1≤x1X4≤x4X2≤x2X3≤x3Cu2u3(5.60a)

Let θ₁₁, θ₁₂, θ₁₃, θ₂₁, θ₂₂, θ₃₁θ11,θ12,θ13,θ21,θ22,θ31 represent the copula parameters for T₁, T₂, and T₃, respectively. Then we have the following:

P(X₁ ≤ x₁, X₄ ≤ x₄| X₂ ≤ x₂, X₃ ≤ x₃)

PX1≤x1X4≤x4X2≤x2X3≤x3

=C_14 ∣ 23(C_1 ∣ 23(u₁| U₂ ≤ u₂, U₃ ≤ u₃), C_4 ∣ 23(u₄| U₂ ≤ u₂, U₃ ≤ u₃); θ₃₁)

=C14∣23C1∣23u1U2≤u2U3≤u3C4∣23u4U2≤u2U3≤u3θ31(5.60b)

C1∣23u1U2≤u2U3≤u3=C13∣2Cu1u2θ11u2Cu2u3θ12u2θ21/Cu2u3θ12u2

(5.60c)

C4∣23u4U2≤u2U3≤u3=C24∣3Cu3u4θ13u3Cu2u3θ12u3θ22/Cu2u3θ12u3

(5.60d)

JCDF for Four-Dimensional C-Vine Variables

Using Figure 5.11(b) as an example, Equation (5.60a) can be rearranged as follows:

F(x₁, x₂, x₃, x₄) = P(X₃ ≤ x₃, X₄ ≤ x₄| X₁ ≤ x₁, X₂ ≤ x₂)C(u₁, u₂)

Fx1x2x3x4=PX3≤x3X4≤x4X1≤x1X2≤x2Cu1u2(5.61a)

Let θ₁₁, θ₁₂, θ₁₃, θ₂₁, θ₂₂, θ₃₁θ11,θ12,θ13,θ21,θ22,θ31 represent the copula parameters for T₁, T₂, and T₃, respectively. Then we have the following:

P(X₃ ≤ x₃, X₄ ≤ x₄| X₁ ≤ x₁, X₂ ≤ x₂)

PX3≤x3X4≤x4X1≤x1X2≤x2

=C_34 ∣ 12(C_3 ∣ 12(u₃| U₁ ≤ u₁, U₂ ≤ u₂), C_4 ∣ 12(u₄| U₁ ≤ u₁, U₂ ≤ u₂); θ₃₁)

=C34∣12C3∣12u3U1≤u1U2≤u2C4∣12u4U1≤u1U2≤u2θ31(5.61b)

C3∣12u3U1≤u1U2≤u2=C23∣1Cu1u2θ11u1Cu1u3θ12u1θ21/Cu1u2θ11u1

(5.61c)

C4∣12u4U1≤u1U2≤u2=C24∣1Cu1u4θ13u1Cu1u2θ12u1θ22/Cu1u2θ11u1

(5.61d)

JCDF for Five-Dimensional D-Vine Variables

Using Figure 5.8 as an example, we have the following:

F(x₁, x₂, x₃, x₄, x₅) = P(X₁ ≤ x₁, X₅ ≤ x₅| X₂ ≤ x₂, X₃ ≤ x₃, X₄ ≤ x₄)P(x₂, x₃, x₄)

Fx1x2x3x4x5=PX1≤x1X5≤x5X2≤x2X3≤x3X4≤x4Px2x3x4(5.62a)

Let {θ₁₁, θ₁₂, θ₁₃, θ₁₄}, {θ₂₁, θ₂₂, θ₂₃}, {θ₃₁, θ₃₂}, θ₄₁θ11θ12θ13θ14,θ21θ22θ23,θ31θ32,θ41 represent the copula parameters for T₁, T₂, T₃, and T₄ respectively. Then we have the following:

Px2x3x4=Cu2u3u4=C24∣3u2u4U3≤u3u3=C24∣3Cu2u3θ12u3Cu3u4θ13u3θ22

(5.62b)

PX1≤x1X5≤x5X2≤x2X3≤x3X4≤x4=C15∣234Px1X2≤x2X3≤x3X4≤x4Px5X2≤x2X3≤x3X4≤x4θ41

(5.62c)

PX1≤x1X2≤x2X3≤x3X4≤x4=C14∣23Px1X2≤x2X3≤x3Px4X2≤x2X3≤x3θ31

(5.62d)

PX5≤x5X2≤x2X3≤x3X4≤x4=C25∣34Px5X3≤x3X4≤x4Px2X3≤x3X4≤x4θ32

(5.62e)

PX1≤x1X2≤x2X3≤x3=C13∣2Cu1u2θ11u2Cu2u3θ12u2θ21/Cu2u3θ12u2

(5.62f)

PX4≤x4X2≤x2X3≤x3=C24∣3Cu3u4θ13u3Cu2u3θ12u3θ22/Cu2u3θ12u3

(5.62g)

PX2≤x2X3≤x3X4≤x4=C24∣3Cu2u3θ12u3Cu3u4θ13u3θ22/Cu3u4θ13u3

(5.62h)

PX5≤x5X3≤x3X4≤x4=C35∣4Cu3u4θ13u4Cu4u5θ14u4θ23/Cu4u5θ14u4

(5.62i)

JCDF for Five-Dimensional C-Vine Variables

Using Figure 5.9 as an example, we have the following:

Fx1x2x3x4x5=PX1≤x1…X5≤x5=PX4≤x4X5≤x5X1≤x1X2≤x2X3≤x3Px1x2x3

(5.63a)

Fx1x2x3=Cu1u2u3=C23∣1Cu1u2θ11u1Cu1u3θ12u1θ21u1

(5.63b)

PX4≤x4X5≤x5X1≤x1X2≤x2X3≤x3=C45∣123PX4≤x4X1≤x1X2≤x2X3≤x3PX5≤x5X1≤x1X2≤x2X3≤x3θ41

(5.63c)

PX4≤x4X1≤x1X2≤x2X3≤x3=C34∣12PX4≤x4X1≤x1X2≤x2PX3≤x3X1≤x1X2≤x2θ31

(5.63d)

PX5≤x5X1≤x1X2≤x2X3≤x3=C35∣12PX5≤x5X1≤x1X2≤x2PX3≤x3X1≤x1X2≤x2θ32

(5.63e)

PX3≤x3X1≤x1X2≤x2=C23∣1Cu1u3θ12u1Cu1u2θ11u1θ21Cu1u2θ11u1

(5.63f)

PX4≤x4X1≤x1X2≤x2=C24∣1Cu1u4θ13u1Cu1u2θ11u1θ22Cu1u2θ11u1

(5.63g)

PX5≤x5X1≤x1X2≤x2=C25∣1Cu1u5θ14u1Cu1u2θ11u1θ23Cu1u2θ11u1

(5.63h)

Example 5.20 Compute the JCDF and compare it with the empirical JCDF, using the data and vine copula constructed in Example 5.15.

Solution: The empirical copula can be computed using the following:

Cnu=1n∑i=1nui1≤u1ui2≤u2ui3≤u3;u=u1u2u3

(5.64)

Applying the parameters estimated for the vine structure in Example 5.14, we have the joint distribution function for the given Clayton–Clayton–Frank vine copula as follows:

JCDF=u23.8431ln1+e3.8431A−1e3.8431B−1e3.8431−1

where

A=Cu1U2≤u2=u1−4.1728+u2−4.1728−1−14.1728u2

B=Cu3U2≤u2=u2−8.3834+u3−8.3834−1−18.3834u2

The quantile-quantile (QQ) plot shown in Figure 5.16 shows that the JCDF estimated from the vine copula underestimates the joint distribution.

Figure 5.16 Comparison of empirical JCDF versus JCDF computed from the vine copula.

It should be noted that we have only shown how to compute the joint CDF from vine copula in this chapter. In the application chapters that follow, we will further discuss joint and conditional return periods obtained from copula using real-world examples.

5.4 Summary

This chapter focuses on the theoretical aspects of the asymmetric Archimedean copula for the analysis in higher dimensions. Two types of asymmetric Archimedean copulas are discussed: (1) nested Archimedean copulas; and (2) vine copulas.

The nested Archimedean copulas include fully nested, partially nested, and general nested Archimedean copulas. Nested Archimedean copulas (NAC) requires the following: (i) the nested generating function must be completely monotonic; and (ii) with the increasing levels in the NAC structure, the dependence of the upper level needs to be weaker than the lower level. Compared to the symmetric Archimedean copulas (i.e., EAC forcing all the variables to share the same degree of pair dependence), the NAC is more flexible and may better model the dependence structure.

Vine copula includes D-vine, C-vine, and R-vine copulas. A vine copula is constructed based on the multivariate probability density decomposition. With the bivariate copula as the building block for the vine copula, the vine copula allows the free identification of the bivariate copula for each pair of variables for each level in the vine structure. Compared to EAC and NAC, the vine copula is most flexible, with D-vine copulas being more flexible than C-vine copulas. With the flexibility offered by the vine copula, the copula modeling in higher dimensions may also be computationally time consuming.

References

Aas, K., Czado, C., Frigessi, A., and Bakken, H. (2009). Pair-copula constructions of multiple dependence. Insurance: Mathematics and Economics, 44, 182–198, doi:10.1016/j.insmatheco.2007.02.001.

Bedford, T. and Cooke, R. M. (2001). Probability density decomposition for conditionally dependent random variables modeled by vines. Annals of Mathematics and Artificial Intelligence, (32), 245–268.

Bedford, T. and Cooke, R. M. (2002). Vines – a new graphical model for dependent random variables. Annals of Statistics, (30), 1031–1068.

Berg, D. and Aas, K. (2007), Models for construction of multivariate dependence, Technical report, Norwegian Computing Center.

Embrechts, P., Lindskog, F., and McNeil, A. (2003). Modelling dependence with copulas and applications to risk management. In Rachev, S. T. ed. Handbook of Heavy Tailed Distributions in Finance. North-Holland: Elsevier.

Frees, E. W. and Valdez, E. A. (1998). Understanding relationships using copulas. North American Actuarial Journal, 2(1), 1–25

Genest, C. and Favre, A.-C. (2007). Everything you always wanted to know about copula modeling but were afraid to ask. Journal of Hydrologic Engineering, 12(4), 347–368.

Genest, C., Favre, A.-C., Beliveau, J., and Jacques, C. (2007a). Metaelliptical copulas and their uses in frequency analysis of multivariate hydrological data. Water Resources Research, 43, W09401, doi:10.1029/2006WR005275.

Genest, C., Rémillard, B., and Beaudoin, D. (2007b). Goodness-of-fit tests for copulas: A review and a power study. Insurance: Mathematics and Economics. doi:10.1016/j.insmatheco.2007.10.005.

Joe, H. (1996). Families of m-variate distributions with given margins and m(m-1)/2 bivariate dependence parameters. In R¨uschendorf, L., Schweizer B., and Taylor, M. D., ed. Distributions with Fixed Marginals and Related Topics. Institute of Mathematical Statistics, Hayward, CA, 120–141.

Joe, H. (1997). Multivariate Models and Dependence Concept. Chapman & Hall, New York.

Kurowicka, D. and Cooke, R. M. (2004). Distribution – free continuous Bayesian belief nets. In Fourth International Conference on Mathematical Methods in Reliability Methodology and Practice. Wiley, Santa Fe, 309–322.

Kurowicka, D. and Cooke, R. M. (2006). Uncertainty Analysis with High Dimensional Dependence Modelling. Wiley, New York.

McNeil, A. J. (2007). Sampling nested Archimedean copulas. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.395.5498&rep=rep1&type=pdf Nelsen, R. B. (2006). An Introduction to Copulas. Springer-Verlag, New York.

Rosenblatt, M. (1952). Remarks on a multivariate transformation. Annals of Mathematical Statistics, 23(3), 470–472.

Savu, C. and Trede, M. (2010). Hierarchies of Archimedean copulas. Quantative Finance, 10, 295–304.

Whelan, N. (2004). Sampling from Archimedean copulas. Quantitat Finance, 4(3), 339–52.

Additional Reading

Francesco, S. and Salvatore G. (2007). Fully nested 3-copula: procedure and application on hydrological data. Journal of Hydrologic Engineering, 12(4), 420–430.

Salvatori, G. and Francesco, S. (2006). Asymmetric copula in multivariate flood frequency analysis. Advanced in Water Resources, 29, 1155–1167.

Salvadori, G., De Michele, C., Kottegoda, N., and Rosso, R. (2007). Extremes in Nature: An Approach Using Copulas. Water Science and Technology Library, Vol. 56, Springer, Dordrecht.

Salvadori, G. and De Michele, C. (2007), On the use of copulas in hydrology: theory and practice. Journal of Hydrologic Engineering, 12(4), 369– 380.

Appendix

With the use of Example 5.8, the density functions for M3, M4, M5, M6, and M12 copulas are derived.

M3 Copula

∂C∂u1=e−θ2u1S2e−θ1u3−1e−θ2u2−1e−θ1−1e−θ2−1S1S2−1e−θ1u3−1e−θ1−1+1

(M3–1)

∂2C∂u1∂u2=θ1e−θ2u1+u2S22e−θ2u1−1e−θ1u3−12e−θ2u2−1e−θ2−12S12s2−1e−θ3u3−1+e−θ1−12−θ2e−θ2u1+u2S2e−θ1u3−1e−θ2−1S1s2−1e−θ1u3−1+e−θ1−1+θ2−θ1e−θ2u1+u2S2e−θ2u1−1e−θ2u2−1e−θ1u3−1e−θ2−12S12S2−1e−θ1u3−1+e−θ1−1

(M3–2)

∂3C∂u1∂u2∂u3=θ1θ2e−θ2u1+u2−θ1u3S2e−θ2−1S1S3+θ12−θ1θ2e−θ2u1+u2−θ1u3S2e−θ2u1−1e−θ2u2−1e−θ2−12S12S3−θ1θ2e−θ2u1+u2−θ1u3S2S2−1e−θ1u3−1e−θ2−1S1S3+2θ12e−θ2u1+u2−θ1u3S2S2−1e−θ2u1−1e−θ2u2−1e−θ1u3−1e−θ1−1e−θ2−12S12S33+θ1S2θ2S2−3θ1S2+θ1−θ2e−θ2u1+u2−θ1u3e−θ2u1−1e−θ2u2−1e−θ1u3−1e−θ2−12S12S32

(M3–3)

where

S1=e−θ2u1−1e−θ2u2−1e−θ2−1+1;S2=S1θ1θ2;S3=S2−1e−θ1u3−1+e−θ1−1

M4 Copula

∂C∂u1=u1−θ2−1u1−θ2+u2−θ2−1θ1θ2−1u1−θ2+u2−θ2−1θ1θ2+u3−θ1−1−1θ1−1

(M4–1)

∂2C∂u1∂u2=u1−θ2−1u2−θ2−1u1−θ2+u2−θ2−1θ1θ2−2u1−θ2+u2−θ2−1θ1θ2+u3−θ1−1−1θ1−1−θ1−θ2+1+θ1u1−θ2+u2−θ2−1θ1θ2u1−θ2+u2−θ2−1θ1θ2+u3−θ1−1−1

(M4–2)

∂3C∂u1∂u2∂u3=1+θ1u1u2−θ2−1u3−θ1−1u1−θ2+u2−θ2−1θ1θ2−2u1−θ2+u2−θ2−1θ1θ2+u3−θ1−1−1θ1−2

−θ1−θ2+1+2θ1u1−θ2+u2−θ2−1θ1θ2u1−θ2+u2−θ2−1θ1θ2+u3−θ1−1−1

(M4–3)

M5 Copula

∂C∂u1=1−u1θ2−1−1−u1θ2−11−u2θ21−u1θ2+1−u2θ2−1−u1θ21−u22θθ1θ2−1{1−u1θ2+1−u2θ2−1−u1θ21−u2θ2θ1θ21−1−u3θ1+1−u3θ1}1θ1−11−1−u3θ1

(M5–1)

∂2C∂u1∂u2=G1G2+G3w1θ1−1−1+1−u3θ1+G4G5w1θ1−2−1+1−u3θ12

(M5–2)

∂3C∂u1∂u2∂u3=G1G2+G3−θ11−u3θ1−1w1θ1−1+−1+1−u3θ11θ1−1w1θ1−2∂w∂u3+G4G5(2−1+1−u3θ1−θ11−u3θ1−1w1θ1−2+−1+1−u3θ121θ1−2w1θ1−2w1θ1−3∂w∂u3)

(M5–3)

where

w=1−u1θ2+1−u2θ2−1−u1θ21−u2θ2θ1θ2−1+1−u3θ1+1−u3θ1

G1=1−u1θ2−11−u2θ2−11−u1θ2+1−u2θ2−1−u1θ21−u2θ2θ1θ2−2

G₂ = (θ₁ − 1)(1 − (1 − u₁)^θ₂ − (1 − u₂)^θ₂ + (1 − u₁)^θ₂(1 − u₂)^θ₂)

G2=θ1−11−1−u1θ2−1−u2θ2+1−u1θ21−u2θ2

G₃ = − θ₂ + 1 − (1 − u₁)^θ₂ − (1 − u₂)^θ₂ + ((1 − u₁)(1 − u₂))^θ₂

G3=−θ2+1−1−u1θ2−1−u2θ2+1−u11−u2θ2

G₄ = (θ₁ − 1)(1 − u₁)^{θ₂ − 1}(1 − u₂)^{θ₂ − 1}(−1 + (1 − u₁)^θ₂)(−1 + (1 − u₂)^θ₂)

G4=θ1−11−u1θ2−11−u2θ2−1−1+1−u1θ2−1+1−u2θ2

G5=1−u1θ2+1−u2θ2−1−u1θ21−u2θ22θ1θ2−2

∂w∂u3=θ11−u3θ1−11−u1θ2+1−u2θ2−1−u1θ21−u2θ2θ1θ2−1−1

M6 Copula

∂C∂u1=1u1−lnu1θ2−1Gθ1θ2−1w1θ1−1e−w−1θ1

(M6–1)

∂2C∂u1∂u2=1u1u2−lnu1θ2−1−lnu2θ2−1e−w−1θ1G2θ1θ2−2w2θ1−2+θ2−θ1Gθ1θ2−2w1θ1−1+θ1−1G2θ1θ2−2w1θ1−2

(M6–2)

∂3C∂u1∂u2∂u3=1u1u2u3−lnu1θ2−1−lnu2θ2−1−lnu3θ1−1e−w1θ1(G2θ1θ2−2w3θ1−3+2θ1−2w2θ1−3+θ2−θ1Gθ1θ2−2w2θ1−2+θ1−12θ1−1G2θ1θ2−2w1θ1−3+θ1−1G2θ1θ2−2w2θ1−3+θ1−1θ2−θ1Gθ1θ2−2w1θ1−2)

(M6-3)

where

G=−lnu1θ2+−lnu2θ2;w=−lnu3θ1+−lnu1θ2+−lnu2θ2θ1θ2

M12 Copula

∂C∂u1=u1−1−1θ2−1u1−1−1θ2+u2−1−1θ2θ1θ2−1w1θ1−1u121+w1θ12

(M12–1)

∂2C∂u1∂u2=u1−1−1θ2−1u2−1−1θ2−1u12u22u1−1−1θ2+u2−1−1θ2θ1θ2−2(θ2−θ1w1θ1−11+w1θ12+θ1−1u1−1−1θ2+u2−1−1θ2θ1θ2w1θ1−21+w1θ12+2u1−1−1θ2+u2−1−1θ2θ1θ2w2θ1−21+w1θ13)

(M12–2)

∂3C∂u1∂u2∂u3=u1−1−1θ2−1u2−1−1θ2−1u3−1−1θ1−1u12u22u32u1−1−1θ2+u2−1−1θ2θ1θ2−2(θ2−θ1θ1−11+w1θ12w1θ1−2+21+w1θ1w2θ1−21+w1θ14+θ1−1u1−1−1θ2+u2−1−1θ2θ1θ22θ1−11+w1θ12w1θ1−3+21+w1θ1w2θ1−31+w1θ14+2u1−1−1θ2+u21−1−1θ2θ1θ22θ1−21+w1θ13w2θ1−3+31+w1θ12w3θ1−31+w1θ16)

where: w=u1−1−1θ2+u2−1−1θ2θ1θ2+u3−1−1θ1

Tags: Copulas and their Applications in Water Resources Engineering

Oct 12, 2020 | Posted by drzezo in Water and Sewage | Comments Off

Civil Engineer Key

Fastest Civil Engineer Engine

5 – Asymmetric Copulas

Abstract

5.1 Construction of Higher-Dimensional Copulas

5.2 Nested Archimedean Copulas (NAC)

5.2.1 Fully Nested Archimedean Copulas (FNAC)

Example 5.2 Derive the M4 copula using information given in Example 5.1.

Example 5.3 Derive M5 copula using the information given in Example 5.1.

Example 5.4 Derive the M6 copula using the information given in Example 5.1.

Example 5.5 Derive the M12 copula using the information given in Example 5.1.

Example 5.6 Derive a four-dimensional FNAC copula function based on the bivariate Frank copula.

5.2.2 Partially Nested Archimedean Copulas (PNAC)

Example 5.7 Using the bivariate Frank copula as the building block to derive a four-dimensional PNAC function for the structure given in Figure 5.4.

5.2.3 General Case

5.2.4 Parameter Estimation for Nested Copulas

Example 5.8 Derive the density function for three-dimensional FNAC (Equation (5.4) corresponding to Figure 5.2).

Example 5.9 Derive the density function for four-dimensional FNAC (i.e., Equation (5.1) corresponding to Figure 5.1).

Example 5.10 Derive the density function for the copula function represented by Figure 5.4.

5.2.5 Simulation for Nested Copulas

Example 5.11 Assuming the pseudo-observations given in Table 5.1 may be modeled with the M6 copula, (1) estimate the copula parameters both simultaneously and sequentially using MLE; and (2) simulate the random variables with a sample size of 50.

Table 5.1. Trivariate pseudo-observations.

Table 5.2. Pseudo-observations for Example 5.12.

5.3 Pair-Copula Construction (PCC)

5.3.1 Principle of Pair-Copula Decomposition of General Multivariate Distribution

Example 5.13 Derive the hh function for the bivariate Gumbel–Hougaard copula.

5.3.2 Vines

Table 5.3. Five-dimensional D-vine.

Three Variables

Four Variables

Five Variables

d-Dimensional Variables

5.3.3 Conditional Independence and the Pair-Copula Decomposition

5.3.4 Simulation from Vine Copulas

Simulation from a C-Vine Copula

Simulating the Random Variables for a D-Vine Copula

5.3.5 Parameter Estimation for a Specified Pair-Copula Decomposition

Parameter Estimation for a C-Vine Copula

Parameter Estimation for a D-Vine Copula

Parameter Estimation for Basic Three-Variable Model

Table 5.4. Data and results for Example 5.14.

Table 5.5. Conditional probability distributions computed for T2 and T3 for fitted D-Vine copula

Table 5.6. Conditional probability distributions computed for T2 and T3 of a fitted C-Vine copula.

5.3.6 Selection of Vine Copula Structure

Example 5.17 Re-work Example 5.16: (1) estimate the copula parameters simultaneously using the same decomposition and copula families as Example 5.16; and (2) simulate the random variates for the sample size of 100 from the fitted copula functions.

5.3.7 Goodness-of-Fit Test

Applying the Anderson–Darling Test

Applying the New Procedure Based on PIT Proposed by Genest et al. (2007b)

Example 5.18 Assess the GoF for the C- or D-vine copula constructed in Example 5.15 for trivariate analysis with both the Anderson–Darling test and the new procedure based on PIT proposed by Genest et al. (2007b) discussed in the preceding section.

Table 5.7. Computed Zs and corresponding test statistics for three-dimensional C- or D-vine copulas.

Notes:

Example 5.19 Assess the GoF for the D- and C-vine copulas constructed in Example 5.16 with both of the two GoF approaches previously discussed.

Table 5.8. Computed Zs and corresponding test statistics for the D-vine copula.

Notes:

Table 5.9. Computed Zs and the corresponding test statistics for the fitted C-vine copula.

Notes:

5.3.8 JCDF for d-Dimensional Vine Copulas

JCDF for Three-Dimensional Variables

JCDF for Four-Dimensional D-Vine Variables

JCDF for Four-Dimensional C-Vine Variables

JCDF for Five-Dimensional D-Vine Variables

JCDF for Five-Dimensional C-Vine Variables

Example 5.20 Compute the JCDF and compare it with the empirical JCDF, using the data and vine copula constructed in Example 5.15.

5.4 Summary

References

Additional Reading

Appendix

M3 Copula

M4 Copula

M5 Copula

M6 Copula

M12 Copula

Share this:

Related

Related posts:

Table 5.5. Conditional probability distributions computed for T₂ and T₃ for fitted D-Vine copula

Table 5.6. Conditional probability distributions computed for T₂ and T₃ of a fitted C-Vine copula.