Skew-T Distribution
Skew-T Distribution
71, 2023
1053-587X © 2023 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
WANG et al.: EFFICIENT AND SCALABLE PARAMETRIC HIGH-ORDER PORTFOLIOS DESIGN VIA THE SKEW-t DISTRIBUTION 3727
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
3728 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 71, 2023
TABLE I
CONVENTIONAL NON-PARAMETRIC REPRESENTATIONS γ ν
OF HIGH-ORDER MOMENTS γ ν
γ ν
Number of Parameters Memory
to Estimate Complexity
Co-skewness Φ 1
N (N + 1) (N + 2) O N3
6 4
Co-kurtosis Ψ 1
24
N (N + 1) (N + 2) (N + 3) O N
TABLE II
FORMULATIONS AND COMPUTATIONAL COMPLEXITY OF
COMPUTING HIGH-ORDER MOMENTS IN NON-PARAMETRIC WAY
Formulation Complexity
3rd φ3 (w) wT Φ (w ⊗ w) O N 3
Fig. 1. Illustrations for the univariate generalized hyperbolic skew-t
central ∇φ3 (w) 3Φ (w ⊗ w) O N 3
distribution (μ = 0, Σ = 1).
moment ∇2 φ3 (w) 6Φ (I ⊗ w) O N 4
4th φ4 (w) wT Ψ (w ⊗ w ⊗ w) O N 4
central ∇φ4 (w) 4Ψ (w ⊗ w ⊗ w) O N 4
Suppose that a N -dimensional random vector x follows the
moment ∇2 φ4 (w) 12Ψ (I ⊗ w ⊗ w) O N5 ghMST distribution, i.e., x ∼ ghMST (μ, Σ, γ, ν). It has the
probability density function (pdf)
Ψ = E [(r − μ) (r − μ) ⊗ (r − μ) ⊗ (r − μ)] , (7) (x−μ)T Σ−1 γ ν
fghMST (x |μ, Σ, γ, ν ) = e N 1 · 2 ν2 2 · Γ 1ν ·
(2π) |Σ|
2 2 ( 2)
where μ = E [r]. As shown in Table I, the costs for storing Φ − ν+N
(ν + Q (x)) γ T Σ−1 γ ,
χ+Q(x) 4
and Ψ have a high complexity. This means that we may not γ T Σ−1 γ
· K− ν+N
2
be able to set up these matrices when the problem dimension (8)
is large.
In addition, the non-parametric approach also poses tremen- where ν ∈ R++ is the degree of freedom, μ ∈ RN is the lo-
dous challenges in computing the objectives values, gradients, cation vector, γ ∈ RN is the skewness vector, Σ ∈ RN ×N
and the Hessian of the third and fourth central moments for a is the scatter matrix, Γ is the gamma function, Q (x) =
(x − μ) Σ−1 (x − μ), and Kλ is the modified Bessel func-
T
given portfolio [23]. Here, we exhibit the corresponding com-
plexities in Table II. As a result, existing first-order methods tion of the second kind with index λ [31].
could not be efficient as they often require many iterations to Remark 1: In the following contexts, μ and Σ refer to the
converge while per-iteration cost is very high. On the other parameters of ghMST distribution, that is, to the location vector
hand, existing second-order methods are not scalable and scatter matrix and not to the mean vector and covari-
because
the complexity of computing ∇2 φ4 (w) is O N 5 . ance matrix.
Therefore, in the next section, we would present a parametric Interestingly, the ghMST distribution can be represented in
approach to model the skewness and kurtosis such that the a hierarchical structure as
concerns discussed above can be significantly eliminated. i.i.d 1 1
x|τ ∼ N μ + γ, Σ ,
τ τ
i.i.d ν ν
III. MODELING HIGH-ORDER MOMENTS USING GENERALIZED τ ∼ Gamma , , (9)
HYPERBOLIC MULTIVARIATE SKEW-t DISTRIBUTION 2 2
In this section, we illustrate how to apply a parametric dis- where N μ̃, Σ̃ denotes the multivariate Gaussian distribution
tribution to model the data and derive the high-order moments with mean vector μ̃ and covariance matrix Σ̃, and Gamma (a, b)
from the parametric model. To be more specific, this approach represents the gamma distribution of shape a and rate b.
assumes that the assets’ returns follow a multivariate gener- Fig. 1 illustrates the skewness and fat-tailness under the
alized hyperbolic skew-t distribution. Then, high-order mo- ghMST distribution. When γ is fixed, the higher the value
ments can be represented using the parameters of the fitted of ν, the thinner the tails. When ν is fixed, the larger the
distribution. To proceed, we will first present some preliminary value of γ, the heavier the skewness. Henceforth, the third- and
knowledge of the generalized hyperbolic skew-t distribution, fourth-moments are naturally embedded into the parameters of
followed by the derivation of efficient methods for computing the distribution.
high-order moments based on this distribution. In the literature, some restricted multivariate skew-t (rMST)
distributions1 [37] are also capable of modeling asymmetry
and fat-tailness. In this paper, we choose to use the ghMST
A. ghMST Distribution
distribution for two reasons. Foremost, the ghMST distribu-
The generalized hyperbolic multivariate skew-t (ghMST) tion is the only skew-t distribution that we can fit within a
distribution [24], [25], is a sub-class of the generalized hy-
1 Variants of rMST distribution include Gupta’s skew-t [32], Pyne’s skew-t
perbolic distribution [26], which is often used in economics
[33], Branco’s skew-t [34], and Azzalini’s skew-t [35]. It can be shown that
to model the data with skewness and heavy tails [27], [28], these variants have similar forms and can characterize the same distribution
[29], [30]. after some parametrization [36].
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
WANG et al.: EFFICIENT AND SCALABLE PARAMETRIC HIGH-ORDER PORTFOLIOS DESIGN VIA THE SKEW-t DISTRIBUTION 3729
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
3730 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 71, 2023
TABLE III Proof: Let w ∈ W be the fixed point of G (·; η), i.e., w =
COMPUTATIONAL COMPLEXITY OF COMPUTING G (w ; η). Hence, w is the optimal solution to the following
HIGH-ORDER MOMENTS USING ghMST DISTRIBUTION
convex optimization problem:
Objective
Gradient
Hessian
2
3-rd moment O N 2 O N 2 O N 3
minimize 1
2 w − (w − η∇f (w ))2
w (17)
4-th moment O N2 O N2 O N3 subject to w ∈ W.
gradient information. As a result, the proposed algorithm can Therefore, for any y ∈ W, we have
exhibit superior scalability over state-of-the-art methods. T
(y − w ) (w − (w − η∇f (w )))
T
IV. PROPOSED METHODS FOR SOLVING MVSK PORTFOLIOS = η (y − w ) ∇f (w ) ≥ 0, (18)
In this section, we explore new practical algorithms for solv- which already indicates that w is the stationary point of
ing Problem (6) under the ghMST distribution. The proposed Problem (6).
method iteratively minimizes the objective values via searching Using Lemma 5, we can recast Problem (6) into the following
a fixed point of a projected gradient mapping. The section is optimization problem
organized as follows. We first recast the optimization prob-
find w ∈ W, subject to w = G (w; η) . (19)
lem (6) as a fixed-point problem. After that, we introduce a
fixed-point acceleration scheme to solve the fixed point more This well-known fixed-point problem can be solved by
efficiently. To overcome the convergence issues caused by the the fixed-point iteration method [45], which iterates the
acceleration scheme, we further enhance the robustness of the following update
fixed-point acceleration method and accomplish our algorithm.
Finally, we provide an analysis of the complexity and conver- wk+1 = G wk ; η , (20)
gence of our proposed methods.
in which the function G should be Lipschitz continuous with
Lipschitz constant L < 1. In practice, this conventional ap-
A. Constructing the Fixed-Point Problem proach is often criticized for slow convergence. Hence, in the
Considering a continuous vector-to-vector mapping G: rest part of this section, we will introduce an acceleration
RN → RN , a point w is a fixed point of function G when it scheme that significantly improves its convergence.
satisfies w = G (w). In optimization,
many iterative methods
aim at generating a sequence w1 , w2 , . . . that is expected B. Fixed-Point Acceleration
to converge to a stationary point via a designed update rule
wk+1 = G wk . As a result, when those algorithms converge, We first reformulate the fixed-point problem as finding a root
the obtained w is also the fixed point of G. In this subsection, of a residual function R : RN → RN
we will introduce the exact expression of G of interest and how R (w; η) = G (w; η) − w. (21)
solving Problem (6) can be transformed into finding a fixed
point of function G. If the problem is unconstrained, the non-smooth version of
The function G we consider is selected as Newton-Raphson method [46] solves the fixed-point problem
Δ via iterating the following update formula
G wk ; η = PW wk − η∇f wk , (15)
wk+1 = wk − M−1 wk ; η R wk ; η , (22)
where η > 0 is the step size and the operator PW is defined as
k
a projection onto a unit simplex [43] where M w ; η ∈ RN ×N ∈ ∂R wk ; η and ∂R wk ; η is
2 the Clarke’s generalized Jacobian of R evaluated at w = wk
PW wk = arg min w − wk 2 , (16)
w∈W [47]. However, (22) is not applicable in our case. On one hand,
which is a continuous vector-valued function defined the acceleration may render iterates infeasible, i.e., wk+1 ∈ / W.
on w ∈ RN . To make up for it, a heuristic alternative to (22) is
Remark 4: In fact, the choice of G is not unique, but (15)
wk+1 = PW wk − M−1 wk ; η R wk ; η . (23)
is preferred because it is simple to manipulate. Instead of call- k
ing a quadratic programming solver,
we
can design a water- On the other hand, M w ; η is generally intractable to obtain.
filling algorithm [44] to solve G wk ; η efficiently. Details are But we notice that the classical directional derivative evaluated
elaborated in Section E of the Appendix. The simplicity of at w = wk still exists and is given as
solving G plays an important role in promoting the efficiency
k R wk + hd; η − R wk ; η
and scalability of the proposed algorithm. Dd R w ; η = lim . (24)
Given any η > 0, the fixed point of G is the stationary point h→0 h
of Problem (6). This is shown in Lemma 5. Then, according to[46, Lemma
forany direction d, there
2.2],
Lemma 5: The set of fixed point of G (·; η), i.e., w = exists a matrix M wk ; η ∈ ∂R wk ; η such that
G (w; η), coincides with that of the stationary points of
Problem (6). Dd R wk ; η = M wk ; η d, (25)
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
WANG et al.: EFFICIENT AND SCALABLE PARAMETRIC HIGH-ORDER PORTFOLIOS DESIGN VIA THE SKEW-t DISTRIBUTION 3731
Hence, by assigning h = 1 and d = G wk ; η − wk to (24), Meanwhile, we require another constraint
we can construct the secant equation at w = wk as k
k y1 − wk , y2k − y1k ≥ 0. (35)
M wk ; η R wk ; η ≈ V w ;η , (26)
We hope that the direction of first-level acceleration should
where the function V : RN → RN is defined as be similar to the direction of second-level acceleration. The
Δ inequality (35), which is equivalent to
V wk ; η = R G wk ; η ; η − R wk ; η k
R w ; η , R wk ; η − αk V wk ; η ≥ 0, (36)
= G G wk ; η ; η − 2G wk ; η + wk . (27)
k another constraint for Nthe value of α , i.e., α ≥
k k
Here, we replace the matrix M wk ; η by the scaled identity provides
k −1 b w , where the function b : R → R is denoted as
matrix α I such that the inverse of it can be easily derived.
The value of αk is therefore determined by approximating the b wk
following equation ⎧
⎨ R(wk ;η)22
k −1 k if R wk ; η , V wk ; η < 0,
α R w ; η ≈ V wk ; η , (28) = R(wk ;η),V (wk ;η)
⎩−∞ otherwise.
whose details will be elaborated later. As a result, we have the (37)
formulation for the first-level fixed-point acceleration, i.e.,
Δ Therefore, the value of αk is computed as the solution to the
y1k = wk − αk R wk ; η . (29) following constrained least square problem
2
Intuitively,as a replacement to (23), the projection of the new minimize R wk ; η − αV wk ; η |α|
point PW y1k is expected to provide smaller residual values α k (38)
subject to b w ≤ α < 0,
compared to wk .
Inspired by the ‘squared extrapolation method’ [48], we whose solution can be easily obtained as
introduce the second-level acceleration by defining
αk = max − R wk ; η V wk ; η , b wk . (39)
Δ
y2k = y1k − αk R y1k ; η . (30) Δ
k
In principle, we can also simulate yi+1 = yik − αk R yik ; η
This strategy, inspired by [49], can be seen as taking two for i > 2, but the formulations are typically more complicated
successive first-level acceleration
using
the same step length. to derive and more levels of approximation is more likely to
Interestingly, the value of R y1k ; η can be approximated by produce invalid acceleration.
manipulating the secant equations. To be more specific, we Compared to the conventional update (20), the proposed
assign different values of d to construct
the secant equations. method only includes some small extra computational costs
In (26), d is set to d1 = G wk ; η − wk . Now, we set at each iteration, while significantly improve the efficiency in
practice. However, like many other fixed point acceleration
d2 = −αk G wk ; η − wk = −αk d1 . (31)
k k methods, directly iterating (34) may not yield robust results. In
This indicates that the approximation of R y1 ; η − R w ; η other words, we may obtain a sequence that does not converge.
can be obtained
by multiplying a scaling factor −αk to Hence, we will provide our solutions to further improve the
V wk ; η , i.e., robustness of the proposed fixed-point acceleration.
R y1k ; η − R wk ; η ≈ −αk V wk ; η . (32)
C. A Robust Fixed Point Acceleration (RFPA) Algorithm
Therefore, we obtain the closed-form approximation for y2k as
To establish
a stable convergence, we require that the
Δ
y2k = wk − αk R wk ; η − αk R wk ; η − αk V wk ; η sequence f wk should be monotone, i.e.,
2
= wk − 2αk R wk ; η + αk V wk ; η . (33) ∀k : f wk+1 ≤ f wk . (40)
Eventually, the update for w is finalized as The strategy is illustrated as follows. When the fixed-point
acceleration
k fails to improve the objective,
i.e., f wk+1 >
wk+1 = PW wk − 2αk R wk ; η f w , we first set wk+1 = G wk ; η . Then, we keep de-
2 creasing it by η ← βη with a scaling factor β ∈ (0, 1) until the
+ αk V w k ; η . (34)
following condition is met
Now we introduce how to compute the value of αk . In the T k+1
f wk+1 ≤ f wk + ∇f wk w − wk
literature, αk is usually estimated by minimizing a discrepancy
1
wk − wk+1 2 .
measure
based
on the secant
equation
(28). From [50], we select + (41)
2
R wk ; η − αV wk ; η 2 |α| as our discrepancy measure. 2η
In addition, because the term R wk ; η in (29) can be seen Once the condition (41) holds, the sequence f wk is then
as a direction to achieve small objective values, it is naturally monotone with the details provided in Section F of the Ap-
to impose the constraint αk ≤ 0 such that the acceleration is pendix. Eventually, we summarize the proposed robust fixed
performed along with descent direction. point acceleration (RFPA) algorithm in Algorithm 1.
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
3732 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 71, 2023
Algorithm 1 Robust Fixed Point Acceleration (RFPA) algo- point of function G, we can obtain the stationary point of
rithm for solving Problem (6). Problem (6).
Theorem 6: If wk = wk+1 , then wk is a stationary point of
1: Initialize w0 ∈ W, η, η0 , β
Problem (6).
2: for k = 0, 1, 2,. . . do k Proof: See Section G of the Appendix.
3: Compute R wk ; η , V w ; η Theorem 6 indicates that the algorithm can obtain the
4: αk = max − R wk ; η V wk ; η , b wk .
stationary point of Problem (6) if it terminates with wk =
5: wk+1 = 2 wk+1 , which always holds in empirical studies as shown in
PW wk − 2αk R wk ; η + αk V wk ; η .
Section VI-C.
6: if f wk+1 > f wk then
7: η = η0 .
8: Update wk+1 = G wk ; η . V. EXTENSION: SOLVING MVSK-TILTING PORTFOLIOS WITH
9: while (41) not satisfied do GENERAL DETERIORATION MEASURE
10: η ← βη , go to step 8. Our proposed framework provides an efficient and scalable
11: end while discipline for handling high-order moments, therefore presents
12: end if great potential for more advanced and sophisticated applica-
13: Terminate loop if converges. tions, like multi-period portfolio optimization problems [51],
14: end for [52], [53], incorporating diversification into the high-order de-
signs [54], [55], and increasing the robustness of current MVSK
If no fixed
point acceleration is applied, we only iterate formulation [56]. In this section, we explore an interesting
wk+1 = G wk ; η that satisfies (41), the RFPA algorithm example of extending our framework to other portfolios.
would reduce to the projected gradient descent (PGD) method. In portfolio theory, though the MVSK framework finds a
The main motivation of executing projected gradients is to solution on the efficient frontiers, choosing proper values for
enlarge the difference between wk+1 and wk . Theoretically, λ may be difficult and the optimal weights are often concen-
whether the fixed-point acceleration would significantly im- trated into some positions, resulting in a greater idiosyncratic
prove the convergence is decided by the numerical properties risk [57]. Therefore, we can generalize the idea of the RFPA
at wk . Therefore, if the difference of wk and wk+1 is not large algorithm for solving another important high-order portfolio
enough while the fixed-point acceleration at wk is not success- called the MVSK-Tilting problem with general deterioration
ful, the algorithm tends to reject the fixed-point acceleration at measures. This MVSK-Tilting portfolio aims at improving a
wk+1 due to their similar numerical properties. given portfolio that is not sufficiently optimal from the MVSK
perspective by tilting it toward a direction that concurrently
ameliorates all the objectives [58], [59].
D. Complexity Analysis and Convergence Analysis The problem of interest is formulated as
The overall complexity of the proposed RFPA algorithm is minimize −δ + λ · gdet (w)
O N 2 . Specifically, the per-iteration cost of the proposed w,δ
RFPA algorithm comes from two parts: computing the gradi- subject to φ1 (w) ≥ φ1 (w0 ) + d1 δ,
ent ∇f wk and solving a projection problem PW . With the φ2 (w) ≤ φ2 (w0 ) − d2 δ, (42)
help of the parametric skew-t distribution, the computational φ3 (w) ≥ φ3 (w0 ) + d3 δ,
complexity of computing the gradient is reduced to O N 2 . φ4 (w) ≤ φ4 (w0 ) − d2 δ,
For solving the projection problems, the computational com- w ∈ W,
plexity mainly depends on finding proper values of the dual T
variables via bisection. According to Section E of the Appendix, where d = d1 d2 d3 d4 ≥ 0 represents the relative
the primary cost of the water-filling algorithm is to sort an importance of each target, gdet (w) is a differentiable function
array of numbers. Therefore, the corresponding complexity is that corresponds to an assigned deterioration measure with re-
O (N log N ). In conclusion, regardless of the number of outer spect to w0 , and λ is the regularization coefficient. For example,
iterations, the gdet (w) can represent a tracking error
overall
complexity of the proposed RFPA algo-
rithm is O N 2 . T
gdet (w) = (w − w0 ) Cov [r] (w − w0 ). (43)
On the contrary, if we apply the non-parametric modeling
of the high-order moments, then the bottleneck of all the al- Implicitly, the point w0 refers to a reference portfolio that
gorithms would be the computation
of the gradient or the satisfies w0 = arg minw∈W gdet (w), indicating that the penalty
Hessian, which are O N 4 or O N 5 , respectively. After we would be imposed when we tilt w away from w0 .
assume the returns follow a parametric skew-t distribution, the As the key for the success of the RFPA algorithm is to form
complexity of the second-order methods, like Q-MVSK algo- a separable function G such that the fixed point of G is the
rithm
and sequential quadratic programming method, becomes stationary point we want to obtain. The function G corresponds
O N 3 due to the complexity of evaluating ∇2 φ4 (w). to an optimization problem that has the following properties:
The convergence of the RFPA algorithm for MVSK portfo- • The objective function of the optimization problem is
lio optimization is given as Theorem 6. By solving the fixed separable.
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
WANG et al.: EFFICIENT AND SCALABLE PARAMETRIC HIGH-ORDER PORTFOLIOS DESIGN VIA THE SKEW-t DISTRIBUTION 3733
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
3734 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 71, 2023
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
WANG et al.: EFFICIENT AND SCALABLE PARAMETRIC HIGH-ORDER PORTFOLIOS DESIGN VIA THE SKEW-t DISTRIBUTION 3735
TABLE IV
EMPIRICAL ORDERS OF COMPLEXITY
VII. CONCLUSION
Fig. 6. Investigation on the empirical complexity of RFPA and Q-MVSK In this paper, we have proposed a high-order portfolio design
algorithms.
framework with the help of the parametric skew-t distribution
RFPA algorithm appears to be more stable compared to the and a robust fixed point acceleration. The parametric approach
PGD method. is practical for modeling the skewness and kurtosis of portfolio
3) Comparison on Scalability: Interestingly, implied by returns in high-dimensional settings. By assuming the returns
Fig. 5, first-order methods, including RFPA and PGD, appear follow a ghMST distribution, we can alleviate the difficulties
to be more scalable than the second-order Q-MVSK algorithm. caused by the high complexity of traditional methods and accel-
To better investigate this phenomenon, we will be conducting a erate all existing algorithms to a certain extent. Additionally, the
comparison of these algorithms using a synthetic data set, where proposed RFPA algorithm immensely cut down the number of
the parameter Θ is randomly generated. iterations for first-order methods. Numerical simulations have
As shown in Fig. 6, the proposed RFPA algorithm has demonstrated the outstanding efficiency and scalability of our
a significantly lower complexity compared to the Q-MVSK proposed framework over the state-of-the-are benchmarks.
algorithm, as its every single iteration does not contain pro-
cedures with high complexity. Meanwhile, PGD method also APPENDIX
enjoys the benefits of low complexity but its overall efficiency
is worse than the RFPA method. We also fit the empirical A. Computational Time of Different Estimation Methods
orders of the four methods considered. The relative results are Fig. 8 depicts the computational time of different estimation
shown in Table IV. It turns out that theempirical computational methods. It can be observed that fitting the ghMST distribution
complexity of our method is O N 2 and the complexity of is much more efficient than others.
the second-order method Q-MVSK is around O N 3 . The
results of numerical simulations coincide with the discussion in B. Proof for Lemma 2
Section IV-D. The proof starts with a fact that the central moments of a
Gaussian variable X̃ ∼ N μ̃, Σ̃ is given by
C. Empirical Convergence of the Proposed RFPA Algorithm
E[X̃i ] = μ̃i ,
According to Theorem 6, when wk = wk+1 , the algorithm
terminates at a stationary point of Problem (6). Though exact E[X̃i X̃j ] = μ̃i μ̃j + Σ̃ij ,
equality is often unattainable, empirically, the relative differ- E[X̃i X̃j X̃k ] = μ̃i μ̃j μ̃k + μ̃i Σ̃jk + μ̃j Σ̃ik + μ̃k Σ̃ij ,
ence of w, denoted as E[X̃i X̃j X̃k X̃l ] = μ̃i μ̃j μ̃k μ̃l + (Σ̃ij μ̃k μ̃l + · · · + Σ̃kl μ̃i μ̃j )
Δ
Relative Error wk = wk − wk−1 wk , (55) 6 items
+ (Σ̃ij Σ̃kl + Σ̃ik Σ̃jl + Σ̃il Σ̃jk ). (56)
would tend to zero. To show this, we conduct experiments using
i.i.d
term of the hierarchical structure r|τ ∼
real-world data sets with different problem dimensions. The Then,
given the first
values of (55) are computed at each iteration. From Fig. 7 we N μ + τ1 γ, τ1 Σ , we have
observe that the differences all reduce to very small
numbers.
1
Empirical studies show that the residual value R wk ; η would E [ri |τ ] = μi + γi , (57)
τ
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
3736 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 71, 2023
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
WANG et al.: EFFICIENT AND SCALABLE PARAMETRIC HIGH-ORDER PORTFOLIOS DESIGN VIA THE SKEW-t DISTRIBUTION 3737
Given W = w 1T w = 1, w ≥ 0 , the Lagrangian of Prob- G. Proof of Theorem 6
lem (65) is
Proof: When wk = wk+1 , we may have wk+1 = PW wk −
1 2
L (w, ψ, γ) = w − wk − η∇f wk 2 2αk R wk ; η + αk V wk ; η or wk+1 = G wk ; η .
2 T
2
−ψ w+γ 1 w−1 ,
T
(66) (i) We first analyze the first case where wk+1 = PW yk ,
in which
where ψ and γ are dual variables associated with the constraints Δ 2
yk = wk − 2αk R wk ; η + αk V wk ; η . (75)
w ≥ 0 and 1T w = 1, respectively. The KKT conditions are
By applying the contraposition, we prove the following state-
η∇f wk + w − wk − ψ + γ1 = 0, ment instead
ψ w = 0. (67)
∀wk ∈ W : R wk ; η = 0 ⇒ PW yk = wk . (76)
Hence, we have
For simplicity,
we denote α = −αk > 0. Note that α = 0 as
wi = max 0, wik − η ∇f w k
i
−γ . (68) R w ; η = 0.
k
exists and is unique. The root provides a dual optimal of the the projection of yk onto W is itself, i.e., PW yk = yk .
KKT system. We can easily solve γ and w via bisection. Consequently, we obtain
wk+1 = PW yk = wk . (78)
F. Monotonicity of the Sequence f wk (B) If α ∈ (1, ∞). We will first show that the following
According to the projection theorem [67], i.e., inequality holds for any wk
Δ
∀x, z : z − x, PW (z) − PW (x) ≥ PW (z) ξ = R wk ; η , R wk ; η + αV wk ; η ≥ 0. (79)
2
−PW (x)2 , (71) In principle,
we consider
the following
three cases based on the
value of R wk ; η , V wk ; η .
we apply z = wk − η∇f wk and x = wk to obtain
(B.1) If R wk ; η , V wk ; η ≥ 0, then b wk = −∞.
2 (79) holds as ∀α > 1:
−η∇f wk , wk+1 − wk ≥ wk+1 − wk 2 , (72)
2
ξ = R wk ; η + α R wk ; η , V wk ; η ≥ 0. (80)
or equivalently
k
1 2 (B.2) If R w ; η , V wk ; η < 0 and b wk =
∇f wk , wk+1 − wk ≤ − wk+1 − wk 2 . (73) R(wk ;η)2
2
η = −∞, we have α ≤ −b wk . In this case,
R(wk ;η),V (wk ;η)
Hence, from the inequality (41) we have (79) holds as ∀α ∈ 1, −b wk :
T k+1 2
f wk+1 ≤ f wk + ∇f wk w − wk ξ = R w k ; η + α R w k ; η , V w k ; η
2
1
wk − wk+1 2 ≥ R wk ; η − b wk R wk ; η , V wk ; η = 0.
+
2η 2 (81)
k 1 k+1 2 k k k
≤ f w − w − w k 2 (B.3) If R w ; η , V w ; η < 0 but b w → −∞
η
; η → 0, the value of α can
k
1 2 tok Vw
due be either
+ w − wk+1 2
k R w ; η V wk ; η → ∞ or −b wk → ∞. When
2η
α = R wk ; η V wk ; η , we suppose
k 1
wk − wk+1 2 ≤ f wk , (74)
=f w − k
2η 2
R w ; η , V wk ; η
which indicates that the sequence f wk is then monotone. = R wk ; η V wk ; η cos θR,V , (82)
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
3738 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 71, 2023
in which θR,V is the angle between R wk ; η and V wk ; η . [6] E. Jondeau and M. Rockinger, “Conditional volatility, skewness, and
Hence, as cos θR,V ∈ [−1, 1], we obtain kurtosis: Existence, persistence, and comovements,” J. Econ. Dyn.
Control, vol. 27, no. 10, pp. 1699–1737, 2003.
k 2 R wk ; η k [7] B. O. Bradley and M. S. Taqqu, “Financial risk and heavy tails,” in
ξ = R w ;η + R w ; η , V wk ; η Handbook of Heavy Tailed Distributions in Finance. Amsterdam, The
V (w ; η)
k
Netherlands: Elsevier, 2003, pp. 35–103.
k 2
= R w ;η (1 + cos θR,V ) ≥ 0. (83) [8] J. V. Rosenberg and T. Schuermann, “A general approach to integrated
risk management with skewed, fat-tailed risks,” J. Financial Econ., vol.
R(wk ;η)
2 79, no. 3, pp. 569–614, 2006.
When α = −b wk = − R(wk ;η),V (wk2;η) , it is obvious that [9] K. Gaurav and P. Mohanty, “Effect of skewness on optimum portfolio
selection,” IUP J. Appl. Finance, vol. 19, no. 3, p. 56, 2013.
2
ξ = R wk ; η − b wk R wk ; η , V wk ; η = 0.
[10] D. Maringer and P. Parpas, “Global optimization of higher order
moments in portfolio selection,” J. Global Optim., vol. 43, no. 2,
(84) pp. 219–230, 2009.
[11] R. C. Scott and P. A. Horvath, “On the direction of preference for
Therefore, (79) holds. As a consequence, we can compare moments of higher order than the variance,” J. Finance, vol. 35, no.
the following two terms 4, pp. 915–919, 1980.
[12] L. T. DeCarlo, “On the meaning and use of kurtosis,” Psychol. Methods,
yk − wk = αR wk ; η vol. 2, no. 3, p. 292, 1997.
[13] E. Jondeau and M. Rockinger, “Optimal portfolio allocation under higher
+ α R wk ; η + αV wk ; η ,
moments,” Eur. Financial Manage., vol. 12, no. 1, pp. 29–55, 2006.
yk − G wk ; η = (α − 1) R wk ; η [14] C. R. Harvey, J. C. Liechty, M. W. Liechty, and P. Müller, “Portfo-
lio selection with higher moments,” Quant. Finance, vol. 10, no. 5,
+ α R wk ; η + αV wk ; η , (85) pp. 469–485, 2010.
[15] J. He, Q.-G. Wang, P. Cheng, J. Chen, and Y. Sun, “Multi-period mean-
by evaluating the difference of their squared 2 norms, i.e., variance portfolio optimization with high-order coupled asset dynamics,”
k IEEE Trans. Autom. Control, vol. 60, no. 5, pp. 1320–1335, May 2015.
y − w k 2 − y k − G w k ; η 2 [16] L. Martellini and V. Ziemann, “Improved estimates of higher-order
2 comoments and implications for portfolio selection,” Rev. Financial
= (2α − 1) R wk ; η Stud., vol. 23, no. 4, pp. 1467–1502, 2010.
[17] E. Jondeau, “Asymmetry in tail dependence in equity portfolios,” Com-
+ 2α R wk ; η , R wk ; η + αV wk ; η . (86) put. Statist. Data Anal., vol. 100, pp. 351–368, 2016.
[18] S. Liu, P.-Y. Chen, B. Kailkhura, G. Zhang, A. O. Hero III, and
Then, we obtain the following strict inequality
P. K. Varshney, “A primer on zeroth-order optimization in signal process-
k
y − w k 2 − y k − G w k ; η 2 > 0 (87)
ing and machine learning: Principals, recent advances, and applications,”
IEEE Signal Process. Mag., vol. 37, no. 5, pp. 43–54, Sep. 2020.
k k
as α > 1 and R w ; η > 0. Therefore,
PW y = wk as [19] B. Babu and M. M. L. Jehan, “Differential evolution for multi-objective
optimization,” in Proc. Congr. Evol. Comput. (CEC), vol. 4. Piscataway,
there exists a feasible point G w ; η ∈ W that is closer to yk
k
NJ, USA: IEEE, 2003, pp. 2696–2703.
compared to wk . [20] S. Kshatriya and P. K. Prasanna, “Genetic algorithm-based portfolio
Hence, we have shown that PW yk = wk if R wk ; η = optimization with higher moments in global stock markets,” J. Risk,
vol. 20, no. 4, pp, 1–26, 2018.
0. As a result, we have obtained the following statement [21] T. P. Dinh and Y.-S. Niu, “An efficient DC programming approach for
portfolio decision with higher moments,” Comput. Optim. Appl., vol. 50,
PW yk = wk ⇒ R wk ; η = 0. (88) no. 3, pp. 525–554, 2011.
[22] Y.-S. Niu and Y.-J. Wang, “Higher-order moment portfolio optimization
Then, wk is a stationary point of Problem (6) according to via the difference-of-convex programming and sums-of-squares,” 2019,
Lemma 5. arXiv:1906.01509.
(ii) We then analyze the second case where [23] R. Zhou and D. P. Palomar, “Solving high-order portfolios via successive
convex approximation algorithms,” IEEE Trans. Signal Process., vol. 69,
wk+1 = PW wk − η ∇f wk . (89) pp. 892–904, 2021.
[24] K. Aas and I. H. Haff, “The generalized hyperbolic skew Student’s
Then, wk is a stationary point of Problem (6) when wk+1 = wk t-distribution,” J. Financial Econometrics, vol. 4, no. 2, pp. 275–
with the proof directly from [68], Theorem 9.10]. 309, 2006.
[25] Y. Wei, Y. Tang, and P. D. McNicholas, “Mixtures of generalized
In conclusion, once we obtain wk+1 = wk from the proposed hyperbolic distributions and mixtures of skew t-distributions for model-
RFPA algorithm, wk is a stationary point of Problem (6). based clustering with incomplete data,” Comput. Statist. Data Anal., vol.
130, pp. 18–41, 2019.
[26] O. Barndorff-Nielsen, “Exponentially decreasing distributions for the
REFERENCES logarithm of particle size,” Proc. R. Soc. London A. Math. Phys. Sci.,
[1] H. M. Markowitz, “Portfolio selection,” J. Finance, vol. 7, no. 1, pp. 77– vol. 353, no. 1674, pp. 401–419, 1977.
91, 1952. [27] M. Hellmich and S. Kassberger, “Efficient and robust portfolio opti-
[2] H. M. Markowitz, “Foundations of portfolio theory,” J. Finance, mization in the multivariate generalized hyperbolic framework,” Quant.
vol. 46, no. 2, pp. 469–477, 1991. Finance, vol. 11, no. 10, pp. 1503–1516, 2011.
[3] C. Adcock, M. Eling, and N. Loperfido, “Skewed distributions in finance [28] W. Hu and A. Kercheval, “Risk management with generalized hyperbolic
and actuarial science: A review,” Eur. J. Finance, vol. 21, no. 13–14, distributions,” in Proc. 4th IASTED Int. Conf. Financial Eng. Appl.
pp. 1253–1281, 2015. Berkeley, CA, USA: ACTA Press, 2007, pp. 19–24.
[4] S. I. Resnick, Heavy-Tail Phenomena: Probabilistic and Statisti- [29] J. R. Birge and L. Chavez-Bedoya, “Portfolio optimization under a
cal Modeling. New York, NY, USA: Springer Science & Business generalized hyperbolic skewed t-distribution and exponential utility,”
Media, 2007. Quant. Finance, vol. 16, no. 7, pp. 1019–1036, 2016.
[5] P. N. Kolm, R. Tütüncü, and F. J. Fabozzi, “60 years of portfolio [30] M. Haas and C. Pigorsch, “Financial economics, fat-tailed distributions,”
optimization: Practical challenges and current trends,” Eur. J. Oper. Res., in Encyclopedia of Complexity and Systems Science, New York, NY,
vol. 234, no. 2, pp. 356–371, 2014. USA: Springer, vol. 4, no. 1, pp. 3404–3435, 2009.
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
WANG et al.: EFFICIENT AND SCALABLE PARAMETRIC HIGH-ORDER PORTFOLIOS DESIGN VIA THE SKEW-t DISTRIBUTION 3739
[31] O. E. Barndorff-Nielsen, T. Mikosch, and S. I. Resnick, Lévy [56] C. Chen and Y.-S. Zhou, “Robust multiobjective portfolio with higher
Processes: Theory and Applications. Boston, MA, USA: Springer moments,” Expert Syst. Appl., vol. 100, pp. 165–181, 2018.
Science & Business Media, 2012. [57] A. J. Prakash, C.-H. Chang, and T. E. Pactwa, “Selecting a portfolio with
[32] A. Gupta, “Multivariate skew t-distribution,” Statist.: J. Theor. Appl. skewness: Recent evidence from US, European, and Latin American eq-
Statist., vol. 37, no. 4, pp. 359–363, 2003. uity markets,” J. Banking Finance, vol. 27, no. 7, pp. 1375–1390, 2003.
[33] S. Pyne et al., “Automated high-dimensional flow cytometric data [58] E. Jurczenko and B. Maillet, Multi-Moment Asset Allocation and Pricing
analysis,” Proc. Nat. Acad. Sci., vol. 106, no. 21, pp. 8519–8524, 2009. Models. Hoboken, NJ, USA: Wiley, 2006.
[34] M. D. Branco and D. K. Dey, “A general class of multivariate skew- [59] K. Boudt, D. Cornilly, F. Van Holle, and J. Willems, “Algorithmic
elliptical distributions,” J. Multivariate Anal., vol. 79, no. 1, pp. 99– portfolio tilting to harvest higher moment gains,” Heliyon, vol. 6, no. 3,
113, 2001. 2020, Art. no. e03516.
[35] A. Azzalini and A. Capitanio, “Distributions generated by perturbation [60] R. Zhou and D. P. Palomar, “Solving high-order portfolios via successive
of symmetry with emphasis on a multivariate skew t-distribution,” convex approximation algorithms,” IEEE Trans. Signal Process., vol. 69,
J. R. Statist. Soc.: Ser. B (Statist. Methodol.), vol. 65, no. 2, pp. 367– pp. 892–904, 2021.
389, 2003. [61] S. G. Johnson, “The NLopt nonlinear-optimization package,” 2014.
[36] S. X. Lee and G. J. McLachlan, “On mixtures of skew normal and skew [Online]. Available: http://ab-initio.mit.edu/nlopt
t-distributions,” Adv. Data Anal. Classification, vol. 7, no. 3, pp. 241– [62] B. A. Turlach and A. Weingessel, “Quadprog: Functions to solve
266, 2013. quadratic programming problems,” R Package Version 1.7, 2020. [On-
[37] S. K. Sahu, D. K. Dey, and M. D. Branco, “A new class of multivariate line]. Available: https://CRAN.R-project.org/package=quadprog
skew distributions with applications to Bayesian regression models,” [63] K. Boudt, W. Lu, and B. Peeters, “Higher order comoments of multifac-
Can. J. Statist., vol. 31, no. 2, pp. 129–150, 2003. tor models and asset allocation,” Finance Res. Lett., vol. 13, pp. 225–
[38] S. Lee and G. J. McLachlan, “Finite mixtures of multivariate skew 233, 2015.
t-distributions: Some recent and new results,” Statist. Comput., vol. 24, [64] A. Elminejad, T. Havranek, and Z. Irsova, “Relative risk aversion: A
no. 2, pp. 181–202, 2014. meta-analysis,” 2022.
[39] K. Wang, S.-K. Ng, and G. J. McLachlan, “Multivariate skew t mixture [65] R. B. Barsky, F. T. Juster, M. S. Kimball, and M. D. Shapiro, “Preference
models: Applications to fluorescence-activated cell sorting data,” in parameters and behavioral heterogeneity: An experimental approach in
Proc. Digit. Image Comput.: Techn. Appl. Piscataway, NJ, USA: IEEE, the health and retirement study,” Quart. J. Econ., vol. 112, no. 2,
2009, pp. 526–531. pp. 537–579, 1997.
[40] K. Wang, A. Ng, G. McLachlan, and M. S. Lee, “Package [66] G. G. Pennacchi, Theory of Asset Pricing. Boston, MA, USA:
’EMMIXskew’,” 2018. [Online]. Available: http://cran.nexr.com/web/ Pearson/Addison-Wesley, 2008.
packages/EMMIXskew/ [67] F. Facchinei and J.-S. Pang, Finite-Dimensional Variational Inequalities
[41] W. Breymann and D. Lüthi, “ghyp: A package on generalized hyperbolic and Complementarity Problems. Springer, 2003.
distributions,” Manual for R Package ghyp, 2013. [68] A. Beck, Introduction to Nonlinear Optimization: Theory, Algorithms,
[42] A. J. McNeil, R. Frey, and P. Embrechts, Quantitative Risk Management: and Applications With MATLAB. Philadelphia, PA, USA: SIAM, 2014.
Concepts, Techniques and Tools, Rev. ed. Princeton, NJ, USA: Princeton
University Press, 2015.
[43] L. Condat, “Fast projection onto the simplex and the 1 ball,” Math.
Program., vol. 158, no. 1, pp. 575–585, 2016.
[44] D. P. Palomar and J. R. Fonollosa, “Practical algorithms for a family
of waterfilling solutions,” IEEE Trans. Signal Process., vol. 53, no. 2,
pp. 686–695, Feb. 2005.
[45] K. L. Judd, Numerical Methods in Economics. Cambridge, MA, USA:
MIT Press, 1998.
[46] L. Qi and J. Sun, “A nonsmooth version of Newton’s method,” Math.
Program., vol. 58, no. 1, pp. 353–367, 1993.
[47] F. H. Clarke, Optimization and Nonsmooth Analysis. Philadelphia, PA,
USA: SIAM, 1990. Xiwen Wang received the B.Sc. degree in elec-
[48] R. Varadhan and C. Roland, “Squared extrapolation methods tronic information of science and technology from
(SQUAREM): A new class of simple and efficient numerical schemes for Nanjing University, Nanjing, China in June 2019.
accelerating the convergence of the EM algorithm,” Dept. of Biostatistics He is currently pursuing the Ph.D. degree with the
Working Papers, Johns Hopkins University, Working Paper 63, 2004. Department of Electronic and Computer Engineer-
[49] M. Raydan and B. F. Svaiter, “Relaxed steepest descent and Cauchy- ing at the Hong Kong University of Science and
Barzilai-Borwein method,” Comput. Optim. Appl., vol. 21, no. 2, Technology. His research interests include convex
pp. 155–167, 2002. optimization and fast algorithms with applications
[50] R. Varadhan and C. Roland, “Simple and globally convergent methods in financial engineering, machine learning, and
for accelerating the convergence of any EM algorithm,” Scand. J. Statist., operations research.
vol. 35, no. 2, pp. 335–353, 2008.
[51] M. W. Brandt, A. Goyal, P. Santa-Clara, and J. R. Stroud, “A simulation
approach to dynamic portfolio choice with an application to learning
about return predictability,” Rev. Financial Stud., vol. 18, no. 3, pp. 831–
873, 2005.
[52] F. Cong and C. W. Oosterlee, “Multi-period mean–variance portfolio Rui Zhou (Member, IEEE) received the B.Eng.
optimization based on Monte-Carlo simulation,” J. Econ. Dyn. Control, degree in information engineering from Southeast
vol. 64, pp. 23–38, 2016. University, Nanjing, China, in 2017, and the Ph.D.
[53] J. Skaf and S. Boyd, “Multi-period portfolio optimization with con- degree from the Hong Kong University of Sci-
straints and transaction costs,” unpublished, 2009. ence and Technology, Hong Kong, in 2021. He is
[54] P. J. Mercurio, Y. Wu, and H. Xie, “An entropy-based approach to currently a Research Scientist with the Shenzhen
portfolio optimization,” Entropy, vol. 22, no. 3, p. 332, 2020. Research Institute of Big Data, Shenzhen, China.
[55] Y.-l. Kang, J.-S. Tian, C. Chen, G.-Y. Zhao, Y.-f. Li, and Y. Wei, His research interests include optimization algo-
“Entropy based robust portfolio,” Physica A: Stat. Mech. Appl., vol. rithms, statistical signal processing, machine learn-
ing, and financial engineering.
583, 2021, Art. no. 126260.
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.
3740 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 71, 2023
Jiaxi Ying received the Ph.D. degree from the Daniel P. Palomar (Fellow, IEEE) received the
Department of Electronic and Computer Engineer- bachelor’s degree in electrical engineering and the
ing at the Hong Kong University of Science and Ph.D. degree from the Technical University of
Technology, Hong Kong, in 2022. He is currently a Catalonia (UPC), Barcelona, Spain, in 1998 and
Postdoctoral Fellow with the Department of Mathe- 2003, respectively. He was a Fulbright Scholar at
matics at the same university. He was the recipient Princeton University during 2004–2006. He is a
of the Outstanding Master’s Thesis Award of Chi- Professor with the Department of Electronic &
nese Institute of Electronics, the Excellent Master Computer Engineering and the Department of In-
Thesis in Fujian Province, and the HKUST RedBird dustrial Engineering & Decision Analytics at the
Ph.D. Scholarship Program. His research interests Hong Kong University of Science and Technology
are mainly on the intersection of optimization, ma- (HKUST), Hong Kong, where he joined in 2006. He
chine learning, signal processing, and statistics. had previously held several research appointments, namely, at King’s College
London (KCL), London, U.K.; Stanford University, Stanford, CA, USA;
Telecommunications Technological Center of Catalonia (CTTC), Barcelona,
Spain; Royal Institute of Technology (KTH), Stockholm, Sweden; University
of Rome “La Sapienza,” Rome, Italy; and Princeton University, Princeton,
NJ, USA. His current research interests include applications of optimization
theory, graph methods, and signal processing in financial systems and big data
analytics. He was a recipient of a 2004/2006 Fulbright Research Fellowship,
the 2004, 2015, and 2020 (co-author) Young Author Best Paper Awards
by the IEEE Signal Processing Society, the 2015–2016 HKUST Excellence
Research Award, the 2002/2003 best Ph.D. prize in information technologies
and communications by the Technical University of Catalonia (UPC), the
2002/2003 Rosina Ribalta first prize for the Best Doctoral Thesis in infor-
mation technologies and communications by the Epson Foundation, and the
2004 prize for the best Doctoral Thesis in advanced mobile communications
by the Vodafone Foundation and COIT. He has been a Guest Editor of
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING 2016 special
issue on “Financial Signal Processing and Machine Learning for Electronic
Trading,” IEEE SIGNAL PROCESSING MAGAZINE 2010 special issue on “Convex
Optimization for Signal Processing,” IEEE JOURNAL ON SELECTED AREAS IN
COMMUNICATIONS 2008 special issue on “Game Theory in Communication
Systems,” and IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS
2007 special issue on “Optimization of MIMO Transceivers for Realistic
Communication Networks,” and an Associate Editor of IEEE TRANSACTIONS
ON INFORMATION THEORY and IEEE TRANSACTIONS ON SIGNAL PROCESSING.
Authorized licensed use limited to: Pontificia Universidad Javeriana. Downloaded on April 02,2024 at 00:12:07 UTC from IEEE Xplore. Restrictions apply.