Convergence Uniforme Index
Convergence Uniforme Index
Abstract
This study focuses on the nonparametric estimation of the conditional den-
sity of a scalar response variable given a random variable taking values in sep-
arable Hilbert space. We establish under general conditions the almost com-
plete convergence rates of the conditional density estimator under α-mixing
dependence, based on the single-index structure. We also demonstrate the
impact of this functional parameter on the mode estimation. Finally, the es-
timation of the functional index via the pseudo-maximum likelihood method
is discussed but not tackled.
1 Introduction
During the last two decades, functional data analysis (FDA) has become
more and more popular in modern statistics. Due to the rapid technology
development of accurate instruments, measurement could be taken continu-
ously over a period of time to produce data in functional form (i.e. curves)
(see Ramsay and Dallzel (1991)). Moreover, modelization of variables rep-
resented by curves has attracted the interest of several authors in the recent
statistical literature, and it has provided the powerful tool for exploratory
functional data analysis. Several variations have been proposed in Ramsay
and Silverman (2002, 2005) for parametric models or in Ferraty and Vieu
(2006) for nonparametric conditional models (mean, mode,...), and we refer
to Ferraty (2011) for recent advances. The goal of this work is to contribute
Conditional density and mode for strong mixing
in single index 357
to the functional data literature by studying some classes of semiparamet-
ric models. Note that such models are important in the statistical and
econometric modelization, due to its flexibility for dimension reduction, it
provides the best new ways to investigate problems in substantive economics
(see Horowitz (2009)). As a particular case, the single-index model has
proven useful in providing an optimal approach to compromise between non-
parametric and parametric models.
Single-index models when the explanatory variable is an element of a
finite-dimensional space have been studied extensively in both statistical and
econometric literatures, we quote, for instance, Härdle et al. (1993), Horowitz
(1996), Hristache et al. (2001a, b) and Delecroix et al. (2003). The main
focus of this work is to study the functional single index model for dependent
data by treating the problem of estimating the conditional density of a real
variable Y given a functional covariable X, when the explanation of Y given
X is done through its projection on one functional direction.
In nonparametric regression setting, it is a well-known fact that for fore-
casting and statistical inferences, the conditional density function is very
useful. It has been, widely nonparametrically studied in the multivariate
case, see Roussas (1968), Rosenblatt (1969) and Youndjé (1993). It pro-
vides the one of the best tool to estimate some characteristic feature of
the dataset, such as the conditional mode. Indeed, this last has received
a considerable attention, see, for instance, Collomb et al. (1987), Samanta
and Thavaneswaran (1990), Quintela-del-Rı́o and Vieu (1997), Berlinet et al.
(1998), and among others. However, little attention has been devoted to the
case when the explanatory variable takes values in abstract spaces, starting
by Gasser et al. (1998), they gave an approach to introduce a nonparametric
estimation of the mode when data is curve. Ferraty et al. (2006) showed that
the conditional mode provides an alternative method for prediction and gives
results slightly better than the classical regression. The consistency and the
asymptotic normality of the nonparametric conditional mode estimator were
obtained by Ezzahrioui and Ould Saı̈d (2008). Quintela-del-Rı́o et al. (2011)
gave some recent results about nonparametric conditional density estimation
with econometric applications.
For the functional single-index models, the literature is closely limited,
and only a few theoretical results have been obtained until now. The first
asymptotic properties in the fixed functional single-model were obtained
by Ferraty et al. (2003). They established the almost complete convergence,
in the i.i.d. case, of the link regression function of this model. Their results
were extended to the dependent case by Aı̈t Saidi et al. (2005). Where
the functional single-index is unknown, Aı̈t Saidi et al. (2008) proposed
358 S. Attaoui
We say that the sequence (Wn )n converges a.co. to zero, if and only if ∀η0 >
1
0, n≥1 P(|Wn | > η0 ) < ∞. Furthermore, we say that Wn = Oa.co. (wn ), if there ex-
ists η > 0, such that n≥1 P(|Wn | > ηwn ) < ∞. Note that this type of convergence
implies both the almost-sure convergence and the convergence in probability.
Conditional density and mode for strong mixing
in single index 359
2 Model
Let {(Xi , Yi ), 1 ≤ i ≤ n} be n random variables, identically distributed
as the random pair (X, Y ) with values in H × R, where H is a separable
real Hilbert space with the norm . generated by an inner product < ·, · >.
We consider the semi-metric dθ , associated to the single-index θ ∈ H defined
by ∀x1 , x2 ∈ H : dθ (x1 , x2 ) := | < x1 − x2 , θ > | . Under such topological
structure and for a fixed functional θ, we suppose that the conditional density
of Y given X = x denoted by f (.|x) exists and is given by
with the convention 0/0 = 0, where K and H are kernels function and
hK := hn,K (resp. hH := hn,H ) is a sequence of smoothing parameters
decreasing to zero as n goes to infinity.
3 Main Results
3.1. Pointwise Almost Complete Convergence. Throughout the paper,
when no confusion will be possible, we will denote by C or/and C some
strictly positive generic constants whose values are allowed to change.
We put, for any x ∈ H, and i = 1, . . . , n, Ki (θ, x) := K(h−1
K dθ (x, Xi )),
−1
and for all y ∈ R, Hi (y) := H(hH (y − Yi )). We denote by Bθ (x, hK ) :=
{ x1 ∈ H : | < x − x1 , θ > | ≤ hK }, the ball of center x and radius hK .
Next, to give our main result, we need to introduce the following basic
assumptions
(H2) The conditional density f (θ, y, x) satisfies the Hölder condition, that
is: ∀(y1 , y2 ) ∈ C 2 , ∀(x1 , x2 ) ∈ Nx × Nx ,
|f (θ, y1 , x1 ) − f (θ, y2 , x2 )| ≤ Cθ,x dθ (x1 , x2 )β1 + |y1 − y2 |β2 ,
β1 > 0, β2 > 0.
where
1 n
fN (θ, y, x) = Hi (y)Ki (θ, x),
nhH E [K1 (θ, x)]
i=1
1
n
fD (θ, x) = Ki (θ, x).
nE [K1 (θ, x)]
i=1
fN (θ, y, x)
with the fact that f(θ, y, x) = and E[fD (θ, x)] = 1.
fD (θ, x)
Finally, the proof of Theorem 1 is a direct consequence of the following
Lemmas
Lemma 1. (See Attaoui et al. (2011)) (Lemma 3.3)) Under the Assump-
tions (H1)-(H3), as n goes to infinity, we have
β β
E fN (θ, y, x) − f (θ, y, x) = O(hK1 ) + O(hH2 ). (4)
Furthermore, we have
∞
P fD (θ, x) ≤ 1/2 < ∞. (7)
n=1
Conditional density and mode for strong mixing
in single index 363
θ is given by
Thus, a natural nonparametric estimator of μθ denoted μ
This result can be obtained directly by using the second part of (H12) to-
gether with the following Lemma.
Lemma 4. Under the Assumptions (H1)-(H7) together with (H8)-(H10)
and (H12), we have as n goes to infinity
θ − μθ → 0, a.co.
μ (12)
where
1
n
L(θ) = log f(θ, Yj , Xj ).
n
j=1
Note that, this method has been studied by Delecroix et al. (2003) in the
real case where they showed that this technique has minimal variance among
all estimators. The asymptotic optimality of this procedure in functional
statistic, is an important prospect of the present work.
As an application, this approach can be used for answering the semi-
metric choice question. Indeed, it is well known that, in nonparametric
functional statistic, the projection-type semi-metric is very important for
increasing the concentration property. The functional index model is a par-
ticular case of this family of semi-metric, because it is based on the projection
on one functional direction. So, the estimation procedures of this direction
366 S. Attaoui
References
aı̈t saidi, a., ferraty, f., kassa, r. and vieu, p. (2005). Single functional index model
for a time series. R. Roumaine. Math. Pures et Appl., 50, 321–330.
aı̈t saidi, a., ferraty, f., kassa, r. and vieu, p. (2008). Cross-validated estimation in
the single functional index model. Statistics, 42, 475–494.
attaoui, s., laksaci, a. and ould-saı̈d, e. (2011). A note on the conditional density
estimate in the single functional index model. Statist. Probab. Lett., 81, 45–53.
attouch, m., laksaci, a. and ould-saı̈d, e. (2010). Asymptotic normality of a robust
estimator of the regression function for functional time series data. J. Korean Stat.
Soc., 39, 489–500.
berlinet, a., gannoun, a. and matzner-lober, e. (1998). Normalité asymptotique
d’estimateurs convergents du mode conditionnel. Canad. J of Statist., 26, 365–380.
collomb, g., härdle, w. and hassani, s (1987). A note on prediction via conditional
mode estimation. J. Statist. Plann. and Inf., 15, 227–236.
dabo-niang, s. and laksaci, a. (2007). Estimation nonparamtrique du mode conditionnel
pour variable explicative fonctionnelle. Pub. Inst. Stat. Univ., 3, 27–42.
delecroix, m., härdle, w. and hristache, m. (2003). Efficient estimation in conditional
single-index regression. J. Multivariate Anal., 86, 213–226.
demongeot, j., laksaci, a., madani, f. and mustapha, r. (2010). A Fast Functional
Locally Modeled Conditional Density and Mode for Functional Time-Series. In Recent
advances in functional data analysis and related topics. Contribution to statistics. (F
Ferraty, ed.). Physica.
ezzahrioui, m. and ould saı̈d, e. (2008). Asymptotic normality of the kernel estimators
of the conditional mode for functional data. J. Nonparametric Statist., 20, 1–18.
ezzahrioui, m. and ould saı̈d, e. (2010). Some asymptotic results of a nonparametric
conditional mode estimator for functional time series data. Statist. Neerlandica, 64,
171–201.
ferraty, f., peuch, a. and vieu, p. (2003). Modèle à indice fonctionnel simple. C. R.
Mathématiques Paris, 336, 1025–1028.
ferraty, f., laksaci, a and vieu, p. (2005). Functional time series prediction via condi-
tional mode. C. R., Math., Acad. Sci. Paris, 340, 5, 389–392.
ferraty, f. and vieu, p (2006). Nonparametric functional data analysis. Theory and
Practices. Springer-Verlag.
ferraty, f., laksaci, a and vieu, p. (2006). Estimating some characteristics of the condi-
tional distribution in nonparametric functional models. Statist. Inf. for Stoch. Proc.,
9, 47–76.
ferraty, f. and vieu, p. (2008). Erratum of: Non-parametric models for functional data,
with application in regression, time-series prediction and curve estimation. Non-
paramet. Statist., 20, 187–189.
Conditional density and mode for strong mixing
in single index 367
ferraty, f. (2011). Recent advances in functional data analysis and related topics.
Physica-Verlag.
ferraty, f. and park, j. (2011). Estimation of a Functional Single Index Model. In Recent
advances in functional data analysis and related topics. Contribution to statistics. (F.
Ferraty, ed.). Physica.
gaı̈ffas, s. and lecué, g. (2007). Optimal rates and adaptation in the single-index model
using aggregation. Electr. J of Statist., 1, 538–573.
gasser, t., hall, p. and presnell, b. (1998). Nonparametric estimation of the mode of
a distribution of random curves. J. Roy. Statist. Soc. Ser. B, 60, 681–691.
hall, p. and müller, h.g. (2009). Estimation of functional derivatives. Ann. Statist., 37,
3307–3329.
hristache, m., juditsky, a. and spokoiny, v. (2001a). Direct estimation of the index
coefficient in the single-index model. Ann. Statist., 29, 595–623.
hristache, m., juditsky, a., polzehl, j. and spokoiny, v. (2001b). Structure Adaptive
approach for dimension reduction. Ann. Statist., 29, 1537–1566.
horowitz, j.l. (1996). Semiparametric estimation of a regression model with an unknown
transformation of the dependent variable. Econometr., 64, 103–137.
horowitz, j.l. (2009). Semiparametric and Nonparametric Methods in Econometrics.
Springer Series in Statistics.
laksaci, a., lemdani, m. and ould-saı̈d, e. (2011). Asymptotic results for an L1 -norm
kernel estimator of the conditional quantile for functional dependent data with ap-
plication to climatology. Sankhayā, 73-A, Part 1, 125–141.
lin, z. and lu, c. (1996). Limit theory of mixing dependent random variables. Mathematics
and its applications, Sciences Press, Kluwer Academic Publishers, Beijing.
quintela-del-rı́o, a. and vieu, ph (1997). A nonparametric conditional mode estimate.
J. Nonparametr. Stat., 8, 253–268.
quintela-del-rı́o, a., ferraty, f. and vieu, ph (2011). Nonparametric Conditional Den-
sity Estimation for Functional Data. Econometric Applications. In Recent advances
in functional data analysis and related topics. Contribution to statistics. (F. Ferraty,
ed.). Physica.
ramsay, j.o. and dallzel, c. (1991). Some tools for functional data analysis. J. R. Statist.
Soc. B, 53, 539–572.
ramsay, j.o. and silverman, b.w. (2002). Applied functional data analysis: Methode and
Case Studies. Springer-Verlag, New York.
ramsay, j.o. and silverman, b.w. (2005). Functional data analysis, 2nd edn. Springer,
New York.
rio, e. (2000). Théorie asymptotique des processus aléatoires faiblement dépendants (in
french). Springer, Mathématiques & Applications 31.
rosenblatt, m. (1969). Conditional probability density and regression estimates. In Multi-
variate Analysis II. (P. R. Krishnaiah, ed.). Academic Press, New York N.Y., pp. 25–
31.
roussas, g.g. (1968). On some properties of nonparametric estimates of probability den-
sity functions. Bull. Soc. Math. Greece (N.S.) 9, 29–43.
samanta, m. and thavaneswaran, a. (1990). Nonparametric estimation of the conditional
mode. Comm. Statist. Theory Methods, 19, 4515–4524.
youndjé, e. (1993) Estimation nonparamétrique de la densité conditionnelle par la
méthode du noyau (in french). PhD thesis, University of Rouen.
368 S. Attaoui
Appendix
with
Δi (x) = Hi (y)Ki (θ, x) − E Hi (y)Ki (θ, x) .
So, the proof of this Lemma will be based on the application of the Fuck-
Nagaev exponential-type inequality (see Rio (2000), pp. 87). For that, we
need to evaluate the asymptotic behavior of s2n quantity defined as
n
n
s2n = |Cov(Δi , Δj )| = s2∗
n + nV ar(Δi ).
i=1 j=1
with J1,n and J2,n are the sums of covariance over the sets S1 and S2 respec-
tively, defined by:
J1,n = |Cov(Δi , Δj )|, and J2,n = |Cov(Δi , Δj )|
S1 S2
where
S1 = {(i, j), such that 1 ≤ |i − j| ≤ mn }, and
Conditional density and mode for strong mixing
in single index 369
Then,
J1,n ≤ Ch2H E[Ki (θ, x)Kj (θ, x)]
S1
So,
1/a
φθ,x (hK )
J1,n = O nmn h2H φθ,x (hK ) .
n
370 S. Attaoui
Now, let us focus on the sum over S2 . Since the variable (Δi )1≤i≤n is bounded
(i.e, Δi ∞ < ∞), we can use the Davydov-Rio’s inequality. Wherefore, we
have for all i = j;
Thus,
s2∗
n = J1,n + J2,n
1/a
φθ,x (hK )
≤ Cn mn h2H φθ,x (hK ) + nm−a
n
n
φθ,x (hK ) −1/a
By choosing mn = h−1
H n , we obtain
s2∗
n = o(nhH φθ,x (hK )).
Finally,
n
P fN (θ, y, x) − E fN (θ, y, x) > λ ≤ P Δi (x) > λnhH E[K1 (θ, x)]
i=1
≤ C (A1 + A2 ) .
Conditional density and mode for strong mixing
in single index 371
where
− 2r
(λnhH E[K1 (θ, x)])2
A1 = 4 1+
rs2n
a+1
4cn r
A2 = .
r λnhH E[K1 (θ, x)]
nhH φθ,x (hK ) log n
Setting λ = η , in the term A2 , then we get:
nhH E[K1 (θ, x)]
A2 ≤ Cn−1−ζ . (17)
Because r ∼ C(log n)2 , and the fact that s2n = O(nhH φθ,x (hK )), we can
write
− r2
η 2 log n
A1 ≤ C 1 + = exp{−(η 2 /32) log n} (18)
16r
Finally, by combining results (17) and (18), we get for η large enough
log n
P fN (θ, y, x) − E fN (θ, y, x) > η ≤ Cn−1−ζ .
nhH φθ,x (hK )
372 S. Attaoui
It yields
1 n
fD (θ, x) − E fD (θ, x) = i (x),
Δ
nE[Ki (θ, x)]
i=1
and
n
n
s2
n =
i, Δ
|Cov(Δ j )| = i, Δ
|Cov(Δ j )| +nV ar(Δ
i)
i=1 j=1 i=j
s2∗
n
≤ C i, Δ
Cov(Δ j) + C i, Δ
Cov(Δ j ) + nV ar(Δ
i ),
S1 S2
where
S1 = {(i, j), such that 1 ≤ |i − j| ≤ mn }, and
S2 = {(i, j), such that mn + 1 ≤ |i − j| ≤ n − 1}, mn → ∞.
By (H4), as K is bounded, due to the right inequality in (H7b), we get,
i, Δ
Cov(Δ j ) ≤ C [P((Xi , Xj ) ∈ Bθ (x, hK ) × Bθ (x, hK ))
+P(Xi ∈ Bθ (x, hK )) × P(Xj ∈ Bθ (x, hK ))]
φθ,x (hK ) 1/a
≤ C φθ,x (hK ) + (φθ,x (hK ))2
n
φθ,x (hK ) 1/a
= O φθ,x (hK ) ,
n
and
i, Δ
∀i = j : |Cov(Δ j )| ≤ Cα(|i − j|).
So,
1/a
φθ,x (hK )
sn2∗ ≤ C φθ,x (hK ) +C α(|i − j|)
n
S1 S2
1/a
φθ,x (hK )
≤ Cnmn φθ,x (hK ) + Cn2 m−a
n .
n
Conditional density and mode for strong mixing
in single index 373
−1/a
φθ,x (hK )
Taking, mn = , we obtain
n
i we have
For the variance of Δ
i ) = E(Δ
V ar(Δ 2 ) ≤ CV ar(Ki (θ, x)).
i
Finally,
s2
n = O (nφθ,x (hK )) .
A2 ≤ Cn−1−ζ . (20)
Finally, by combining results (19) and (20), we get for η large enough
log n
P fD (θ, x) − E fD (θ, x) > η
≤ Cn−1−ζ . (21)
nφθ,x (hK )
Now, we have
P fD (θ, x) ≤ 1/2 ≤ P fD (θ, x) − 1 > 1/2
≤ P fD (θ, x) − E fD (θ, x) > 1/2 ,
θ (x)
This allows us to write directly for y = μ
∀ > 0, ∃δ(), P (|μθ (x) − μθ (x)| > ) ≤ P (|f (θ, μθ (x), x) − f (θ, μθ (x), x)| > δ()) .
sup sup fN (θ, y, x) − E fN (θ, y, x) ≤ sup sup fN (θ, y, x) − fN (θ, yk(y) , x)
θ∈ΘH y∈C θ∈ΘH y∈C
T1
+ sup sup fN (θ, yk(y) , x) − fN (tj(θ) , yk(y) , x)
θ∈ΘH y∈C
T2
+ sup sup fN (tj(θ) , yk(y) , x) − E fN (tj(θ) , yk(y) , x)
θ∈ΘH y∈C
T3
+ sup sup E fN (tj(θ) , yk(y) , x) − E fN (θ, yk(y) , x)
θ∈ΘH y∈C
T4
+ sup sup E fN (θ, yk(y) , x) − E fN (θ, yk(y) , x) .
θ∈ΘH y∈C
T5
376 S. Attaoui
• For (T1 ) and (T5 ): By the fact that K is bounded and because the
Lipschitz’s condition of H imposed in (H9), we get
C 1
sup sup fN (θ, y, x) − fN (θ, yk(y) , x) ≤ sup
θ∈ΘH y∈C hH E[K1 (θ, x)] y∈C n
n
yk(y) − Yi
× H y − Yi − H
hH hH
i=1
|y − yk(y) |
≤ sup
y∈C h2H φθ,x (hK )
ln
≤ C .
h2H φθ,x (hK )
Since ln = O n−(3α+1)/2 , α > 0, and by using the second part of (H9), we
get
ln log n
=o .
h2H φθ,x (hK ) nhH φθ,x (hK )
• For (T2 ) and (T4 ): Using the Lipschitzian property of K and the bound-
edness of H, together with the Cauchy-Schwartz’s inequality, we can write
C 1
sup sup fN (tj(θ) , yk(y) , x) − fN (θ, yk(y) , x) ≤ sup
θ∈ΘH y∈C hH E[K1 (θ, x)] θ∈ΘH n
n
× Ki (tj(θ) , x) − Ki (θ, x)
i=1
x − Xi θ − tj(θ)
≤ C sup
θ∈ΘH hH hK φθ,x (hK )
τn
≤ C .
hH φθ,x (hK )
• For (T3 ): It suffices to prove that for some positive real η large enough
∞
log n
P sup sup fN (tj(θ) , yk(y) , x) − E fN (tj(θ) , yk(y) , x) > η < ∞.
n=1 θ∈ΘH y∈C nhH φθ,x (hK )
log n
P T3 > η = P max max fN (tj(θ) , yk(y) , x)
nhH φθ,x (hK ) 1≤j≤dn 1≤k≤zn
log n
−E fN (tj(θ) , yk(y) , x) > η
nhH φθ,x (hK )
≤ dn zn max max P fN (tj(θ) , yk(y) , x)
1≤j≤dn 1≤k≤zn
log n
−E fN (tj(θ) , yk(y) , x) > η
nhH φθ,x (hK )
Secondly, the proof of the probability in the right side of the previous in-
equality runs along the lines of Lemma 2. So, based on this result and by
3
the fact that dn zn ≤ C(τn ln )−1 ≤ n 2 (α+1) , we arrive finally at:
log n
P max max fN (tj(θ) , yk(y) , x) − E fN (tj(θ) , yk(y) , x) > η
1≤j≤dn 1≤k≤zn nhH φθ,x (hK )
3
≤ C n 2 (α+1)−1−ζ
The choice of ζ > 32 (α + 1), combined with equations (24) and (25), allows
to finish the proof of Lemma 6.
Proof of Lemma 7. The proof of this Lemma is derived directly from
result of Lemma 6; case when we take H ≡ 1. Thus, we write
∞
log n
P sup fD (θ, x) − E fD (θ, x) > η =
θ∈ΘH nφθ,x (hK )
n=1
∞
log n
P sup 1 − fD (θ, x) > η < ∞. (26)
θ∈ΘH nφθ,x (hK )
n=1
378 S. Attaoui
Consequently,
1 1
P inf fD (θ, x) ≤ ≤ P sup 1 − fD (θ, x) >
θ∈H 2 θ∈H 2