General Linear Models in Small Area Estimation: An Assessment in Agricultural Surveys
General Linear Models in Small Area Estimation: An Assessment in Agricultural Surveys
Small area estimation: a simple outline Small area models include random area-specific effects
regression-synthetic area estimators The basic area level model
in
i = 1,...,m
2 E ( ui ) = 0, V ( ui ) = u
1) is the vector of auxiliary data 2) the parameters of interest that are assumed to be related to the vector 1) 3) iid random effects (normal)
GENERAL LINEAR MODELS IN SMALL AREA ESTIMATION: AN ASSESSMENT IN AGRICULTURAL SURVEYS
i = xT + zi ui + e 5) i i
i = 1,...,m
5) this model involves design random variables and, at the same time, the model-based random variables. It is an example of general linear mixed model (GLMM) with diagonal covariance structure
Qualitative data
Qualitative data are becoming relevant in the agricultural economics field for two major reasons: firstly theoretical development stress the relevance of discrete and intrinsically qualitative phenomena, secondly the increasing sophistication of the statistics approach in the field allows economist to draw quantitative conclusions from discrete data The qualitative data about households, including the role of women, services availability, presence/absence of infrastructures are considered as relevant factor in the analysis that require close consideration in any economic model in the field
GENERAL LINEAR MODELS IN SMALL AREA ESTIMATION: AN ASSESSMENT IN AGRICULTURAL SURVEYS
Qualitative data
Agricultural economists are also interested to the social analysis of the rural territory. The segmentation of the universe, based on qualitative variables (such as gender, age, education) becomes relevant to define the dynamics of specific groups and to analyze issues of interest The shift of the policy focus from producers support to rural development in high income countries is one of the major factor determining the new interest in the analysis of the qualitative aspects of agriculture In the contest of qualitative data analyses, both continuos and binary or nonnumeric data are available by the large data sets exploration of some arrays, such as in agricultural census data. The complete exploitation of that large number of informations about farms is often feasible only with some explorative data analyses, in particular homogeneity and correspondence analysis
GENERAL LINEAR MODELS IN SMALL AREA ESTIMATION: AN ASSESSMENT IN AGRICULTURAL SURVEYS
Qualitative data
On the other hand, it is recognized that some complex aspects of farms structure are correctly pointed out if we implement in economic models, at the same time, all possible information Small area statistics are powerful methods in estimating small area farms characteristics, but some agricultural policies need further information, especially those related with particular classes of farms The apparteinance of farms in well-recognized classes, jointly used with other area information, is then a basic policy-makers tool From this standpoint is very useful try to achieve small area random effects models that combine continuous and categorical predictors and use binary response variables. The goal is to estimate proportions of farms
GENERAL LINEAR MODELS IN SMALL AREA ESTIMATION: AN ASSESSMENT IN AGRICULTURAL SURVEYS
One approach in literature was done in the contest of HB framework. He and Sun, 2000, given an example of hierarchical Bayes estimation procedure of a logisticlinear mixed model in hunting success rates at the subarea level for post-season harvest surveys The model implements fixed week effects and random geographic effects, in the contest of autoregressive (AR) and conditional autoregressive (CAR) approach to the analysis of spatial correlations between neighboring sub-areas. The process of estimation needs, as in the case of the GLM represented by the logistic-linear model above, Gibbs sampling procedures
GENERAL LINEAR MODELS IN SMALL AREA ESTIMATION: AN ASSESSMENT IN AGRICULTURAL SURVEYS
We introduce in the paper a Monte Carlo NewtonRaphson ML procedure (McCulloch, 1997) in estimating parameters in the following general logistic-linear mixed model log( p /( 1 p )) = logit( p ) = XT i + ui ik ik ik k The estimation problem in closed form likelihood integral expressions is proposed to solve numerically via Monte Carlo approach. Another problem is how to generate starting values of the parameters in likelihood expressions if, previously, we dont specify the vector of random effects. A natural way to solve the problem is to adopt the Metropolis algorithm, that is a simple Markov Chain Monte Carlo (MCMC) algorithm
GENERAL LINEAR MODELS IN SMALL AREA ESTIMATION: AN ASSESSMENT IN AGRICULTURAL SURVEYS
The basic characteristic of a MCMC is that the sequence of generated points takes a kind of random walk in parameter space, instead of each point being generated, one independently from another Moreover, the probability of jumping from one point to an other depends only on the last point and not on the entire previous history (this is the peculiar property of a Markov chain) The paper shows the Monte Carlo approach to the Newton-Raphson procedure of estimating logistic linear parameters estimation via an iterative procedure that leads to convergent MLE estimates, under assumption of normality
GENERAL LINEAR MODELS IN SMALL AREA ESTIMATION: AN ASSESSMENT IN AGRICULTURAL SURVEYS
Conclusions
In this paper, a Monte Carlo Newton-Raphson algorithm has been outlined, assuming normality of random area effects, in order to approach the MLE estimation issues related to the logistic-linear mixed model, in the context of qualitative small area estimation As generally recognized, the focus of the recent economic theory on qualitative data can be summarize in two major points: the increasing interest in the analysis of discrete phenomena, and the explanatory power of qualitative variable in describing the current trend in the agricultural sector Statistical methods able to convey the qualitative information in the estimation models are able to increase efficiency
GENERAL LINEAR MODELS IN SMALL AREA ESTIMATION: AN ASSESSMENT IN AGRICULTURAL SURVEYS
Conclusions
Due to the availability of large sets of informations about sample units and, in general, local areas of interest, provided by the full exploitation and analysis of the survey questionnaires, one of the most important questions is how to implement large sets of continuous and categorical variables in small area models In fact, many basic informations about units and areas are both continuous and categorical, and in many cases only the categorical ones can lead to appropriate assessments of specific issues From this poin of view, the logistic-linear mixed model can be an useful tool, measuring random area-specific effects and performing satisfactory area level analyses
GENERAL LINEAR MODELS IN SMALL AREA ESTIMATION: AN ASSESSMENT IN AGRICULTURAL SURVEYS
Thank you
Please find much more methodological details in the paper available on the conference website e-mail to: [email protected]