Introduction_to_data_analysis
Introduction_to_data_analysis
net/publication/23235068
CITATIONS READS
2 19,714
1 author:
SEE PROFILE
All content following this page was uploaded by Vicki Jean Adams on 21 June 2018.
Categorical Nominal Named categories with no implied order Blood groups, breed, gender, neuter status
Ordinal Ordered categories where the differences between categories Scoring systems, cancer staging, onset of
are not necessarily equal disease (peracute, acute, chronic)
Continuous Interval Equal distances between values but the zero point is arbitrary IQ, ordinal data with equal-appearing categories
Ratio Above as for interval and a meaningful zero; data usually Weight, age, temperature, blood pressure
obtained by measurement
Journal of Small Animal Practice Vol 49 August 2008 Ó 2008 British Small Animal Veterinary Association 375
Editorial
X ¼ independent Categorical Contingency tables & chi-square or Fisher’s T-tests for 2 groups, Analysis of variance
or exposure variable(s) exact tests, logistic regression (ANOVA) for .2 groups
Continuous Logistic regression Correlation, Linear regression
a
Used to evaluate whether or not an event occurred
b
Used to evaluate how much of an outcome occurred
independent or exposure variable was the cardiac disease status of Vicki Adams graduated from the Western College of Veterinary
the cats (normal or affected with hypertrophic cardiomyopathy), Medicine in Saskatoon in 1990 and went on to complete a one-
a binary categorical variable. Additional independent variables year small animal internship at the University of Minnesota.
that were included in the analysis included the R-R interval, After seven years in general and emergency small animal
age and weight (all continuous variables). practice, she returned to the University of Saskatchewan to
do research. Having obtained an MSc in the epidemiology of
rabies in wildlife, Vicki completed a PhD in small animal
Statistical tests epidemiology investigating owner compliance with veterinary
Based on the classification of the independent and dependent var- recommendations and prescribed medications. Vicki started
iables, there are four basic types of data sets that can occur. For working at the Animal Health Trust in January 2003 and is
each of these there are different methods of statistical analysis currently Head of the Small Animal Epidemiology Unit.
available. The statistical models presented in Table 2 include
methods for evaluating how much of an outcome occurred or
whether or not an event occurred. Acknowledgements
Therefore, in the ischaemic myelopathy study a contingency With grateful thanks to Drs Carl Ribble and John Campbell for
table or cross-tabulation with chi-square or Fisher’s exact test is their very wise words.
the appropriate approach to analysis to examine the effect of each
of the independent variables mentioned above with the dichot-
omous categorical outcome variable (successful or unsuccessful References
KOFFAS, H., DUKES-MCEWAN, J., CORCORAN, B. M., MORAN, C. M., FRENCH, A., SBOROS, V.,
outcome). In the echocardiography study with a categorical main SIMPSON, K., ANDERSON, T., & MCDICKEN, W. N. (2008) Colour M-mode tissue
exposure variable, several continuous independent variables and Doppler imaging in healthy cats and cats with hypertrophic cardiomyopathy.
Journal of Small Animal Practice 49, 330-338
a continuous outcome variable, linear regression was an appro- CARDWELL, J. M. (2008) An overview of study design. Journal of Small Animal Prac-
priate approach to the analysis. This study found statistically sig- tice 49, 217-218
DE RISIO, L., ADAMS, V., DENNIS, R., MCCONNELL, F. & PLATT, S. (2008) Association of
nificant differences in several of the tissue Doppler imaging clinical and magnetic resonance imaging findings with outcome in dogs sus-
measurements. pected to have ischemic myelopathy: 50 cases (2000–2006). Journal of the
American Veterinary Medical Association 233, 129-135
This approach can be extended to all types of data, including
‘‘messy’’ data that might include the presence of repeated meas-
urements on individual animals or the occurrence of unbalanced Further reading
data sets due to missing data. The ability to extend this approach DOHOO, I. R., MARTIN, W. & STRYHN, H. (2003) Veterinary epidemiologic research.
Atlantic Veterinary College Inc. University of Prince Edward Island, Prince
allows us to consider some of the more advanced methods of sta- Edward Island, Canada, 706 pp
tistical analysis such as multiple regression (with $2 independent HULLEY, S. B., CUMMINGS, S. R., BROWNER, W. S., GRADY, D., HEARST, N. & NEWMAN, T. B.
(2001) Designing clinical research: An epidemiologic approach. 2nd edn. Lip-
variables that can be a mix of continuous and categorical), mixed pincott Williams & Wilkins, Philadelphia, PA, USA
or multi-level models (with $2 levels of measurements that need KATZ, M. H. (2006) Multivariable Analysis. A Practical Guide for Clinicians. 2nd
edn. Cambridge: Cambridge University Press, Cambridge, UK
to be taken into account, such as when looking at kittens within PETT, M. A. (1997) Nonparametric Statistics For Health Care Research. Sage
litters or clinicians within a practice) and survival analysis. Publications, London, UK
PFEIFFER, D. U. (2002) Veterinary Epidemiology - An Introduction. Available from:
http://www.vetschools.co.uk/EpiVetNet/epidivision/Pfeiffer/files/Epinotes.
pdf (accessed 9 July 2008). Royal Veterinary College, Herts, UK
SACKET, D. L., HAYNES, R. B., GUYATT, G. H. & TUGWELL, P. (1991) Clinical Epidemi-
Vicki Adams ology: A Basic Science for Clinical Medicine. 2nd edn. Little, Brown & Co.,
Animal Health Trust Boston, MA, USA
376 Journal of Small Animal Practice Vol 49 August 2008 Ó 2008 British Small Animal Veterinary Association