0% found this document useful (0 votes)

32 views

Data Management and Analysis For Successful Clinical Research

Here are some issues with this data sheet: - Variables are not clearly defined (e.g. what is 24hrhct?) - Missing data is coded inconsistently (e.g. ?, >, <) - Variable types are mixed (e.g. age has both numeric and text values) - Variable values are inconsistent (e.g. height in both inches and cm) - Variable names are not clear or unique (e.g. both drugs have "blood pressure") - Rows and columns are not properly aligned - Data quality issues like typos or impossible values This sheet would need to be restructured to have a consistent format with clear, unique variable names and

Uploaded by

Ken Khumancha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views

Data Management and Analysis For Successful Clinical Research

Uploaded by

Ken Khumancha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 26

Data Management and Analysis for

Successful Clinical Research

Lily Wang, PhD

Department of Biostatistics
Vanderbilt University
Goals of This Presentation
• Provide an overview on data
management and analysis aspects of
clinical research
• Minimize errors in datasets
• Ensure statistical software packages
will recognize data correctly
• Facilitate efficient data analysis for
projects
2
An Overview of the Process
1. Write the protocol
- consult mentors, colleagues and
visit us to finalize specific aims,
testable hypothesis and study design
2. Create a Data Dictionary
3. Create a Patient Directory
4. Prepare datasets for statistical
analysis
3
An Overview
5. The statisticians will assist with
statistical tests
6. Review results, start thinking about
writing the paper
7. Additional tables and figures
8. Write the paper/abstract

4
Timeline
• For abstract, please send us datasets at
least 4 weeks in advance
• Please contact us even if you don’t
have the dataset ready, so we can
schedule other projects and leave
room for yours

5
1. Writing the Proposal
• Background
• Why this research is important
• Be concise
• Specific Aims, Testable Hypothesis
• Be focused, clearly conceptualized, and
feasible
• The most important section of the proposal
• Consult mentors, colleagues and visit us

6
1. Writing the Proposal
• Methods/Experimental Design
• Participants
• Inclusion/Exclusion Criteria
• Recruiting Process
• How the measurements will be made

7
1. Writing the Proposal
• Challenges/Potential Problems
• Loss to follow up
• Bias - Confounding variables and other
sources
• Human Subjects Protection Plan
• Informed consent
• Adverse events
• Privacy, confidentiality issues

8
Bias
Definition - any systematic error in the
design, conduct or analysis of a study
that results in a mistaken estimate of
an exposure’s effect on the risk of
disease

9
Confounding - definition
In a study of whether factor A is a
cause of disease B, we say a third
factor, factor X is a confounder if
• Factor X is a known risk factor for
disease B
• Factor X is associated with factor A, but
is not a result of factor A

10
Confounding – an example
coffee drinking and pancreatic cancer

11
Confounding – an example
coffee drinking and pancreatic cancer
If an association is observed between
coffee drinking and pancreas cancer,
then
• The coffee => cancer
or
• Smoking is a risk factor for cancer
and smoking is associated with
coffee drinking
12
1. Writing the Proposal
Confounding – ways to deal with it
• in design phase
• match cases to controls on confounding
variables
• in analysis phase
• stratification
• adjustment

13
1. Writing the Proposal
• Statistical Analysis (provided by the
statisticians)
• Sample size/Power calculations
• Analysis Plan

14
1. Writing the Proposal
• A good example
• Dr Malow’s template

15
2. Create a Data Dictionary
Name Description Units Type Values
(Permissible
ranges)
group treatment group discrete 1= placebo, 2=trt

age age in years year continuous 10 – 79

bp_sys systolic blood mmHg continuous 100 – 160
pressure
bp_dias diastolic blood mmHg continuous 80 – 150
pressure
date0 date for baseline date mm/dd/yyyy
assessment

16
3. Create a Patient Directory
ID FirstName LastName Address Phone ...
1 John Smith
2 Mary Ann
3 Joe Kim

• Include any other information you

like to record for reference
• Keep this file to yourself, and don’t
send it to us

17
4. Prepare datasets for Statistical
Analysis – A good example
ID group age sex ht wt bp_sys bp_dias stage race date0 complic
1 1 25 1 61 350 120 80 3 3.0 1/15/1999 0
2 1 65 2 68 161 140 90 2 1.0 2/5/1999 1
3 1 25 1 47 150 160 110 4 2.0 1/15/1998 1
4 1 31 1 66 161 140 105 2 2.0 4/1/1999 0
5 1 42 2 72 177 130 70 2 1.0 2/15/1999 0
6 1 45 2 67 160 120 80 1 2.0 3/6/1999 0
7 1 44 1 72 145 120 80 1 1.0 2/28/1999 0
8 1 55 1 72 161 120 95 4 2.0 6/15/2000 1
9 1 0.5 2 66 174 160 110 3 4.0 12/14/2000 1
10 1 21 2 60 155 190 120 2 2.0 11/14/2000 0

18
4. Prepare datasets for
Statistical Analysis
• First - strip off any confidential
information (name, address, phone #)
• Rows - each subject (sample,
observations)
• Columns - each measurement
(variable)

19
4. Preparing datasets
• Variable Names (column labels)
• No special characters (“<“ etc) except
“_”
• Start with letters, not numbers
• Less than 8 characters
• Should be unique
• No spaces

20
≠

4. Preparing datasets
• Data Values
• Be consistent: “M” ≠“m”, date format,
upper/lower case
• No spaces
• No embedded formula – use “paste
special”, then “paste values”
• Missing data: leave it as blank
• Unless there are different reasons for missing, code
them as different values

21
4. Preparing datasets
• Only 1 variable in each column, use
separate columns for non-mutually
exclusive values
• Derived variables – statisticians can
do those
• Keep all information as continuous
variables, information can’t be
recovered
22
4.Preparing datasets
• It’s OK to have separate data sheets
for demographic info and clinical
measurements
• As long as there is a unique identifier
(ID) that links all data sheets

23
4. Preparing Datasets
• If you are in a hurry
• Record data in a file and call it “Raw_xxx.xls”
• Later transform it into the desired format
• It’s OK to format only those needed for
analysis and send only these variables to the
statisticians
• Good idea: visit us after you’ve entered the
first 5 patients and completed the data
dictionary
24
What’s wrong with this data sheet?
Comparison of Drug A and Drug B
Drug A Age of Patient Patient Height Weight 24hrhct blood pressure tumor Race Date complications
Gender (inches) (pound) stage enrolled

1 25 Male 61" >350 38% 120/80 2-3 Hipanic 1/15/99 no

2 65+ female 5'8" 161 32 140/90 II White 2/05/1999 yes
3 ? Male 120cm 12 >160/110 IV Black Jan 98 yes, pneumonia
4 31 m 5'6" obse 40 140 sys 105 dias ? ican-Americ ?
5 42 f >6 ft normal 39 missing =>2 W Feb 99
6 45 f 5.7 160 29 80/120 NA B last fall n
7 unknown ? 6 145 35 normal 1 W 2/30/99 n
8 55 m 72 161.45 12/39 120/95 4 ican-Americ 6-15-00 y
9 6 months f 66 174 38 160/110 3 Asian 14/12/00 y
10 21 f 5'

Drug B
1 55 m 61 145 normal 120/80 120/90 IV ative Americ 6/20/ 3
2 45 f 4"11 166 ? 135/95 2b none 7/14/99 n
3 32 male 5'13" 171 38 140/80 not staged NA 8/30/99 n
4 44 na 65 ? 40 120/80 2 ? 09/01/00 n
5 66 fem 71 0 41 140/90 4 w Sep 14th y, sepsis
6 71 unknown 172 199 38 >160/110 3 b unknown y, died
7 45 m ? 204 32 140 sys 105 dias 1 b 12/25/00 n
8 34 m NA 145 36 130 3 w July 97 n
9 13 m 66 161 39 166/115 2a w 06/06/99 n
10 66 m 68 176 41 1120/80 3 w 01/21/58 n

Average 45 65 155 38

25
Acknowledgement
• Guideline for data collection and data
entry
http://biostat.mc.vanderbilt.edu/wiki/Main/TheresaScott

• “10 Data Entry Commandments”,

“Spreadsheet from Heaven/Hell”
http://biostat.mc.vanderbilt.edu/wiki/Main/DanielByrne

ICT583 Data Science Applications - Final Assignment - Individual - UPDATED!!! - Explanation
0% (1)
ICT583 Data Science Applications - Final Assignment - Individual - UPDATED!!! - Explanation
5 pages
The Dark Grimoire
64% (33)
The Dark Grimoire
17 pages
Bronch O: OSC Pe Cleaning Guide
100% (1)
Bronch O: OSC Pe Cleaning Guide
1 page
Data analysis 2025
No ratings yet
Data analysis 2025
17 pages
Pima Tutorial
No ratings yet
Pima Tutorial
8 pages
2002-10-04 How To Learn Everything You Ever Wanted To Know About Biostatistics
No ratings yet
2002-10-04 How To Learn Everything You Ever Wanted To Know About Biostatistics
97 pages
Preparing Data For Analysis Using Microsoft Excel
No ratings yet
Preparing Data For Analysis Using Microsoft Excel
8 pages
Preparing Data For Analysis Using Microsoft Excel: Tools and Issues
No ratings yet
Preparing Data For Analysis Using Microsoft Excel: Tools and Issues
9 pages
PPR R101 Session 3 SMART
No ratings yet
PPR R101 Session 3 SMART
36 pages
FETPI2.0 d20 Data Planning Management 2020-06
No ratings yet
FETPI2.0 d20 Data Planning Management 2020-06
62 pages
Biostat Lec Part 3 (SV)
No ratings yet
Biostat Lec Part 3 (SV)
4 pages
Arsh DPT Research Lec 1 Research Proposal
No ratings yet
Arsh DPT Research Lec 1 Research Proposal
32 pages
HCI - Notes-Ch3
100% (1)
HCI - Notes-Ch3
44 pages
Data Practices
No ratings yet
Data Practices
48 pages
How To Write Statistical Analysis Section
No ratings yet
How To Write Statistical Analysis Section
12 pages
2011 02 08 Data Analysis
No ratings yet
2011 02 08 Data Analysis
47 pages
Statistical Analysis
No ratings yet
Statistical Analysis
51 pages
Planning A Research: Dr.M.Logaraj, M.D., Professor of Community Medicine SRM Medical College
No ratings yet
Planning A Research: Dr.M.Logaraj, M.D., Professor of Community Medicine SRM Medical College
36 pages
Pertemuan - 12 Metopen
No ratings yet
Pertemuan - 12 Metopen
40 pages
Lecture1 Introduction To Biostatistics
No ratings yet
Lecture1 Introduction To Biostatistics
18 pages
Biostatistics
No ratings yet
Biostatistics
53 pages
BMR 18
No ratings yet
BMR 18
13 pages
1 - Introduction To Health Care Data Analytics (Bagian 2)
No ratings yet
1 - Introduction To Health Care Data Analytics (Bagian 2)
31 pages
9- Data Management
No ratings yet
9- Data Management
61 pages
Lecture 1
No ratings yet
Lecture 1
38 pages
Biological Data Science Lecture2
No ratings yet
Biological Data Science Lecture2
12 pages
Statistics For Psychologists (Calculating and Interpreting Basic Statistics Using SPSS) - Craig A. Wendorf
No ratings yet
Statistics For Psychologists (Calculating and Interpreting Basic Statistics Using SPSS) - Craig A. Wendorf
96 pages
Data Analysis
No ratings yet
Data Analysis
84 pages
data interpretation
No ratings yet
data interpretation
134 pages
Step 1: Define Your Questions
No ratings yet
Step 1: Define Your Questions
4 pages
4 Healthcare Data Analytics
No ratings yet
4 Healthcare Data Analytics
121 pages
L9 Planning Data Management & Analysis
No ratings yet
L9 Planning Data Management & Analysis
26 pages
27-11 Statistics
No ratings yet
27-11 Statistics
57 pages
Plan For Data Processing and Analysis: Kngmany Chaleunvong
No ratings yet
Plan For Data Processing and Analysis: Kngmany Chaleunvong
21 pages
Introduction To Biostatistics: DR Asim Waris
0% (1)
Introduction To Biostatistics: DR Asim Waris
37 pages
6. Data Management
No ratings yet
6. Data Management
30 pages
1 s2.0 S0895435604002823 Main
No ratings yet
1 s2.0 S0895435604002823 Main
6 pages
Research Methodology and Biostatistics - Syllabus & Curriculum - M.D (Hom) - WBUHS
100% (1)
Research Methodology and Biostatistics - Syllabus & Curriculum - M.D (Hom) - WBUHS
5 pages
Exams Paper B Critical Review Syllabus May2011
No ratings yet
Exams Paper B Critical Review Syllabus May2011
7 pages
Biostatistics Ch.1 – Kopie
No ratings yet
Biostatistics Ch.1 – Kopie
5 pages
Topic 1 - W1-3 Introduction To Biostatistics
No ratings yet
Topic 1 - W1-3 Introduction To Biostatistics
52 pages
Over View of Research Process
No ratings yet
Over View of Research Process
27 pages
Stat Analysis
No ratings yet
Stat Analysis
10 pages
Business Research Methods: Problem Definition and The Research Proposal
No ratings yet
Business Research Methods: Problem Definition and The Research Proposal
34 pages
What Non-Statisticians Need To Know About Statistics in Clinical Trials
No ratings yet
What Non-Statisticians Need To Know About Statistics in Clinical Trials
43 pages
Guidlines Collectingdataviaexcel
No ratings yet
Guidlines Collectingdataviaexcel
14 pages
Module 4 - Assignment Rakesh Thakor
No ratings yet
Module 4 - Assignment Rakesh Thakor
13 pages
Statistics in Research Analysis
No ratings yet
Statistics in Research Analysis
12 pages
FOR Raduate and Edical Tudents: Ntroduction To Iostatistics
No ratings yet
FOR Raduate and Edical Tudents: Ntroduction To Iostatistics
35 pages
School of Public Health: Haramaya University, Chms
100% (1)
School of Public Health: Haramaya University, Chms
40 pages
Last Assignment
No ratings yet
Last Assignment
5 pages
Introduction of EpiInfo
No ratings yet
Introduction of EpiInfo
38 pages
Health Statistics 1 3 1
No ratings yet
Health Statistics 1 3 1
170 pages
Contact Details:: Dr. Joy C. Chavez
No ratings yet
Contact Details:: Dr. Joy C. Chavez
54 pages
What Is Data Analysis?: Making Figures Speak (The Truth!)
No ratings yet
What Is Data Analysis?: Making Figures Speak (The Truth!)
44 pages
MWELESA BIOSTATISTICS
No ratings yet
MWELESA BIOSTATISTICS
125 pages
Research Methods For MSC MPH
No ratings yet
Research Methods For MSC MPH
81 pages
Statistics Lessons
No ratings yet
Statistics Lessons
57 pages
Dirty Data. Clean It Using SAS
No ratings yet
Dirty Data. Clean It Using SAS
40 pages
Research & the Analysis of Research Hypotheses: Volume 2
From Everand
Research & the Analysis of Research Hypotheses: Volume 2
Kathleen Thomas Allan, PhD
No ratings yet
Analyzing Quantitative Data: An Introduction for Social Researchers
From Everand
Analyzing Quantitative Data: An Introduction for Social Researchers
Debra Wetcher-Hendricks
No ratings yet
Statistics Super Review, 2nd Ed.
From Everand
Statistics Super Review, 2nd Ed.
The Editors of REA
5/5 (3)
Essay On Agriculture
100% (2)
Essay On Agriculture
3 pages
A Letter To Your Younger Brother Giving Him Career Prospect in The Banking Sector
No ratings yet
A Letter To Your Younger Brother Giving Him Career Prospect in The Banking Sector
2 pages
Complaint Letter For Bad Roads
0% (1)
Complaint Letter For Bad Roads
1 page
Manipuri Calendar 2016
No ratings yet
Manipuri Calendar 2016
12 pages
Informal Letter PDF
No ratings yet
Informal Letter PDF
5 pages
1s WANGKHEM MANICHAND MENS 4
No ratings yet
1s WANGKHEM MANICHAND MENS 4
2 pages
Clinical Data Management
No ratings yet
Clinical Data Management
5 pages
Reimbursement Claim Form PDF
No ratings yet
Reimbursement Claim Form PDF
2 pages
MSPDCL
No ratings yet
MSPDCL
1 page
Statement of Account: Date Narration Chq./Ref - No. Value DT Withdrawal Amt. Deposit Amt. Closing Balance
No ratings yet
Statement of Account: Date Narration Chq./Ref - No. Value DT Withdrawal Amt. Deposit Amt. Closing Balance
2 pages
Voucher Code: 00040-72774-35595-79166
No ratings yet
Voucher Code: 00040-72774-35595-79166
1 page
Voucher Code: 27703-79236-30073-53534
No ratings yet
Voucher Code: 27703-79236-30073-53534
1 page
Safety Standard One Pager-2
No ratings yet
Safety Standard One Pager-2
19 pages
JFMR Stakeholder Submission For UPR
No ratings yet
JFMR Stakeholder Submission For UPR
12 pages
Ojeda
No ratings yet
Ojeda
10 pages
causes of failure of state of Syria
No ratings yet
causes of failure of state of Syria
23 pages
ĐỀ 29
No ratings yet
ĐỀ 29
9 pages
R and S Phrases
No ratings yet
R and S Phrases
8 pages
Case Presentation: Neurology
No ratings yet
Case Presentation: Neurology
19 pages
Trodat 7097 Fast Drying Flash Ink Black - 718327 - S - Eu - GB
No ratings yet
Trodat 7097 Fast Drying Flash Ink Black - 718327 - S - Eu - GB
11 pages
Introduction To The Assessment of Adolescent and Adult Intelligence
No ratings yet
Introduction To The Assessment of Adolescent and Adult Intelligence
3 pages
P&G Kip Almalak v.O.F
No ratings yet
P&G Kip Almalak v.O.F
1 page
Beige Blue and Red Simple Clean Physical Education Principles of Fitness Training Educational Presentation 1
No ratings yet
Beige Blue and Red Simple Clean Physical Education Principles of Fitness Training Educational Presentation 1
26 pages
The Concept of Wind in Traditional Chinese Medicine: December 2016
No ratings yet
The Concept of Wind in Traditional Chinese Medicine: December 2016
11 pages
Motion Requirements and Proceedures
No ratings yet
Motion Requirements and Proceedures
42 pages
(For Doctors) Policy and Application_Intermediate Exam Sponsorship for Residents
No ratings yet
(For Doctors) Policy and Application_Intermediate Exam Sponsorship for Residents
6 pages
Benedetti F. Placebo and The New Physiology of The Doctor-Patient Relationship
No ratings yet
Benedetti F. Placebo and The New Physiology of The Doctor-Patient Relationship
40 pages
Lerner SF
No ratings yet
Lerner SF
9 pages
Fields of Specialization in Psychology
No ratings yet
Fields of Specialization in Psychology
1 page
High Risk Assessment of Dust
No ratings yet
High Risk Assessment of Dust
17 pages
Grade 10 Health
No ratings yet
Grade 10 Health
33 pages
Anatomi Panggul Dan Ukurannya
No ratings yet
Anatomi Panggul Dan Ukurannya
40 pages
Zbornik Radova Fis Komunikacije 2017 Sajt
No ratings yet
Zbornik Radova Fis Komunikacije 2017 Sajt
352 pages
Sa 1141 PDF
100% (2)
Sa 1141 PDF
66 pages
Ehs Packages New
No ratings yet
Ehs Packages New
132 pages
12th Physical Education Paper 1 and 2 for Cbse
No ratings yet
12th Physical Education Paper 1 and 2 for Cbse
32 pages
Consent Comp Veneers
No ratings yet
Consent Comp Veneers
3 pages
Cici Tugas FIXX
0% (1)
Cici Tugas FIXX
7 pages
Dental Clinic Software For Better Management - Dentsoftware
No ratings yet
Dental Clinic Software For Better Management - Dentsoftware
5 pages
NCP Deficient Knowledge
No ratings yet
NCP Deficient Knowledge
2 pages

Data Management and Analysis For Successful Clinical Research

Uploaded by

Data Management and Analysis For Successful Clinical Research

Uploaded by

Data Management and Analysis for

Successful Clinical Research

Lily Wang, PhD

age age in years year continuous 10 – 79

• Include any other information you

1 25 Male 61" >350 38% 120/80 2-3 Hipanic 1/15/99 no

• “10 Data Entry Commandments”,

You might also like