SlideShare a Scribd company logo
Sas - Introduction to working under change management
Introduction ToIntroduction To
SASSAS
Good Data
Management Practices
Four Statistical Packages
• SPSS
• Stata
• R
• SAS
• Point and Click
• Command Line
• Programs (the best way)
Three Ways to Work
Outline
• Sermon on SYNTAX
• Cleaning data and creating variables
• Never overwrite original data
• Practices that will help you keep track of your work
• Safeguarding your work
A Sermon on SYNTAX
• Command line and Point and Click
– Advantages:
• Quick, may require less learning
– Disadvantages:
• Takes longer the second time – you must wade through the
point and click menu rather than just change a word
• You do not have a record of what you have done
SPSS
The King of Point and Click
You can point and click to get files, create variables, change variable
values, and do analysis, and end up without a record of what you
have done. You will be sorry.
Or, you can use Point and Click as an aid as you write programs.
You can copy syntax created by Point and Click into your program.
In SPSS programs are written in a Syntax Window and they have the
extension of .sps when you save them.
You can modify SPSS defaults so that commands will be reflected in the
log. This allows you to copy commands from your log into your
program file. These changes also make debugging easier.
You will find information about how to
modify SPSS at the following URL.
STATA
You can point and click, issue commands on the command line, or
create .do files. “.do” files can store your programs.
R
With R you can point and click, issue
commands on the command line, or
create .R files. “.R” files store your
programs.
Results from P&C are reflected so you
can copy them into your program.
SAS
SAS allows some point and
click, but immediately offers
an editor where you can write
your programs. SAS
programs end with the .sas
extension, and are text files.
SAS features an enhanced
editor with cool color coding
that makes it easier to write
and debug programs.
Never clean data in the data view
Scenario 1:
You get a data set and find errors in it.
You change the values in the data window.
You save it with point and click, over-writing your original data.
Later you try to recall what changes you made, when and why. Of
course you can’t. You can’t even be sure that you made the
“corrections” for the proper cases.
You can’t look back at older data sets to confirm what you did. You
sit there sweating.
Scenario 2 same as Scenario 1 :
You save it with point and click, over-writing your original data and,
while you are saving the file,
1) Your computer goes down because of a power outage OR
2) There is a brief interruption in the network
HALF OF YOUR DATA SET IS LOST.
You cry.
Scenario 3:
You get a data set and find errors in it.
You write a program that:
1) gets the original data
2) makes changes in values with SYNTAX
3) Includes comments about the changes
4) saves the new file in a different name
Science marches forward.
Creating Variables and Recoding
is not the same as Cleaning Data
• You always want clean data
• You may not always want the recoded or created
variables
• Make new variables, but keep the old ones. (don’t
over-write) Use the original to check the new
Examples of Recoding/Creating
• Creating a series of dummies from a categorical variable
• Creating an index from a series of scale variables
• Creating a dichotomous or categorical variable from a continuous
variable
• Always consider MISSING VALUES
Sample SPSS Program
* CleanNew.sps .
* 10/10/05 created dummy for male .
Get file = ‘dirty.sav’ .
* Cleaning data, PJG, looked at survey form, educ for ID=1 should be 16, 10/9/05 .
If id = 1 educ = 16 .
* Create a dummy variable from “gender”.
If gender = ‘m’ male = 1 .
If gender = ‘f’ male = 0 .
If gender = ‘’ male = -9 .
Missing values male (-9) .
Variable label male ‘Male’ .
Value labels male 1 ‘Male’ 0 ‘Female’ .
Save outfile = ‘CleanNew.sav’ / drop gender .
Summary for Cleaning and Creating
variables
• Use syntax (programs) to create and clean variables
• Document when and why in your programs
• Save new file – do not over-write the old
It may be months between the
time that you finish a paper,
submit it, and get to revise it for
publication.
What you will need to know:
• The origin of your variables:
– What is the source for each variable
– How were they created?
• What programs created your final tables?
• What program files created the file you used for your final tables?
Create a Directory for the Project
• For example, c:MA_Thesis
• Store all of the programs and data in that directory and
subdirectories
Naming Conventions
• For every data file you have, you should have a program
file with a corresponding name.
• When you have finished your paper, create a program
file for each table. For example: table1.sas table2.sas
Document your work
• Write comments in your program.
• Put a file in your directory called a_note, readme, or
something similar that includes a brief description of the
project and important information.
Safeguarding your work
• Multiple backups – not all stored in the same basket
• Worry about the future
– Keep up with formats (cards, tapes, floppy disks, CDs, what
next? )
– Store in portable formats
For More Information click below link:
Follow Us on:
http://vibranttechnologies.co.in/sas-classes-in-mumbai.html
Thank You !!!

More Related Content

Viewers also liked (13)

PPT
Actividad3 david a. condori tantani
Antonio Condori
 
PDF
Within and Between Analysis (WABA).
COSTARCH Analytical Consulting (P) Ltd.
 
PPT
Introduction to Stata
Samaa Hazem Hosny
 
PDF
Introduction to Stata
izahn
 
PPTX
STATA - Summary Statistics
stata_org_uk
 
PDF
Introduction to STATA - Ali Rashed
Economic Research Forum
 
PPTX
STATA - Importing Data
stata_org_uk
 
PDF
Data management in Stata
izahn
 
PPTX
STATA - Introduction
stata_org_uk
 
PDF
Graphing stata (2 hour course)
izahn
 
PDF
Introduction to SAS
izahn
 
PPTX
STATA - Panel Regressions
stata_org_uk
 
PPTX
STATA - Time Series Analysis
stata_org_uk
 
Actividad3 david a. condori tantani
Antonio Condori
 
Within and Between Analysis (WABA).
COSTARCH Analytical Consulting (P) Ltd.
 
Introduction to Stata
Samaa Hazem Hosny
 
Introduction to Stata
izahn
 
STATA - Summary Statistics
stata_org_uk
 
Introduction to STATA - Ali Rashed
Economic Research Forum
 
STATA - Importing Data
stata_org_uk
 
Data management in Stata
izahn
 
STATA - Introduction
stata_org_uk
 
Graphing stata (2 hour course)
izahn
 
Introduction to SAS
izahn
 
STATA - Panel Regressions
stata_org_uk
 
STATA - Time Series Analysis
stata_org_uk
 

Similar to Sas - Introduction to working under change management (20)

PDF
Spss basics tutorial
santoshranjan77
 
PPT
SAS BASICS
Bhuwanesh Rawat
 
DOCX
Sample Questions The following sample questions are not in.docx
todd331
 
DOC
Introduction to SAS
Imam Jaffer
 
PDF
STATA_Training_for_data_science_juniors.pdf
AronMozart1
 
PPT
Sas training in hyderabad
Kelly Technologies
 
PDF
Spss tutorial 1
debataraja
 
PDF
Spss tutorial 1
kunkumabala
 
PDF
SPSS introduction Presentation
befikra
 
PPT
5116427.ppt
BAGARAGAZAROMUALD2
 
DOC
Introduction to sas
Dr P Deepak
 
PPTX
Data processing & Analysis: SPSS an overview
ATHUL RAVI
 
PPTX
Tableau Basic Questions
Sooraj Vinodan
 
PPT
INTRODUCTION TO SAS
Bhuwanesh Rawat
 
PDF
Insight
Umakant Bhardwaj
 
PPT
8323 Stats - Lesson 1 - 03 Introduction To Sas 2008
untellectualism
 
PDF
Stata tutorial university of princeton
Douglas Branco Dias Santana
 
PDF
Computer Tools for Academic Research
Miklos Koren
 
PPTX
chapter 1 PhD SPSS FINAL LECTURE. -.pptx
NoreenRafique3
 
PPTX
introduction-stata.pptx
Jacob Pratabaraj
 
Spss basics tutorial
santoshranjan77
 
SAS BASICS
Bhuwanesh Rawat
 
Sample Questions The following sample questions are not in.docx
todd331
 
Introduction to SAS
Imam Jaffer
 
STATA_Training_for_data_science_juniors.pdf
AronMozart1
 
Sas training in hyderabad
Kelly Technologies
 
Spss tutorial 1
debataraja
 
Spss tutorial 1
kunkumabala
 
SPSS introduction Presentation
befikra
 
5116427.ppt
BAGARAGAZAROMUALD2
 
Introduction to sas
Dr P Deepak
 
Data processing & Analysis: SPSS an overview
ATHUL RAVI
 
Tableau Basic Questions
Sooraj Vinodan
 
INTRODUCTION TO SAS
Bhuwanesh Rawat
 
8323 Stats - Lesson 1 - 03 Introduction To Sas 2008
untellectualism
 
Stata tutorial university of princeton
Douglas Branco Dias Santana
 
Computer Tools for Academic Research
Miklos Koren
 
chapter 1 PhD SPSS FINAL LECTURE. -.pptx
NoreenRafique3
 
introduction-stata.pptx
Jacob Pratabaraj
 
Ad

More from Vibrant Technologies & Computers (20)

PPT
Buisness analyst business analysis overview ppt 5
Vibrant Technologies & Computers
 
PPT
SQL Introduction to displaying data from multiple tables
Vibrant Technologies & Computers
 
PPT
SQL- Introduction to MySQL
Vibrant Technologies & Computers
 
PPT
SQL- Introduction to SQL database
Vibrant Technologies & Computers
 
PPT
ITIL - introduction to ITIL
Vibrant Technologies & Computers
 
PPT
Salesforce - Introduction to Security & Access
Vibrant Technologies & Computers
 
PPT
Data ware housing- Introduction to olap .
Vibrant Technologies & Computers
 
PPT
Data ware housing - Introduction to data ware housing process.
Vibrant Technologies & Computers
 
PPT
Data ware housing- Introduction to data ware housing
Vibrant Technologies & Computers
 
PPT
Salesforce - classification of cloud computing
Vibrant Technologies & Computers
 
PPT
Salesforce - cloud computing fundamental
Vibrant Technologies & Computers
 
PPT
SQL- Introduction to PL/SQL
Vibrant Technologies & Computers
 
PPT
SQL- Introduction to advanced sql concepts
Vibrant Technologies & Computers
 
PPT
SQL Inteoduction to SQL manipulating of data
Vibrant Technologies & Computers
 
PPT
SQL- Introduction to SQL Set Operations
Vibrant Technologies & Computers
 
PPT
Sas - Introduction to designing the data mart
Vibrant Technologies & Computers
 
PPT
Teradata - Architecture of Teradata
Vibrant Technologies & Computers
 
PPT
Teradata - Restoring Data
Vibrant Technologies & Computers
 
PPT
Datastage database design and data modeling ppt 4
Vibrant Technologies & Computers
 
PPT
Sql server select queries ppt 18
Vibrant Technologies & Computers
 
Buisness analyst business analysis overview ppt 5
Vibrant Technologies & Computers
 
SQL Introduction to displaying data from multiple tables
Vibrant Technologies & Computers
 
SQL- Introduction to MySQL
Vibrant Technologies & Computers
 
SQL- Introduction to SQL database
Vibrant Technologies & Computers
 
ITIL - introduction to ITIL
Vibrant Technologies & Computers
 
Salesforce - Introduction to Security & Access
Vibrant Technologies & Computers
 
Data ware housing- Introduction to olap .
Vibrant Technologies & Computers
 
Data ware housing - Introduction to data ware housing process.
Vibrant Technologies & Computers
 
Data ware housing- Introduction to data ware housing
Vibrant Technologies & Computers
 
Salesforce - classification of cloud computing
Vibrant Technologies & Computers
 
Salesforce - cloud computing fundamental
Vibrant Technologies & Computers
 
SQL- Introduction to PL/SQL
Vibrant Technologies & Computers
 
SQL- Introduction to advanced sql concepts
Vibrant Technologies & Computers
 
SQL Inteoduction to SQL manipulating of data
Vibrant Technologies & Computers
 
SQL- Introduction to SQL Set Operations
Vibrant Technologies & Computers
 
Sas - Introduction to designing the data mart
Vibrant Technologies & Computers
 
Teradata - Architecture of Teradata
Vibrant Technologies & Computers
 
Teradata - Restoring Data
Vibrant Technologies & Computers
 
Datastage database design and data modeling ppt 4
Vibrant Technologies & Computers
 
Sql server select queries ppt 18
Vibrant Technologies & Computers
 
Ad

Recently uploaded (20)

PDF
NASA A Researcher’s Guide to International Space Station : Physical Sciences ...
Dr. PANKAJ DHUSSA
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
Kit-Works Team Study_20250627_한달만에만든사내서비스키링(양다윗).pdf
Wonjun Hwang
 
PDF
SIZING YOUR AIR CONDITIONER---A PRACTICAL GUIDE.pdf
Muhammad Rizwan Akram
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PPTX
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
PDF
AI Agents in the Cloud: The Rise of Agentic Cloud Architecture
Lilly Gracia
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PDF
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
PDF
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PDF
Future-Proof or Fall Behind? 10 Tech Trends You Can’t Afford to Ignore in 2025
DIGITALCONFEX
 
PDF
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PPT
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
NASA A Researcher’s Guide to International Space Station : Physical Sciences ...
Dr. PANKAJ DHUSSA
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
Kit-Works Team Study_20250627_한달만에만든사내서비스키링(양다윗).pdf
Wonjun Hwang
 
SIZING YOUR AIR CONDITIONER---A PRACTICAL GUIDE.pdf
Muhammad Rizwan Akram
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
AI Agents in the Cloud: The Rise of Agentic Cloud Architecture
Lilly Gracia
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
Future-Proof or Fall Behind? 10 Tech Trends You Can’t Afford to Ignore in 2025
DIGITALCONFEX
 
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 

Sas - Introduction to working under change management

  • 4. Four Statistical Packages • SPSS • Stata • R • SAS
  • 5. • Point and Click • Command Line • Programs (the best way) Three Ways to Work
  • 6. Outline • Sermon on SYNTAX • Cleaning data and creating variables • Never overwrite original data • Practices that will help you keep track of your work • Safeguarding your work
  • 7. A Sermon on SYNTAX • Command line and Point and Click – Advantages: • Quick, may require less learning – Disadvantages: • Takes longer the second time – you must wade through the point and click menu rather than just change a word • You do not have a record of what you have done
  • 8. SPSS The King of Point and Click
  • 9. You can point and click to get files, create variables, change variable values, and do analysis, and end up without a record of what you have done. You will be sorry.
  • 10. Or, you can use Point and Click as an aid as you write programs. You can copy syntax created by Point and Click into your program. In SPSS programs are written in a Syntax Window and they have the extension of .sps when you save them.
  • 11. You can modify SPSS defaults so that commands will be reflected in the log. This allows you to copy commands from your log into your program file. These changes also make debugging easier.
  • 12. You will find information about how to modify SPSS at the following URL.
  • 13. STATA
  • 14. You can point and click, issue commands on the command line, or create .do files. “.do” files can store your programs.
  • 15. R
  • 16. With R you can point and click, issue commands on the command line, or create .R files. “.R” files store your programs. Results from P&C are reflected so you can copy them into your program.
  • 17. SAS
  • 18. SAS allows some point and click, but immediately offers an editor where you can write your programs. SAS programs end with the .sas extension, and are text files. SAS features an enhanced editor with cool color coding that makes it easier to write and debug programs.
  • 19. Never clean data in the data view
  • 20. Scenario 1: You get a data set and find errors in it. You change the values in the data window. You save it with point and click, over-writing your original data. Later you try to recall what changes you made, when and why. Of course you can’t. You can’t even be sure that you made the “corrections” for the proper cases. You can’t look back at older data sets to confirm what you did. You sit there sweating.
  • 21. Scenario 2 same as Scenario 1 : You save it with point and click, over-writing your original data and, while you are saving the file, 1) Your computer goes down because of a power outage OR 2) There is a brief interruption in the network HALF OF YOUR DATA SET IS LOST. You cry.
  • 22. Scenario 3: You get a data set and find errors in it. You write a program that: 1) gets the original data 2) makes changes in values with SYNTAX 3) Includes comments about the changes 4) saves the new file in a different name Science marches forward.
  • 23. Creating Variables and Recoding is not the same as Cleaning Data • You always want clean data • You may not always want the recoded or created variables • Make new variables, but keep the old ones. (don’t over-write) Use the original to check the new
  • 24. Examples of Recoding/Creating • Creating a series of dummies from a categorical variable • Creating an index from a series of scale variables • Creating a dichotomous or categorical variable from a continuous variable • Always consider MISSING VALUES
  • 25. Sample SPSS Program * CleanNew.sps . * 10/10/05 created dummy for male . Get file = ‘dirty.sav’ . * Cleaning data, PJG, looked at survey form, educ for ID=1 should be 16, 10/9/05 . If id = 1 educ = 16 . * Create a dummy variable from “gender”. If gender = ‘m’ male = 1 . If gender = ‘f’ male = 0 . If gender = ‘’ male = -9 . Missing values male (-9) . Variable label male ‘Male’ . Value labels male 1 ‘Male’ 0 ‘Female’ . Save outfile = ‘CleanNew.sav’ / drop gender .
  • 26. Summary for Cleaning and Creating variables • Use syntax (programs) to create and clean variables • Document when and why in your programs • Save new file – do not over-write the old
  • 27. It may be months between the time that you finish a paper, submit it, and get to revise it for publication.
  • 28. What you will need to know: • The origin of your variables: – What is the source for each variable – How were they created? • What programs created your final tables? • What program files created the file you used for your final tables?
  • 29. Create a Directory for the Project • For example, c:MA_Thesis • Store all of the programs and data in that directory and subdirectories
  • 30. Naming Conventions • For every data file you have, you should have a program file with a corresponding name. • When you have finished your paper, create a program file for each table. For example: table1.sas table2.sas
  • 31. Document your work • Write comments in your program. • Put a file in your directory called a_note, readme, or something similar that includes a brief description of the project and important information.
  • 32. Safeguarding your work • Multiple backups – not all stored in the same basket • Worry about the future – Keep up with formats (cards, tapes, floppy disks, CDs, what next? ) – Store in portable formats
  • 33. For More Information click below link: Follow Us on: http://vibranttechnologies.co.in/sas-classes-in-mumbai.html Thank You !!!