2023 Stata Lab Session

Prof. Narayani Lasala-Blanco

Stata Lab Session Handout No.1

1. Starting Stata

“Start > Programs > Quantitative Apps > Stata”

A Stata window will pop up.

Stata is composed of four sub-windows: Review, Variables, Results, and Command Windows:

• Command Window (Command Line): All commands can be typed in this window. If you
type in a command and press the “Enter” key, the command will be executed. We will rarely
type commands directly here. Most of the time we will be using a do-file (more on this
below).
• Variables Window: The names of the variables contained in the open dataset appear here.
• Results Window: The results of any command you execute appear in this window.
• Review Window: The commands you have executed appear in this window
(upper left corner).

2. Bring a dataset into Stata

The next thing you do to start your statistical analysis is to bring a data file into Stata. To open
up a data file, click on the “Open” icon which is located in the upper-left corner of the Stata
window as shown below.

A Windows Explorer window will appear (the directory shown may be different from the
example below). Open the (GSS) data files from wherever you saved them (Documents,
Downloads, etc.).

Choose the data you want to read into Stata. We will use the “nes08” data file here.

NOTE: If you are using ANES data you’ll need to register.

Once you have registered and can log in, you will see the links to download the data in
.dta (Stata) format. Download the dataset and go back to Stata.

Once you are in Stata, click Open… and select the downloaded dataset.
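
Opening a file through the menu corresponds to a “use” command, which is the form that belongs in a do-file. The lines below are a minimal sketch, not the handout's own procedure. The file name nes08.dta and its location in the working directory are assumptions, so that line is commented out; the runnable part uses auto.dta, an example dataset that ships with Stata, as a stand-in.

```stata
* Show the current working directory (where Stata looks for files by default)
pwd

* Stand-in: load the example dataset that ships with Stata
sysuse auto, clear

* Hypothetical: load the course dataset instead (file name and location assumed)
* use "nes08.dta", clear

* Confirm what was loaded: number of observations and variables
describe, short
```

The “clear” option drops whatever data is already in memory before loading, so the command also works when another dataset is open.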

Error Message?

If you are working on a computer outside the University Lab, you may see a memory error
message when you try to bring a file into Stata.

This error happens due to a shortage of memory when the dataset is very large. If you see this
message, temporarily increase the memory available to Stata to 4 megabytes (or more) by typing
in the following command. (Stata 12 and later manage memory automatically, so this step is
only needed in older versions.)

. set memory 4m

3. Create a “do-file”

You are required to submit the commands you used for assignments. As we progress in the
course you will see it is easy to get confused about which files correspond to which day of work.

A do-file is a command file that lets you submit several commands to Stata at the same time. It is
important to create a do file because if you need to make changes to your statistical analysis a
do-file simplifies this process. A do-file should contain a list of all (that is, all) the Stata
commands you use in your data analysis, preferably from start (after bringing the data file into
Stata) to finish.
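
As an illustration of what such a file can look like, here is a minimal sketch of a do-file. It is not the do-file for this course: the name lab1.do is hypothetical, and the example dataset auto.dta (shipped with Stata) and its variables mpg and rep78 stand in for the course data.

```stata
* lab1.do : hypothetical file name for this session's do-file
clear                       // start with no data in memory
sysuse auto, clear          // stand-in for the course dataset
codebook mpg                // inspect one variable
tabulate rep78              // frequency table
summarize mpg               // mean and standard deviation
```

Because every step is written down in order, rerunning the file reproduces the whole analysis from the untouched dataset.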

To start a new do-file within Stata, click on the envelope icon on the toolbar or choose
“New Do-file” from the Window pulldown menu (Window > Do-file Editor > New Do-file).

The following window will appear. Save it as you would any other file by going to File > Save as…

Work in the Command window as demonstrated further below. Every time you execute a
command successfully, select it, copy it, and paste it into the do-file window. For
example, in the above screen we have successfully brought the NES 2008 data into Stata (notice
the variables are in the Variables window).

Tip #1: It is quite useful to drag and rearrange the do-file and the remaining Stata windows.
If you have them side by side as shown below, it is easier to use the do-file's
edit-as-you-go advantage.

[Figure: the Stata window and the Do-file Editor window side by side. Execute a command,
select and copy it, then paste it into the do-file.]

Tip #2: Avoid the proliferation of files!!

You should only have 4 files per assignment:

1. A dataset (which you will not modify or save changes to)

2. A do-file (which will contain all the commands you used for your analysis). You may
also have an “output” or “results” file from your data analysis, as a file separate from item 3.

3. A Word or other word processing file, where you will paste your tables (copy the tables
from Stata and paste them into Word in Courier font, size 8)

4. A Word or other word processing file, where you will write your paper (5 pages
maximum of writing), which may have two appendixes: one with the (tables of the)
results you obtained which are not on the main body of the paper, and the second with all
the commands you executed, that is, a copy of your do-file.

In this lab session we will copy and paste each command executed correctly onto the do file.

4. Descriptive Statistics (1)

You can enlarge the Variable and Review windows by clicking and dragging the border line of
each window. The captured picture shows the resized (enlarged) Review and Variable windows.

In the Variable window, you will see the labels for the variables which were hidden before due to
the window size. Resizing allows you to look at the variable labels, if they are provided by the
data-creator.

In the Review window, you see the commands you executed. In the image shown above, the
command for opening the dataset appears. Even if you use the Menu-Bar option instead of typing
in the commands, Stata automatically converts the Menu-Bar activities into commands and
records the commands.

For this example, we use “V085084”, which represents the survey respondent’s self-reported
liberal/conservative political ideology. To get detailed information on this variable, use
the “codebook” command. Type into the Command window:

. codebook V085084

Caution!

Stata is “case sensitive”: it distinguishes uppercase letters from lowercase letters. That is,
“A” and “a” are not the same. In our example, the variable name is “V085084”, which starts
with “V”, not “v”. If you type “codebook v085084” instead of “codebook V085084”, Stata will
report an error because it cannot find a variable with that name.

Then, hit the return key to execute the command.
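
The case-sensitivity caution above can be checked without the course data. The sketch below uses the example dataset auto.dta that ships with Stata, whose variable mpg is lowercase; the dataset and variable are stand-ins, not part of this lab. “capture noisily” runs a command, shows its output, and stores its return code in _rc instead of stopping.

```stata
sysuse auto, clear

* Variable names are case sensitive: "MPG" is not "mpg"
capture noisily codebook MPG
display "return code: " _rc      // nonzero means the command failed

* The correctly cased name works
codebook mpg
```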

Notice that, at the bottom, only some example values and labels are shown (instead of all
values and labels). You can tell because the range of values is “[-9,7]”, i.e. -9 to 7, but
labels are shown only for values -7, 2, 4, and 5. Use the “labelbook” command to see all
labels (this dataset does not have it).

*Note: typing just “codebook” or “labelbook” without a specific variable name requests
detailed information for ALL variables. You do not want to do this!

The information given by the “codebook” and “labelbook” does not include univariate statistics.

To get the tabulation of the frequency distribution, type either of the following (“tab” is an
abbreviation of “tabulate”):

. tab V085084

. tabulate V085084

In the Results window, we see this variable has eleven categories. Seven of them are
“substantive”; the other four are “Refused” (coded -9), “Don’t know” (coded -8), “Haven’t..”
(coded -7), and “No Post Election IW” (coded -2).

The seven substantive categories are: “1” if the respondent is “extremely liberal”, “2” if the
respondent is a “liberal”, and so on. The first four categories (“Refused”, “Don’t know”,
“Haven’t..”, and “No post election..”) will be treated as “missing”, which means the response
data is missing. In some cases, such categories are assigned negative values or very high
values so that they won’t be confused with substantive categories.

To get univariate statistics such as the mean and standard deviation, use the “summarize”
command (“sum” is its abbreviation):

. sum V085084

. summarize V085084

The mean for these categories is 1.126184. However, this is highly distorted due to including the
“missing” categories. We have to recode them, so that Stata designates them as “missing” cases
to exclude in statistical calculations.
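
To see how much the nonresponse codes can distort a mean, here is a small sketch on made-up data. The five values typed in below are hypothetical and are not taken from the NES file; only the variable name is borrowed. The “if” qualifier restricts a command to the observations that satisfy a condition.

```stata
clear
* Hypothetical toy data: two nonresponse codes (-9, -8) and three real answers
input V085084
-9
-8
1
4
7
end

* Mean over all five values, nonresponse codes included
summarize V085084

* Mean over the substantive answers only
summarize V085084 if V085084 > 0
```

Restricting with “if” is a quick check; recoding to missing, as done in the next section, fixes the problem once for every later command.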

4.1 Begin copying and pasting commands into the do-file

Tip #3: When you copy the commands from the Results window and paste them into your
do-file, be careful NOT to include the “.” at the beginning of the line. Otherwise Stata
will not recognize the command.

Always verify that you did not get an error message in the Results window before copying and
pasting a command.

Suppose you type “Codebook” instead of “codebook”. Since Stata is case sensitive, you will
get an error message.

Type the command again correctly and then paste it into your do-file. Always type commands
in the Command window first and then paste them into the do-file.

In the screens above we proceed to select and copy the commands into the do-file window. All
successful commands typed into the Command window (in this and previous handouts in bold
and preceded by “.”) should be in your do file.

Note that, by default, a command may not span more than one line in the do-file.

Tip #4: To add notes to your do-file use asterisks. (You can use just one *, or any number
of them, for various levels of emphasis. Because Stata reads each new line as a new
command, make sure to use an asterisk on each line if you have an extended comment.)
The following code includes a note and then a command:

***This is a note about the variable named liberal

tab liberal

Stata will ignore any line that begins with an asterisk. Now you can put notes and reminders
into the text of your do-file. Including notes also helps others understand what you are
trying to do in your analysis.
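
Besides the asterisk, Stata do-files accept a few other comment forms. The sketch below shows them on the example dataset auto.dta that ships with Stata (a stand-in for the course data). By default each line is one command; the three-slash continuation at the end is one way to split a long command across lines. These forms are meant for do-files rather than for the Command window.

```stata
sysuse auto, clear

* A whole-line comment starts with an asterisk
tabulate rep78          // a comment can also follow a command

/* A block comment
   can span several lines */

* Three slashes continue a long command onto the next line
summarize mpg weight ///
    length
```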

5. Recoding – Treating the Missing Cases

Here, we recode the categories “-9”, “-8”, “-7”, and “-2” to “.”, which is the symbol for
“missing” in Stata. Use the “recode” command.

. recode V085084 (-9 -8 -7 -2 = .), gen(ideol)

Type the name of the variable you want to recode after the “recode” command. Then, in
parentheses, list the values of the original variable you want to recode on the left side of
the equal sign and the value you want to replace them with on the right-hand side. To keep the
original variable intact, create a new variable with the recoded values by using the “gen”
option. Put the name of the newly recoded variable in its parentheses. Here, we name it “ideol”.

Stata reports 735 differences from the original variable because there were 735 observations
in categories -9 to -2 together. This is about 32% of the original 2322 observations.

Now, see the tabulation for this new variable.

. tab ideol

Notice that the “missing” values (-2, -7, -8, and -9) have disappeared. The “tab” command
excludes missing cases automatically. To include the missing cases, add the “missing” option
at the end of the command line.
. tab ideol, missing
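
The recode-then-tabulate steps above can be tried on a tiny made-up dataset before touching the real file. The seven values below are hypothetical; only the variable names and the recode rule come from this section.

```stata
clear
* Hypothetical toy data: four nonresponse codes and three real answers
input V085084
-9
-8
-7
-2
1
4
7
end

* Recode the four nonresponse codes to missing, keeping the original intact
recode V085084 (-9 -8 -7 -2 = .), gen(ideol)

* Without "missing", only the three substantive values are tabulated
tabulate ideol

* With "missing", the four recoded cases appear as "."
tabulate ideol, missing

* Count the missing cases directly
count if missing(ideol)
```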

How does the new mean look? Does it look about right?

. sum ideol

Re-executing a previously executed command:

Move the cursor onto the command in the Review Window that you want to re-run, and double-
click the line. A single-click will make the command just appear in the Command Window.

6. Descriptive Statistics (2)

Now that we have treated missing values properly, let’s look at some other statistics for this
variable. To get “Standard Error of the Mean”, type:

. ci ideol

This command shows not only the standard error of the mean but also the 95% confidence
interval of the estimated mean. (This statistic will be covered in a later class.) In Stata
14.1 and later, the command takes a subcommand: type “ci means ideol”.
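
The standard error of the mean is the standard deviation divided by the square root of the number of observations, so it can be checked by hand. The sketch below does this on the example dataset auto.dta that ships with Stata (a stand-in, not the course data) and assumes Stata 14.1 or later for the “ci means” syntax.

```stata
sysuse auto, clear

* Standard error of the mean = standard deviation / sqrt(number of observations)
quietly summarize mpg
display "SE by hand: " r(sd) / sqrt(r(N))

* Same quantity from ci; Stata 14.1 and later expect the "means" subcommand
ci means mpg
```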

Histograms are often useful graphical ways to display data. Type:

. hist ideol

. histogram ideol
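
For a variable that takes only a few integer values, such as a seven-point scale, the “discrete” option draws one bar per value, and “percent” labels the vertical axis in percentages. A minimal sketch, using the variable rep78 from the example dataset auto.dta (shipped with Stata) as a stand-in for “ideol”:

```stata
sysuse auto, clear

* rep78 takes the integer values 1 to 5, much like a survey scale
histogram rep78, discrete percent
```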

7. Recoding – Dichotomization

Let’s practice recoding (combining or collapsing categories) by dichotomizing the “ideol”
variable into two categories. Suppose you want to focus on the proportion of Liberals vs.
Non-Liberals. All “Moderates” will be classified into the “Non-Liberal” category in this
example. The two commands below are equivalent; run only one of them, because “liberal”
cannot be generated twice.

. recode ideol (1/3 = 1) (4/7 = 0) (miss = .), gen(liberal)

. recode ideol (1 2 3 = 1) (4 5 6 7 = 0) (miss = .), gen(liberal)

Values “1,” “2,” and “3” are recoded as “1” (Liberal), and values “4” through “7” are recoded as
“0” (Non-Liberal). Missing values are recoded to “.”, as they originally were. Notice that you
can use “/” in “recode” as a shortcut for a range, i.e. “1/3” is equivalent to “1 2 3” in this
command. We name the resulting dichotomous variable “liberal”.

Check the result of recoding by typing:

. codebook liberal
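
Another way to check a recode is to cross-tabulate the old variable against the new one, so that every original value can be seen next to the value it was mapped to. The sketch below uses five made-up values for “ideol” (hypothetical data, not the NES file).

```stata
clear
* Hypothetical toy data: four answers on the 1-7 scale and one missing case
input ideol
1
3
4
7
.
end

recode ideol (1/3 = 1) (4/7 = 0) (miss = .), gen(liberal)

* Cross-tabulate old against new values to verify the recode
tabulate ideol liberal, missing
```

Each row of the table should have its count in exactly one column: 1 to 3 under “1”, 4 to 7 under “0”, and missing under “.”.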

8. Documenting your data: how to make labels

Usually it is convenient to give variables short names—like we did for variable “V085084” when
we generated “ideol” and “liberal”—as we will type these names many times. It is also
convenient to have more complete descriptions of the data attached to the dataset so that when
you come back a year later you can remember what everything means. The “label” and “note”
commands are used for this purpose.

The “label” command is used to label the values of the variables, e.g.,

. label define labelname 1 "Liberals" 0 "Non-Liberals"

. label values liberal labelname

Again, to view the changes type:

. tab liberal
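
Labeling works in two steps: “label define” creates a named set of value labels, and “label values” attaches that set to a variable. The sketch below repeats this on three made-up observations and also adds a description of the variable itself with “label variable”, which the handout does not cover. The label-set name liberal_lbl and the description text are illustrative choices, not course requirements.

```stata
clear
* Hypothetical toy data
input liberal
1
0
0
end

* Describe the variable itself
label variable liberal "Liberal (1) vs. non-liberal (0)"

* Define a set of value labels, then attach it to the variable
label define liberal_lbl 1 "Liberals" 0 "Non-Liberals"
label values liberal liberal_lbl

tabulate liberal
```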

You may also use the “note” command to add a note to any variable in the dataset. The first
form below attaches a note to a variable; the second lists the notes attached to it.

. note variable: phrase

. note variable

For example:

. note liberal: Work done on 2008-09-08

. note liberal: All moderates are grouped into “Non-Liberals”

. note liberal

9. Save

Once you have executed all the commands required to get the appropriate statistics for
completing Computer Exercise #2 (and copied and pasted each onto the do-file window) your
do-file for this exercise should look like this:

Save your do-file (File > Save as).

Now test your do-file by executing all of its commands.

Execute the command

. clear

This will clear the dataset from Stata's memory.

Open the dataset again (as shown in Section 2). Once the dataset is loaded, go to the do-file
window and click on the “Do” icon. This will execute, in batch mode, all the commands saved in
the do-file.

Stata should execute all the commands, and you should be able to retrieve all statistics from
the Results window. If error messages appear, you should revise your do-file.

10. How to open a saved do file next session

If you want to modify your do-file in a following session, open the Do-file Editor as shown at
the beginning of the handout (Window > Do-file Editor > New Do-file). An untitled do-file will
pop up. Then open the File menu, click Open…, and search for your “stata lab 1.do” do-file (or
whatever name you have given it).
