Data & Variable Transformation: Recode and Transform Variables Summarise Variables and Cases Descriptives and Summaries

sjmisc is an R package that complements dplyr and helps with data transformation tasks and recoding variables. It provides functions for recoding and transforming variables, summarizing variables and cases, and descriptive statistics. The functions are designed to work seamlessly with dplyr and pipes. They follow tidyverse principles by making the data argument first and returning an object of the same type.

Uploaded by

ayrusurya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

89 views

Data & Variable Transformation: Recode and Transform Variables Summarise Variables and Cases Descriptives and Summaries

Uploaded by

ayrusurya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

Data & Variable Descriptives and Summaries Recode and Transform Variables Summarise Variables and Cases

Transformation Most of the sjmisc functions (including recode- Recode functions add a suffix to new variables, The summary functions
with sjmisc Cheat Sheet functions) also work on grouped data frames: so original variables are preserved. mostly mimic base R
library(dplyr) By default, original input data frame and new equivalents, but are de-
efc %>% created variables are returned. Use append = signed to work together
group_by(e16sex, c172code) %>% FALSE to return the recoded variables only. with pipes and dplyr.
sjmisc complements dplyr, and helps with data
transformation tasks and recoding variables. frq(e42dep)
rec(x, ..., rec, as.num = TRUE, var.label = row_sums(x, ..., na.rm = TRUE, var =
sjmisc works together "rowsums", append = FALSE)
seamlessly with dplyr Frequency Tables NULL, val.labels = NULL, append = TRUE,
and pipes. All func- suffix = "_r") Row sums of data frames.
tions are designed to row_sums(efc, c82cop1:c90cop9)
frq(x, ..., sort.frq = c("none", "asc", "desc"), Recode values, return result as numeric,
support labelled data. weight.by = NULL, auto.grp ) character or categorical (factor).
Print frequency tables of (labelled) vectors. Uses rec(mtcars, carb, rec = "1,2=1; 3,4=2; else=3") row_means(x, ..., n, var = "rowmeans",
Design Philosophy variable labels as table header. append = FALSE)
data(efc); frq(efc, e42dep, c161sex) dicho(x, ..., dich.by = "median", as.num = Row means, for at least n valid (non-NA) values.
The design of sjmisc functions follows the FALSE, var.label = NULL, val.labels = NULL, row_means(efc, c82cop1:c90cop9, n = 7)
tidyverse-approach: first argument is always the Use this data set append = TRUE, suffix = "_d")
data (either a data frame or vector), followed by in examples!
variable names to be processed by the functions. Dichotomise variable by median, mean or row_count(x, ..., count, var = "rowcount",
specific value. append = FALSE)
flat_table(data, ..., margin = c("counts",
The returned object for each function equals the dicho(mtcars, disp) Row-wise count # of values in data frames.
"cell", "row", "col"), digits = 2,
type of the data-argument. Also col_count().
show.values = FALSE)
split_var(x, ..., n, as.num = FALSE, row_count(efc, c82cop1:c90cop9, count = 2)
Vector input Print contingency tables of (labelled) vectors.
• If the data-argument is a vector, functions Uses value labels. val.labels = NULL, var.label = NULL,
return a vector. flat_table(efc, e42dep, c172code, e16sex) inclusive = FALSE, append = TRUE, Other Useful Functions
suffix = "_g")
Split variable into equal sized groups. Unlike add_columns() and replace_columns() to
count_na(x, ...) dplyr::ntile(), does not split original categories combine data frames, but either replace or
rec(mtcars$carb, rec = "1,2=1; 3,4=2; else=3")
Print frequency table of tagged NA values. into different values (see examples in ?split_var). preserve existing columns.
library(haven); x <- labelled(c(1:3, split_var(mtcars, mpg, disp, n = 3) set_na() and replace_na() to convert regular
Data frame input tagged_na("a", "a", "z")), labels = into missing values, or vice versa. replace_na()
• If the data-argument is a data frame, functions c("Refused" = tagged_na("a"), "N/A" = also replaces specific tagged NA values only.
return a data frame. tagged_na("z"))) group_var(x, ..., size = 5, as.num = TRUE,
count_na(x) right.interval = FALSE, n = 30, append = remove_var() and var_rename() to remove
TRUE, suffix = "_gr") variables from data frames, or rename variables.
Split variable into groups with equal value range, group_str() to group similar string values. Useful
Descriptive Summary or into a max. # of groups (value range per group for variables with similar, but not identically
is adjusted to match # of groups).
rec(mtcars, carb, rec = "1,2=1; 3,4=2; else=3") descr(x, ..., max.length = NULL) group_var(mtcars, mpg, disp, size = 5) merge_df() to full join data frames and preserve
Descriptive summary of data frames, including group_var(mtcars, mpg, size = "auto", n = 4) value and variable labels.
variable labels in output. to_long() to gather multiple columns in data
-ellipses Argument descr(efc, contains("cop"), max.length = 20) frames from wide into long format.
std(x, ..., robust = "sd", include.fac = FALSE,
Apply functions to a single variable, selected
variables or to a complete data frame.
append = TRUE, suffix = "_z")
Finding Variables in a Data Frame Z-standardise variables. Also center(). Use with %>% and dplyr
Variable selection is powered by select():
Separate variables with comma, or use Use find_var() to search for variables by names, std(efc, e17age, c160age) # use sjmisc-functions in pipes
select-helpers to select variables, e.g. ?rec: value or variable labels. Returns vector/data mtcars %>% select(gear, carb) %>%
frame. rec(rec = "min:3=1; 4:max=2")
recode_to(x, ..., lowest = 0, highest = -1,
rec(mtcars, one_of(c("gear", "carb")), # use sjmisc-function inside mutate
append = TRUE, suffix = "_r0)
rec = "min:3=1; 4:max=2") find_var(efc, pattern = "cop", out = "df" ) mtcars %>% select(gear, carb) %>% mutate(
rec(mtcars, gear, carb, rec = "min:3=1; 4:max=2") # variables with "level" in names and value labels recode_to(mtcars$gear)
find_var(efc, "level", search = "name_value")
CC BY Daniel Lüdecke [email protected] github.com/strengejacke Learn more with browseVignettes("sjmisc") sjmisc 2.7.0 02/18

1581784059781-Robert Glover - No More MR Nice Guy
100% (10)
1581784059781-Robert Glover - No More MR Nice Guy
155 pages
Stata Cheat Sheets
100% (1)
Stata Cheat Sheets
6 pages
r Module 5
No ratings yet
r Module 5
21 pages
Lecture 10 R
No ratings yet
Lecture 10 R
117 pages
R Reference Card
No ratings yet
R Reference Card
1 page
Business Analytics-1: STR (Crew - Data)
No ratings yet
Business Analytics-1: STR (Crew - Data)
16 pages
Introduction To Stata: Li-Pin Juan
No ratings yet
Introduction To Stata: Li-Pin Juan
41 pages
R Module 6 - Data Summarization
No ratings yet
R Module 6 - Data Summarization
25 pages
Importing The Files
No ratings yet
Importing The Files
14 pages
MGMT 469 Helpful Stata Commands
No ratings yet
MGMT 469 Helpful Stata Commands
8 pages
Summary of Basic STATA Commands and Syntax
No ratings yet
Summary of Basic STATA Commands and Syntax
5 pages
MDPN460 Lecture05
No ratings yet
MDPN460 Lecture05
32 pages
STATA Commands
100% (2)
STATA Commands
35 pages
Introduction to R for Business Analytics(1)
No ratings yet
Introduction to R for Business Analytics(1)
7 pages
Stata Data Managment
No ratings yet
Stata Data Managment
79 pages
Zelig For R Cheat Sheet: Plots Vectors
No ratings yet
Zelig For R Cheat Sheet: Plots Vectors
2 pages
Cheat Sheet: With Stata 15
No ratings yet
Cheat Sheet: With Stata 15
6 pages
AllCheatSheets Stata v15 PDF
No ratings yet
AllCheatSheets Stata v15 PDF
6 pages
AllCheatSheets Stata v15
100% (1)
AllCheatSheets Stata v15
6 pages
STATAforEconWorkshop3
No ratings yet
STATAforEconWorkshop3
12 pages
Chapter - 3 Common Statistical Procedure
No ratings yet
Chapter - 3 Common Statistical Procedure
20 pages
R Lectures Chapter 4
No ratings yet
R Lectures Chapter 4
3 pages
MultivariateRGGobi PDF
No ratings yet
MultivariateRGGobi PDF
60 pages
STATA
No ratings yet
STATA
26 pages
Linear Regression Analysis HUDM 5122: Introduction To R Johnny Wang
No ratings yet
Linear Regression Analysis HUDM 5122: Introduction To R Johnny Wang
17 pages
Computational Techniques in Statistics: Exercise 1
No ratings yet
Computational Techniques in Statistics: Exercise 1
5 pages
R study material I
No ratings yet
R study material I
8 pages
R Data Types 8
No ratings yet
R Data Types 8
7 pages
Functions and Packages
No ratings yet
Functions and Packages
7 pages
Getting Started With Your Data: Using Stata
No ratings yet
Getting Started With Your Data: Using Stata
32 pages
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
No ratings yet
X - 15 x-1 2. Print ('Hello Word!') ## (1) "Hello Word!" 3. X - 4 y - 5 Z - X+y Print (Z) 4. X - 4 y - 5 Cat ('The Sum of X and y Is', X+y)
15 pages
CH 3
No ratings yet
CH 3
33 pages
DAL 371 SLID 13 Functions
No ratings yet
DAL 371 SLID 13 Functions
48 pages
R-pres
No ratings yet
R-pres
53 pages
Lesson 7 - The Data Frame
No ratings yet
Lesson 7 - The Data Frame
7 pages
MATLAB Notes
No ratings yet
MATLAB Notes
26 pages
Lec 4
No ratings yet
Lec 4
18 pages
R Programming 101 Part 1
No ratings yet
R Programming 101 Part 1
53 pages
Stata-Syntax Reference
No ratings yet
Stata-Syntax Reference
4 pages
A Short List of The Most Useful R Commands
No ratings yet
A Short List of The Most Useful R Commands
11 pages
All Cheat Sheets
No ratings yet
All Cheat Sheets
5 pages
AllCheatSheets_Stata_v15
No ratings yet
AllCheatSheets_Stata_v15
6 pages
Presentation 1
No ratings yet
Presentation 1
34 pages
WWWWWW WWWWWW WWWWWW WWWWWW WWWW WWWW WWWWWW: Data Transformation With Dplyr
No ratings yet
WWWWWW WWWWWW WWWWWW WWWWWW WWWW WWWW WWWWWW: Data Transformation With Dplyr
2 pages
R Imp Funtions
No ratings yet
R Imp Funtions
10 pages
R Commands: Appendix B
No ratings yet
R Commands: Appendix B
5 pages
Statistics Cheat Sheet
100% (1)
Statistics Cheat Sheet
4 pages
Basics: TH TH TH TH TH TH TH
No ratings yet
Basics: TH TH TH TH TH TH TH
3 pages
4 Overview of R Part 2
No ratings yet
4 Overview of R Part 2
63 pages
MATLAB For Data Processing and Visualization Quick Reference
No ratings yet
MATLAB For Data Processing and Visualization Quick Reference
11 pages
R Module 5
No ratings yet
R Module 5
21 pages
STTN 225 R Summary
No ratings yet
STTN 225 R Summary
18 pages
Using R For Basic Statistical Analysis
No ratings yet
Using R For Basic Statistical Analysis
11 pages
lec_09
No ratings yet
lec_09
16 pages
Unit - 2: Data Manipulation With R & Data Visualization in Watson Studio
No ratings yet
Unit - 2: Data Manipulation With R & Data Visualization in Watson Studio
58 pages
Data Preprocessing
No ratings yet
Data Preprocessing
27 pages
Fall 2005 Statistics 579 R Tutorial: Vectors, Matrices, and Arrays
No ratings yet
Fall 2005 Statistics 579 R Tutorial: Vectors, Matrices, and Arrays
8 pages
R - Tutorial: Matrices Are Vectors
No ratings yet
R - Tutorial: Matrices Are Vectors
13 pages
Exploratory Data Analysis - NOTES
No ratings yet
Exploratory Data Analysis - NOTES
31 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Machine Learning Modelling in R PDF
No ratings yet
Machine Learning Modelling in R PDF
1 page
Data Science in Spark With Sparklyr::: Cheat Sheet
No ratings yet
Data Science in Spark With Sparklyr::: Cheat Sheet
2 pages
MLR PDF
No ratings yet
MLR PDF
2 pages
Quanteda PDF
No ratings yet
Quanteda PDF
2 pages
M YCVUh 5 MFZ SDF Lawcfhe Sa Ye Gsu Upm
No ratings yet
M YCVUh 5 MFZ SDF Lawcfhe Sa Ye Gsu Upm
7 pages
Inbound 7669119689067910280
No ratings yet
Inbound 7669119689067910280
2 pages
Creativity Is
0% (1)
Creativity Is
29 pages
Performance Appraisal System in Small Scale Industries
100% (1)
Performance Appraisal System in Small Scale Industries
63 pages
Guthrie's Theory of Contiguity and Hull's Systematic Behavior Theory
100% (4)
Guthrie's Theory of Contiguity and Hull's Systematic Behavior Theory
12 pages
Graphs of Motion Test
100% (1)
Graphs of Motion Test
11 pages
Characteristics of Successful Employer Brands PDF
No ratings yet
Characteristics of Successful Employer Brands PDF
17 pages
Hensel Recycling HenRy 2017 English
No ratings yet
Hensel Recycling HenRy 2017 English
20 pages
1PH0 1F Exam-Paper 20180523
0% (1)
1PH0 1F Exam-Paper 20180523
32 pages
Journal of Career Assessment-2000-Betz-205-22 PDF
100% (1)
Journal of Career Assessment-2000-Betz-205-22 PDF
18 pages
The Development of Educational Game "Who Wants To Be An Accountant" To Improve Student'S Learning Outcomes
No ratings yet
The Development of Educational Game "Who Wants To Be An Accountant" To Improve Student'S Learning Outcomes
14 pages
A Super Memory: How To Develop
100% (1)
A Super Memory: How To Develop
12 pages
Monitoring Training Load To Understand Fatigue in Athletes
No ratings yet
Monitoring Training Load To Understand Fatigue in Athletes
9 pages
Evaporative Cooling Pads
No ratings yet
Evaporative Cooling Pads
7 pages
Teknik Budidaya Dan Kolam Terpal: Azolla Microphylla PADA MEDIA EMBER
No ratings yet
Teknik Budidaya Dan Kolam Terpal: Azolla Microphylla PADA MEDIA EMBER
5 pages
John Doe v. Holy See, Et Al
No ratings yet
John Doe v. Holy See, Et Al
55 pages
ACS880 Joe+Chen
No ratings yet
ACS880 Joe+Chen
61 pages
Parts of A Research Paper
100% (1)
Parts of A Research Paper
2 pages
Final Wassce Pc1 2025 Intl TT November 25, 2024
No ratings yet
Final Wassce Pc1 2025 Intl TT November 25, 2024
5 pages
Boo
No ratings yet
Boo
6 pages
QQ Ad Practice Test 5
No ratings yet
QQ Ad Practice Test 5
7 pages
W8GS
No ratings yet
W8GS
8 pages
Gian PPT 29oct15
No ratings yet
Gian PPT 29oct15
14 pages
The Official Introduction Guide To EmoTrance 2008 Edition
No ratings yet
The Official Introduction Guide To EmoTrance 2008 Edition
135 pages
STATISTICS - UNIT - 1 - WORKSHEET Class 11
100% (1)
STATISTICS - UNIT - 1 - WORKSHEET Class 11
3 pages
A Post Colonial Critical History of Heart of Darkness
No ratings yet
A Post Colonial Critical History of Heart of Darkness
7 pages
Materializing Memory in Art and Popular Culture
No ratings yet
Materializing Memory in Art and Popular Culture
25 pages
1
No ratings yet
1
123 pages
Uses of Rates and Word Problems Involving Rates 2
No ratings yet
Uses of Rates and Word Problems Involving Rates 2
6 pages

Data & Variable Transformation: Recode and Transform Variables Summarise Variables and Cases Descriptives and Summaries

Uploaded by

Data & Variable Transformation: Recode and Transform Variables Summarise Variables and Cases Descriptives and Summaries

Uploaded by

Data & Variable Descriptives and Summaries Recode and Transform Variables Summarise Variables and Cases

You might also like