Python Data Associate Certification Study Guide

Uploaded by

ujjwal1807
Copyright
© All Rights Reserved

Python Data Associate Certification

Study Guide

Please use this study guide to create your certification self-study plan. We’ve included the
objectives you should meet for each assessed competency, with links to relevant practice
assessments.

Associate

Exam PY101: Data Management and Exploratory Analysis

1. Perform standard data import, joining and aggregation tasks


1.1. Import data from flat files
1.2. Combine multiple tables by rows or columns
1.3. Filter data based on different criteria
1.4. Aggregate numeric, categorical variables and dates by groups
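
As an illustration of objectives 1.1 to 1.4, the following is a minimal sketch using only the Python standard library. The CSV contents, column names (`order_id`, `region`, `amount`) and the helper `read_rows` are hypothetical and chosen for illustration; the exam itself may expect a data-analysis library such as pandas, which offers equivalent operations.

```python
import csv
import io
from collections import defaultdict

# Hypothetical flat-file contents. In practice these would come from files on disk,
# e.g. open("sales.csv", newline="") instead of io.StringIO(...).
SALES_CSV = """order_id,region,amount
1,north,120.0
2,south,80.5
3,north,45.0
"""
MORE_SALES_CSV = """order_id,region,amount
4,south,200.0
5,east,15.0
"""


def read_rows(text):
    """Parse CSV text into a list of dicts and convert 'amount' to float."""
    rows = list(csv.DictReader(io.StringIO(text)))
    for row in rows:
        row["amount"] = float(row["amount"])
    return rows


# 1.1 Import data from flat files, 1.2 combine two tables by rows
sales = read_rows(SALES_CSV) + read_rows(MORE_SALES_CSV)

# 1.3 Filter rows on a criterion (orders of at least 50)
large_orders = [row for row in sales if row["amount"] >= 50]

# 1.4 Aggregate a numeric variable by group (total amount per region)
totals_by_region = defaultdict(float)
for row in sales:
    totals_by_region[row["region"]] += row["amount"]
totals_by_region = dict(totals_by_region)

print(totals_by_region)
```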

2. Perform cleaning tasks to prepare data for analysis


2.1. Match strings in a dataset with specific patterns
2.2. Convert values between data types
2.3. Clean categorical and text data by manipulating strings
2.4. Clean date and time data
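
The cleaning objectives 2.1 to 2.4 could be practiced along the lines of the sketch below. The records, field names, the simplified `EMAIL_PATTERN` and the helper `parse_date` are assumptions made for illustration, not part of the guide; the email pattern in particular is deliberately loose and not a full validator.

```python
import re
from datetime import datetime

# Hypothetical messy records, invented for this example.
raw_records = [
    {"email": " Alice@Example.com ", "price": "$1,200.50",
     "category": "Home  Goods", "date": "2023-01-15"},
    {"email": "bob[at]example.com", "price": "300",
     "category": "home goods", "date": "15/02/2023"},
]

# 2.1 A simplified pattern for matching email-like strings
EMAIL_PATTERN = re.compile(r"^[\w.+-]+@[\w-]+\.[\w.]+$")


def parse_date(text):
    """2.4 Try a few known date formats; return None if none of them fits."""
    for fmt in ("%Y-%m-%d", "%d/%m/%Y"):
        try:
            return datetime.strptime(text, fmt).date()
        except ValueError:
            continue
    return None


cleaned = []
for rec in raw_records:
    email = rec["email"].strip().lower()
    cleaned.append({
        "email_valid": bool(EMAIL_PATTERN.match(email)),
        # 2.2 Convert a currency string to float by dropping non-numeric characters
        "price": float(re.sub(r"[^\d.]", "", rec["price"])),
        # 2.3 Normalize case and collapse repeated whitespace in a category label
        "category": " ".join(rec["category"].lower().split()),
        "date": parse_date(rec["date"]),
    })

for row in cleaned:
    print(row)
```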

3. Assess data quality and perform validation tasks


3.1. Identify and replace missing values
3.2. Perform different types of data validation tasks (e.g. consistency, constraints,
range validation, uniqueness)
3.3. Identify and validate data types in a data set
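
A possible way to rehearse objectives 3.1 to 3.3 is sketched below with plain Python. The records, the 0 to 120 age range and the `"unknown"` placeholder are hypothetical choices for this example; appropriate constraints depend on the dataset at hand.

```python
from collections import Counter

# Hypothetical records with a missing age, a missing country, a duplicate id,
# an out-of-range age and an age stored as text.
records = [
    {"id": 1, "age": 34, "country": "US"},
    {"id": 7, "age": None, "country": "US"},
    {"id": 7, "age": 210, "country": "DE"},
    {"id": 4, "age": "41", "country": None},
]

# 3.1 Identify missing values per column, then replace missing countries
missing_counts = {
    key: sum(1 for r in records if r[key] is None) for key in records[0]
}
filled = [
    {**r, "country": r["country"] if r["country"] is not None else "unknown"}
    for r in records
]

# 3.2 Uniqueness check: ids that occur more than once
id_counts = Counter(r["id"] for r in records)
duplicate_ids = [value for value, n in id_counts.items() if n > 1]

# 3.2 Range check: row positions whose integer age falls outside 0..120
out_of_range_rows = [
    i for i, r in enumerate(records)
    if isinstance(r["age"], int) and not 0 <= r["age"] <= 120
]

# 3.3 Type check: row positions where a present age is not an integer
wrong_type_rows = [
    i for i, r in enumerate(records)
    if r["age"] is not None and not isinstance(r["age"], int)
]

print(missing_counts, duplicate_ids, out_of_range_rows, wrong_type_rows)
```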

Related assessment

Importing & Cleaning Data with Python

4. Calculate metrics to effectively report characteristics of data and relationships between features


4.1. Calculate measures of center (e.g. mean, median, mode) for variables
4.2. Calculate measures of spread (e.g. range, standard deviation, variance) for
variables
4.3. Calculate skewness for variables
4.4. Calculate missingness for variables and explain its influence on reporting
characteristics of data and relationships
4.5. Calculate the correlation between variables
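
The metrics in 4.1 to 4.5 can be computed by hand with the standard library, as in the sketch below. The sample values and the helper names `skewness` and `pearson` are assumptions for illustration. The skewness shown is the moment-based (population) coefficient, which is only one of several common definitions, so other tools may report slightly different values.

```python
import math
import statistics

# Hypothetical variable with one missing observation
values = [2, 4, 4, 5, 7, None, 9, 4]
observed = [v for v in values if v is not None]

# 4.1 Measures of center
center = {
    "mean": statistics.mean(observed),
    "median": statistics.median(observed),
    "mode": statistics.mode(observed),
}

# 4.2 Measures of spread (sample variance and standard deviation)
spread = {
    "range": max(observed) - min(observed),
    "variance": statistics.variance(observed),
    "stdev": statistics.stdev(observed),
}


def skewness(xs):
    """4.3 Moment-based skewness: third central moment over variance ** 1.5."""
    mean = statistics.mean(xs)
    m2 = sum((x - mean) ** 2 for x in xs) / len(xs)
    m3 = sum((x - mean) ** 3 for x in xs) / len(xs)
    return m3 / m2 ** 1.5


# 4.4 Missingness: share of observations that are missing
missing_share = values.count(None) / len(values)


def pearson(xs, ys):
    """4.5 Pearson correlation coefficient between two equal-length lists."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


r = pearson([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(center, spread, skewness(observed), missing_share, r)
```

Note that the center and spread figures here are computed on the observed values only, so a large share of missing data can make them unrepresentative of the full variable.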

5. Create data visualizations to demonstrate the characteristics of data


5.1. Create and customize bar charts
5.2. Create and customize box plots
5.3. Create and customize line graphs
5.4. Create and customize histograms
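
The four chart types in 5.1 to 5.4 might be drawn as in the following sketch. It assumes the third-party matplotlib library is installed; the guide does not name a plotting library, so this choice, the data and all labels are illustrative assumptions.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so no display is required
import matplotlib.pyplot as plt

# Hypothetical data, invented for this example
categories = ["north", "south", "east"]
totals = [165.0, 280.5, 15.0]
monthly = [10, 12, 9, 15, 18, 17]
samples = [2, 4, 4, 5, 7, 9, 4, 6, 5, 3]

fig, axes = plt.subplots(2, 2, figsize=(8, 6))

# 5.1 Bar chart with a custom color, title and axis label
bars = axes[0, 0].bar(categories, totals, color="steelblue")
axes[0, 0].set_title("Total sales by region")
axes[0, 0].set_ylabel("Sales")

# 5.2 Box plot of a numeric variable
axes[0, 1].boxplot(samples)
axes[0, 1].set_title("Distribution of sample values")

# 5.3 Line graph with custom markers and line style
axes[1, 0].plot(range(1, 7), monthly, marker="o", linestyle="--")
axes[1, 0].set_title("Monthly trend")
axes[1, 0].set_xlabel("Month")

# 5.4 Histogram with a chosen number of bins
counts, bin_edges, _ = axes[1, 1].hist(samples, bins=4, edgecolor="black")
axes[1, 1].set_title("Histogram of sample values")

fig.tight_layout()
```

Calling `fig.savefig("charts.png")` afterwards would write the figure to a file; the file name is arbitrary.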

6. Identify and reduce the impact of characteristics of data


6.1. Identify when imputation methods should be used and implement them to
reduce the impact of missing data on analysis or modeling
6.2. Describe when a transformation to a variable is required and implement
corresponding transformations
6.3. Describe the differences between types of missingness and identify relevant
approaches to handling types of missingness
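
For objectives 6.1 and 6.2, the sketch below shows one simple imputation (filling with the median) and one common transformation (a base-10 logarithm for a right-skewed variable). The income figures are hypothetical, and median imputation is only one option; whether it is appropriate depends on why the values are missing, which is the subject of 6.3.

```python
import math
import statistics

# Hypothetical right-skewed variable with two missing values
incomes = [32000, 41000, None, 38000, 250000, None, 45000]

# 6.1 Median imputation: the median is less sensitive to the outlier than the mean
observed = [x for x in incomes if x is not None]
median_income = statistics.median(observed)
imputed = [median_income if x is None else x for x in incomes]

# 6.2 Log transformation to compress the long right tail
log_incomes = [math.log10(x) for x in imputed]

print(median_income)
print([round(v, 2) for v in log_incomes])
```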

Related assessment

Data Manipulation with Python
