DATA SCIENCE
Online Certificate Program

OVERVIEW
R is quickly becoming one of the most popular and effective programming languages
for data science. In this program, you’ll apply data science tools to the collection of
data and the translation of data into information, constructing models that can be
used to address the questions that you’re investigating. You’ll have the opportunity to
apply data analytics as a four-part process: gathering data, looking for patterns in that
data, finding insights in any patterns you discover, and using those insights to make
decisions. This process does not make decisions for you, but it will help you to better
understand the effects of the decisions you might make. Through an examination of
real-world data sets and different modeling techniques, as well as an in-depth look
at how the programming language R can be used to help you find patterns and derive
insights, you will gain valuable experience working in each stage of the data analytics
process, helping you and your organization to make better decisions – and gain a
sound scientific understanding of why you’re making the choices you’re making.

COURSES: 8
COURSE LENGTH: 2 weeks
FORMAT: 100% online

COURSES
• Understanding Data Analytics
• Finding Patterns in Data Using Association Rules, PCA, and Factor Analysis
• Finding Patterns in Data Using Cluster and Hotspot Analysis
• Regression Analysis and Discrete Choice Models
• Supervised Learning Techniques
• Neural Networks and Machine Learning
• Making Data-Driven Recommendations Using Optimization
• Making Predictions Using Simulation

Visit ecornell.cornell.edu

INSIDE the PROGRAM

KEY TAKEAWAYS
• Explore the data analytics process and examine the tools available to improve
decision making
• Use unsupervised learning techniques to help identify patterns in data and
create visualizations to better spot those patterns
• Categorize data using supervised learning algorithms
• Predict the value of continuous variables with linear regression
• Use neural networks to make predictions about new data
• Make forecasts from data collected over time and measure their accuracy
• Create linear programs and simulations to optimize system performance and
dynamics

WHO SHOULD ENROLL

• Current and aspiring data scientists
• Analysts
• Engineers
• Researchers
• Technical managers

WHAT YOU’LL EARN

• Data Science Certificate from Cornell College of
Engineering
• 160 Professional Development Hours (16 CEUs)

COURSE DESCRIPTIONS

UNDERSTANDING DATA ANALYTICS

By some estimates, 90% of the data that has ever existed has been created in the
last two years. This is a staggering figure and has given rise to new challenges and
opportunities in almost every industry: What kind of data do you need to collect to
compete, and how can you make sense of it once you have collected it? As technology
evolves and the volume of data increases, how can you make the best use of all this
information? How can you use the data to help drive your decision making? How can
you make data work for you? How can you ensure your data accurately reflects the
population in which you’re interested?

In this course, you will determine the types of engineering and business questions you
can answer, the kinds of problems you can solve, and the decisions you can make, all
through using data analytics. You will explore best practices for collecting information
so that you can make informed predictions, develop insights, and better inform
organizational decision making, and you will have a chance to apply some of the
concepts to your own work. You will also explore best practices for sampling and examine
how different types of sampling are suited to different situations. Finally, you will
see real-world examples that demonstrate how these tools work and practice sampling
techniques in case-study scenarios.

FINDING PATTERNS IN DATA USING ASSOCIATION RULES, PCA, AND FACTOR ANALYSIS

Visualization is one of the simplest and most effective ways to find patterns in data.
It can help answer questions such as: What is the general range and shape of the data set? Are there
any clusters of observations? Which variables correlate with each other? Are there any
obvious outliers?

As your data set grows in terms of the number of data points and variables, however,
it becomes increasingly difficult to visualize all this information at once. At most, you
can plot data points on a three-dimensional axis and add further distinctions of size,
color, shape, and so on. Yet this can easily become too busy and difficult to read. How,
then, do we find patterns in really big data sets?

In this course, you will explore several powerful and commonly utilized techniques for
distilling patterns from data. You will implement each of these techniques using the
free and open-source statistical programming language R with real-world data sets.
The focus will be on making these methods accessible for you in your own work.

FINDING PATTERNS IN DATA USING CLUSTER AND HOTSPOT ANALYSIS

When you have a large collection of objects, it is often helpful to split it into meaningful
groups or clusters. One example of this would be to identify different types of
customers so that a company can more efficiently route their calls to a helpline. As
a second example, suppose an automobile manufacturer wanted to segment its
market to target its ads more carefully. One approach might be to take a database of
recent car sales, including the social demographics associated with each customer,
and segment the population purchasing each type of automobile into meaningful
groups.

Specialized approaches exist if your data contains information that relates to time
and geography. You can use this additional information to identify geographical and
temporal hotspots. Hotspots are regions of high activity or a high value of a particular
variable. These results can help you focus your attention on a particular region where
a problem is occurring more than usual, such as the incidence of asthma in a large
city. In both cluster and hotspot analysis, the results can help you discover new and
interesting features, problems, and red flags regarding the data being analyzed.

In this course, you will explore several powerful and commonly utilized techniques for
performing both cluster and hotspot analysis. You will implement these techniques
using the free and open-source statistical programming language R with real-world
data sets. The focus will be on making these methods accessible and applicable to your
work.

REGRESSION ANALYSIS AND DISCRETE CHOICE MODELS

A story can play an important role in understanding data. It can help distill complex
information into something manageable — something we can think about easily,
relate to, and use to make decisions. For many problems that we encounter globally,
however, a story that describes what already happened is not precise enough for the
job we want to perform. Often, we would like to use available data to make numerically
accurate predictions about what might happen in the future. This task requires the
construction of mathematical models that are well suited to our real-world problems.

In this course, you will explore several types of statistical models used with data to
make predictions. These models bring with them a whole batch of important concerns,
such as estimation and validation, that make the entire process into both an art and a
science. You will implement each of these techniques using the free and open-source
statistical programming language R with real-world data sets. The focus will be on
making these methods accessible for you in your own work.

SUPERVISED LEARNING TECHNIQUES

Supervised learning is a general term for any machine learning technique that
attempts to discover the relationship between a data set and some associated labels
for prediction. In regression, the labels are continuous numbers. This course will
focus on classification, where the labels are taken from a finite set of numbers or
characters. The prototypical and perhaps best-known example of classification
is image recognition. The goal is to take an image (represented by its pixel values) and
determine what objects are in the image. Is it a dog? A grapefruit? A stop sign?

There are many practical classification tasks, such as determining whether an
individual’s financial history makes them high risk for a loan, whether there is a defect
in a material based on some sensor readings, or whether a new email is spam or not.
These problems share the same basic form and can be solved with many different
types of mathematical, statistical, and probabilistic models developed by the machine
learning community.

In this course, you will explore several powerful and commonly utilized techniques for
supervised learning. You will implement each of these techniques using the free and
open-source statistical programming language R with real-world data sets. The focus
will be on making these methods accessible for you in your own work.

NEURAL NETWORKS AND MACHINE LEARNING

Neural networks, a nonlinear supervised learning modeling tool, have become
hugely popular within the last two decades because they have been successfully
applied to a wide range of problems, including automatic language processing,
image classification, object detection, speech recognition, and pattern recognition.
They are mathematical models that are loosely based on an analogy to the
interconnected neurons in the brain. They take in a vector or matrix of input data
and output either a classification value or an approximation to a functional value.
The beauty is that the relationships between the inputs and outputs can be highly
nonlinear and complex.

In this course, you will explore the mechanics of neural networks and the intricacies
involved in fitting them to data for prediction. Using packages in the free and
open-source statistical programming language R with real-world data sets, you will
implement these techniques. The focus will be on making these methods accessible
for you in your own work.

MAKING DATA-DRIVEN RECOMMENDATIONS USING OPTIMIZATION

Statistics is about using data to estimate certain values and evaluate certain
hypotheses; this makes perfect sense for passively studying how the world works (i.e.,
the scientific method). More often than not, however, we find ourselves wanting to use
this statistical information to make decisions regarding the systems involved. Suppose
we estimate that the demand for jet fuel next month will be greater than normal. How
does this information affect the decision of an oil refinery to purchase crude oil from
their various sources? How does an airline company decide how many flight crews to
employ based on the current flight schedule? How does past sales information across
the U.S. influence a company’s decision about where to place its warehouses?

The quantification and mathematical solution of these types of decision-making
problems are known broadly as optimization. The general features of an optimization
problem are a set of quantifiable decisions that have a quantifiable effect that should
be minimized or maximized (think cost or revenue) and a set of constraints on the
possible values of those decisions. There are many different optimization branches,
but the most prominent, due to its widespread applicability and computational
efficiency, is linear programming, where the objective function and constraints are all
linear.
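
Written out, a linear program can be sketched in the standard form below. The symbols are
illustrative assumptions rather than notation taken from the course: x holds the decisions,
c the per-unit revenue (or cost) of each decision, and A and b encode the constraints.

```latex
\begin{aligned}
\text{maximize}\quad   & c^{\top} x \\
\text{subject to}\quad & A x \le b, \\
                       & x \ge 0
\end{aligned}
```

Both the objective and every constraint are linear in x, which is what makes the problem a
linear program.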

In this course, you will explore the mathematics of linear programs, how to solve them,
and how to evaluate your model. You will implement these techniques using packages
in the free and open-source statistical programming language R to solve real-world
logistical business problems. The focus will be on making these methods accessible
for you in your own work.

MAKING PREDICTIONS USING SIMULATION

Simulation is about quantifying the outcome of specific “what if” questions. What if
the average demand for tickets on a 150-seat aircraft is actually 200? What if people
who have purchased a ticket don’t show up? What if we offered a different number of
economy and first-class tickets? Perhaps most importantly, what effect do these “what
if” scenarios have on total revenue?

As you might guess, many “what if” questions in the real world are fundamentally
uncertain; there is no deterministic formula for predicting exactly how many people
will not show up for a given flight. You can, however, use historical data to estimate no-
show probabilities. Once you conclude that uncertainty plays an important part in your
problem, it may be that you will have to turn to a probabilistic simulation. Running
many replications of the simulation will then help you statistically analyze the system’s
behavior and assess the effects of different design choices.
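
The airline example above can be sketched as a small probabilistic simulation. The sketch
below is written in Python purely for illustration (the course itself uses R), and every
number in it is a hypothetical assumption: a 10% no-show probability, a fare of 200, and a
compensation cost of 600 for each passenger denied boarding. The function names
simulate_flight and mean_revenue are likewise made up for this example.

```python
import random


def simulate_flight(tickets_sold, seats, no_show_prob, fare, bump_cost, rng):
    """Return the net revenue of one simulated flight."""
    # Each ticket holder shows up independently with probability 1 - no_show_prob.
    shows = sum(1 for _ in range(tickets_sold) if rng.random() >= no_show_prob)
    # Passengers beyond the seat count are denied boarding and compensated.
    bumped = max(0, shows - seats)
    return tickets_sold * fare - bumped * bump_cost


def mean_revenue(tickets_sold, replications=2000, seed=0,
                 seats=150, no_show_prob=0.10, fare=200.0, bump_cost=600.0):
    """Average net revenue over many independent replications."""
    rng = random.Random(seed)  # fixed seed so repeated runs give the same answer
    total = 0.0
    for _ in range(replications):
        total += simulate_flight(tickets_sold, seats, no_show_prob,
                                 fare, bump_cost, rng)
    return total / replications


if __name__ == "__main__":
    # Compare a few hypothetical ticket-sales policies for a 150-seat aircraft.
    for tickets in (150, 160, 170, 180):
        print(tickets, round(mean_revenue(tickets), 2))
```

Each call to mean_revenue runs many replications of the same flight, so comparing its
result across ticket counts is one way to assess the effect of a design choice such as how
many tickets to sell.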

In this course, you will explore the intricacies of designing and analyzing probabilistic
simulations. You will also run simulations using packages in the free and open-source
statistical programming language R to solve real-world logistical business problems.
The focus will be on making these methods accessible for you in your own work.
