COGS9 Syllabus

This document provides an overview of the COGS 9: Introduction to Data Science course for Fall 2017. The course will be taught on Mondays, Wednesdays, and Fridays from 10:00-10:50am in Pepper Canyon Hall 106. The instructor is Bradley Voytek and the TAs are Richard Gao and Isaac Shamie. There will be four assignments worth 15% each, a final project worth 30%, and class participation is 10% of the grade. Topics covered include data/information, Python, data mining, text mining, visualization, statistics, machine learning and more.

COGS 9: Introduction to Data Science

Fall 2017, MWF 10:00-10:50a

Pepper Canyon Hall 106

Instructor: Bradley Voytek ([email protected])

Teaching Assistants: Richard Gao; Isaac Shamie ({rigao; isshamie}@ucsd.edu)
Voytek's Office hours: Mondays, 11:00a-12:00p or by appointment (CSB 169)
TA Office hours: TBA
Final exam date: NO FINAL EXAM, ONLY FINAL PROJECT DEADLINE
Grading: Four assignments (15% each) + Final project (30%) + Participation (10%)

Course Background: Who cares about data? We all should! We are experiencing an
explosion of it: 90% of all digital data didn't exist two years ago. Researchers are
leveraging this data deluge to uncover new insights into human behavior, intelligence and
culture (sometimes with surprising findings). Companies are leveraging these data to
recommend products to purchase, movies to watch, places to go, and things to do. What
are the future implications for data science? Soon, we will move beyond targeted ads and
product recommendations to profound transformations in business, science, and society.

Course Overview: In order to understand data science, we first need to talk about data:
what counts as data and what doesn't? How do you visualize 1,000,000,000 Facebook
friendships? How can you turn numbers on the screen into something meaningful? And
how can data lead us astray?

Topics Covered: In this class I will introduce you to the following topics:
. Data and Information
. Python
. Data-mining
. Text-mining and analytics
. Communication theory
. Human-based computation
. Automated science
. Data visualization and storytelling

Grades: NOTE THIS IS STILL A WORK IN PROGRESS. DETAILS MAY CHANGE,
BUT IF THEY DO, YOU WILL BE GIVEN PLENTY OF ADVANCE NOTICE.

There will be four assignments worth 15% each and a final project worth 30%. 10% of
your grade is for class participation (attendance taken during guest lectures). Late
assignments earn fractional credit: 75% if turned in within one week of the deadline,
and 50% after that until the assignment answers have been posted, after which no late
credit can be earned.
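
The grading arithmetic above can be sketched in a few lines of Python. This is only an illustration, not an official grade calculator: the function names and the (score, days_late, answers_posted) tuple layout are hypothetical, scores are assumed to be fractions between 0 and 1, and the sketch assumes that once answers are posted a late submission earns nothing even if it is less than a week late.

```python
# Weights taken from the syllabus: four assignments at 15% each,
# a final project at 30%, and participation at 10%.
ASSIGNMENT_WEIGHT = 0.15
FINAL_PROJECT_WEIGHT = 0.30
PARTICIPATION_WEIGHT = 0.10


def late_multiplier(days_late, answers_posted):
    """Fraction of credit a submission keeps (hypothetical helper)."""
    if days_late <= 0:
        return 1.0   # on time: full credit
    if answers_posted:
        return 0.0   # assumption: no late credit once answers are posted
    if days_late <= 7:
        return 0.75  # within one week late
    return 0.50      # more than one week late, answers not yet posted


def course_grade(assignments, final_project, participation):
    """Weighted course grade in [0, 1].

    assignments: four (score, days_late, answers_posted) tuples.
    final_project, participation: scores in [0, 1].
    """
    if len(assignments) != 4:
        raise ValueError("expected exactly four assignments")
    assignment_total = sum(
        ASSIGNMENT_WEIGHT * score * late_multiplier(days_late, posted)
        for score, days_late, posted in assignments
    )
    return (
        assignment_total
        + FINAL_PROJECT_WEIGHT * final_project
        + PARTICIPATION_WEIGHT * participation
    )


if __name__ == "__main__":
    example = [
        (0.9, 0, False),   # on time
        (0.8, 3, False),   # three days late: keeps 75%
        (1.0, 10, False),  # ten days late: keeps 50%
        (0.7, 0, False),   # on time
    ]
    print(round(course_grade(example, final_project=0.85, participation=1.0), 3))
```

In this made-up example, the two late assignments are scaled down before the 15% weight is applied, so a perfect score turned in ten days late contributes the same as a 50% score turned in on time.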

A rough guide to what is in each assignment:

. Introduction to Python and handling data
. Exploring data using descriptive statistics, and how not to get fooled
. Visualizing data, and how not to get fooled
. How to get fooled: p-hacking your way to the results you want
. Turn in a draft of the final project, get back comments on it to move you in the
right direction

Final project: The final project is a research report on how you would handle a
complicated analysis from front to back, telling us all about the nitty-gritty, whys, and
hows of the analysis you choose. You'll write about the problems and issues with data
handling and the analysis, and why you choose to overcome the problems in this
particular way. If it's appropriate to the problem (e.g., hypothesis testing), you'll write
about the expected results, but even if not, you'll at least mention the different kinds of
outcomes you might see. You WON'T have to actually perform the analysis, just write
about it. But if you do make it that far, and can present results, that's great and will be
taken into account.

Readings:
. Donoho D, 50 Years of Data Science
. Tukey JW, Exploratory Data Analysis
. Buchanan M, Depths of Learning, Nature Physics 2015
. Krzywinski M & Cairo A, Points of view: Storytelling, Nature Methods 2013

Course Calendar:

Date Title Assignment Due

09-29 Hello world!
10-02 What is Data Science?
10-04 Data and information
10-06 Python
10-09 Culturomics and text-mining
10-11 Geospatial Analysis
10-13 Tentative guest lecture
10-16 Data visualization
10-18 No class!
10-20 Data journalism
10-23 Tentative guest lecture
10-25 Probability and statistics
10-27 Statistical inference
10-30 Algorithms and computability
11-01 Hypothesis-testing vs. exploratory data analysis
11-03 Tentative guest lecture
11-06 Inference errors
11-08 Data extraction
11-10 Veterans Day - No class!
11-13 No class!
11-15 Signals and noise
11-17 Version control and reproducibility
11-20 Tentative guest lecture
11-22 Machine learning
11-24 Cross validation and bootstrapping
11-27 Thanksgiving - No class!
11-29 Databases
12-01 Tentative guest lecture
12-04 Wisdom of the crowds and crowdsourcing
12-06 Privacy and ethics
12-08 The future of Data Science
