COGS9 Syllabus
COGS9 Syllabus
Course Background: Who cares about data? We all should! We are experiencing an
explosion of it: 90% of all digital data didnt exist two years ago. Researchers are
leveraging this data deluge to uncover new insights into human behavior, intelligence and
culture (sometimes with surprising findings). Companies are leveraging these data to
recommend products to purchase, movies to watch, places to go, and things to do. What
are the future implications for data science? Soon, we will move beyond targeted ads and
product recommendations to profound transformations in business, science, and society.
Course Overview: In order to understand data science, we first need to talk about data:
what counts as data and what doesn't? How do you visualize 1,000,000,000 Facebook
friendships? How can you turn numbers on the screen into something meaningful? And
how can data lead us astray?
Topics Covered: In this class I will introduce you to the following topics:
. Data and Information
. Python
. Data-mining
. Text-mining and analytics
. Communication theory
. Human-based computation
. Automated science
. Data visualization and storytelling
There will be four assignments worth 15% each and a final project worth 30%. 10% of
your grade is for class participation (attendance taken during guest lectures). Late
assignments earn fractional credit (75% within one week late; 50% otherwise up until
assignment answers have been posted after which no late credit can be earned).
Final project: The final project is a research report on how you would handle a
complicated analysis from front to back telling us all about the nitty gritty, whys, and
hows of the analysis you choose. Youll write about the problems and issues with data
handling and the analysis, and why you choose to overcome the problems in this
particular way. If its appropriate to the problem (e.g., hypothesis testing) youll write
about the expected results, but even if not youll at least mention the different kinds of
outcomes you might see. You WONT have to actually perform the analysis, just write
about it. But if you do make it that far, and can present results, thats great and will be
taken into account.
Readings:
. Donoho D, 50 Years of Data Science
. Tukey JW, Exploratory Data Analysis
. Buchanan M, Depths of Learning, Nature Physics 2015
. Krzywinski M & Cairo A, Points of view: Storytelling, Nature Methods 2013
Course Calendar: