Data Science in Practice
Data Science in Practice
Presented by
Yhat
http://yhat.com/
June 2016
hat
hat
APPLICATION 1:
RECOMMENDER SYSTEMS
Recommender systems, also known as
recommender engines, are one of the most
well known applications of data science.
Recommender systems are a subclass of
information filtering systems, systems that
cut through the noise of all options and
present users with just the subset of options
Recommender Systems
Content Filtering
(location, age, gender)
Collaborative Filtering
(previous behavior, similar users)
Recommender
System
energy products.
hat
APPLICATION 2:
CREDIT SCORING
scenes.
APPLICATION 3:
DYNAMIC PRICING
banking institutions.
hat
a flight, or a cab.
Dynamic Pricing
D2
and Python.
D1
P2
P1
APPLICATION 4:
CUSTOMER CHURN
Churn rate describes the rate at which
customers abandon a product or service.
Understanding customers likelihood to churn
is particularly important for subscription-
Q1
Q2
hat
boxes.
Data scientists looking to predict customer
churn may consider a variety of algorithms
for the job, such as support vector machines,
random forest, or k-nearest-neighbors.
Beyond the accuracy of a given model, data
scientists must also balance the tradeoff
between precision (correctly predicting a
APPLICATION 5:
FRAUD DETECTION
Financial technology, or FinTech, companies
offer financial services like banking, investing,
and payment processing via software, rather
than through traditional banking institutions.
hat
realized.
Now what?
So when can we
go live with the
new model?
hat
process.
YHATS ROLE
WHAT IS SCIENCEOPS?
Yhats data science operations system that
eliminates the barrier between data scientists and
engineers
HOW DOES IT WORK?
ScienceOps makes R and Python models accessible
via REST API and provides a platform to monitor,
manage and scale data science models
WHAT IS A REAL USE CASE?
ScienceOps is used by companies around the
globe, including each of those highlighed in the five
applications above
hat
Works Cited
Chiang, Eric. Predicting Customer Churn with Scikit-learn. The Yhat Blog. Yhat, 20 Mar. 2014.
Web.
Huang, Cheng-Lung, Mu-Chen Chen, and Chieh-Jen Wang. Credit Scoring with a Data Mining
Approach Based on Support Vector Machines. Expert Systems with Application 33 (2007):
847-56. Web.
Leskovec, Jure, and Jeffrey Ullman. Recommendation Systems. Mining of Massive Data Sets.
Ed. Anand Rajaraman. 2.1 ed. Cambridge: Cambridge UP, 2014. 307-41. Print.
Phua, Clifton, Vincent Lee, Kate Smith, and Ross Gayler. A Comprehensive Survey of Data
Yhat. Applied Data Science: Practical Guide to Building Data-driven Products beyond Analysts
hat
About
Yhat (pronounced Y-hat) provides an end-to-end data science platform for developing,
deploying, and managing real-time decision APIs.
Yhats flagship product, ScienceOps, enables data scientists to transform static insights into
production-ready decision making APIs that integrate seamlessly with any customer- or
employee-facing app. Yhat also created Rodeo, an open source integrated development
environment (IDE) for Python.
Contact Us
http://yhat.com
[email protected]
(718) 855-2107
45 Main Street
Suite #707
Brooklyn, New York 11201
hat
10