How to make Data Science Products
LESSONS FROM TOKOPEDIA
WHO AM I
I started working as a PM at Paytm straight out of college.
For the next 2 years I focused on Paytm consumer products, launching -
Paytm QR Code
Paytm PassBook
Paytm Merchant SDK
Paytm Payments Bank
Paytm Offline Pay etc.
Then I worked with Tokopedia in Indonesia as Growth and Data Product Lead. Products -
Referral and its Fraud Detection
User Clustering and Product Clustering
Marketing Optimization
NLP and Image Classification for Product Catalogue
Currently, I work at Branch Metrics, based in SF, where I am responsible for the Core Platform and Deep Linking product.
SOME MORE INFO
Tokopedia -
Top Indonesian everyday app, with a valuation of $7B
Founded: 6 February 2009
Tokopedia has raised a total of $2.4B in funding over 9 rounds
Top App in Indonesia
Me -
Interested in Data Science, and got my first job accidentally because of it.
Courses - DataCamp, AI Product Manager (Udacity), Machine Learning and Deep Learning courses online.
I spend my time reading Life 3.0, AI Superpowers, The Fourth Age etc.
WHAT IS AI-ML-DL
Image Source - NVIDIA
Artificial Intelligence - Human Intelligence Exhibited by Machines
Machine Learning - An Approach to Achieve Artificial Intelligence
Deep Learning - A Technique for Implementing Machine Learning
LET'S SIMPLIFY THE FLUFF
Prediction Optimization
Classification Regression
Optimization Simulation
Products worked on
Fraud Detection for Referral
Recommendation Engine
NLP based sort for Product Feed
Image Recognition based product cleaning
Marketing automation
Purchase Predictor
Chatbot with Microsoft
Products we will discuss
Recommendation Engine
Marketing automation
RECOMMENDATION
What -
Recommend products to users that they are likely to be interested in, with the goal of increasing Conversion Rate (View -> Purchase).
How -
User Based Clustering
Product Based Clustering
Cluster based Collaborative Filtering
RECOMMENDATION
User Based Clustering
Infer user properties from user parameters such as Purchase, View, Price of Product, Category, Address, Device, Payment Method, Shops, Type of Product etc.
Determine - Religion, SES (socio-economic status), Occupation, Age, Gender, Interests, Marital Status, Kids
Take explicit consent from the user about their data
Give the user an option to opt out of recommendations and data processing
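A minimal sketch of how user-based clustering could work, using a tiny k-means written from scratch. The deck does not say which algorithm was used, so k-means is an assumption here, and the two features (orders per month, average basket value) and the user IDs are hypothetical. The sketch sticks to behavioural features and would only apply to users who have given consent.

```python
from math import dist


def kmeans(points, k, iterations=10):
    """Tiny k-means. Returns (centroids, labels). Starts from the first k points."""
    centroids = [tuple(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assign each user to the nearest centroid.
        labels = [min(range(k), key=lambda c: dist(p, centroids[c])) for p in points]
        # Move each centroid to the mean of its current members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                centroids[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return centroids, labels


# Hypothetical features per user: (orders per month, average basket value in USD).
# Real features should be scaled first so one column does not dominate the distance.
users = {
    "u1": (1.0, 5.0),
    "u2": (2.0, 6.0),
    "u3": (9.0, 40.0),
    "u4": (10.0, 42.0),
}
centroids, labels = kmeans(list(users.values()), k=2)
clusters = dict(zip(users, labels))
```

Here the two light buyers end up in one cluster and the two heavy buyers in the other; the cluster ID can then be used as a segment for recommendations.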
Product Clustering
It is a C2C platform, so anyone can upload anything and write their own description and categories.
Use various techniques -
Session Based clusters
Image Recognition
NLP on Product Title, Description, Category and Sub Category
Why were merchants uploading bad images?
Why were the products incorrectly labeled?
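As a rough illustration of the NLP step on product titles, the sketch below groups listings whose titles share most of their words, using Jaccard similarity on tokens. This is a deliberately simple stand-in, not the technique used at Tokopedia; the function names, the 0.5 threshold and the sample titles are all assumptions.

```python
import re


def tokens(title):
    """Lowercase a product title and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", title.lower()))


def jaccard(a, b):
    """Share of tokens two titles have in common (0.0 to 1.0)."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def group_titles(titles, threshold=0.5):
    """Greedy grouping: add each title to the first group whose first title is similar enough."""
    groups = []
    for title in titles:
        for group in groups:
            if jaccard(title, group[0]) >= threshold:
                group.append(title)
                break
        else:
            groups.append([title])
    return groups


# Hypothetical seller-written titles for the same products.
titles = [
    "iPhone 11 Case Black",
    "Case iPhone 11 black!!",
    "Batik Shirt Men",
    "Men Batik Shirt Long Sleeve",
]
groups = group_titles(titles)
```

With these titles the two iPhone cases land in one group and the two batik shirts in another, despite different word order and punctuation.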
Session Based Recommendation
Cluster Based Collaborative Filtering
Things with similar attributes are going to be of similar interest to a user.
Recommend something based on past similar data.
Can be products (Amazon), content (Netflix), or users (Facebook)
E.g. People who are viewing an iPhone will be interested in an iPhone cover too once they PURCHASE the iPhone, because that's what other SIMILAR users did.
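The iPhone and iPhone cover example can be sketched with item-to-item co-occurrence counts, one of the simplest forms of collaborative filtering. This is not the cluster-based variant named on the slide, only an illustration of "recommend what similar users did"; the basket data and function names are made up.

```python
from collections import Counter, defaultdict
from itertools import permutations


def build_cooccurrence(baskets):
    """Count how often each pair of items appears in the same session or order."""
    co = defaultdict(Counter)
    for basket in baskets:
        for a, b in permutations(set(basket), 2):
            co[a][b] += 1
    return co


def recommend(co, item, top_n=2):
    """Return the items most often seen together with `item`."""
    return [other for other, _ in co[item].most_common(top_n)]


# Hypothetical purchase sessions.
baskets = [
    ["iphone", "iphone_cover"],
    ["iphone", "iphone_cover", "charger"],
    ["iphone", "charger"],
    ["iphone", "iphone_cover"],
    ["batik_shirt", "sarong"],
]
co = build_cooccurrence(baskets)
iphone_recs = recommend(co, "iphone")
```

To make this cluster-based, one option would be to build a separate co-occurrence table per user cluster and look up recommendations in the table for the current user's cluster.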
MARKETING AUTOMATION
What -
Targeting the right user with the right product at the right time through the right channel
Using the Recommender system above and some more data -
Most active platform (App, Mobile Web, Desktop)
Most active channel (Email, SMS, Push Notification, Banner)
Most active time of day, week and month
Then integrate the product recommendations into these marketing channels using APIs
RESULT - ~26% increase in Conversion Rate and ~40% in CTR
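One possible way to derive the "most active channel" and "most active time" per user is to count past engagement events. The sketch below assumes a hypothetical event log of (user_id, channel, hour_of_day) tuples for messages the user opened or clicked; the field names and data are illustrative only.

```python
from collections import Counter, defaultdict


def engagement_profile(events):
    """Pick each user's most-used channel and most active hour from engagement events."""
    channels = defaultdict(Counter)
    hours = defaultdict(Counter)
    for user_id, channel, hour in events:
        channels[user_id][channel] += 1
        hours[user_id][hour] += 1
    return {
        user_id: {
            "channel": channels[user_id].most_common(1)[0][0],
            "hour": hours[user_id].most_common(1)[0][0],
        }
        for user_id in channels
    }


# Hypothetical events: (user_id, channel, hour_of_day).
events = [
    ("u1", "push", 20), ("u1", "push", 20), ("u1", "email", 9),
    ("u2", "sms", 12), ("u2", "email", 8), ("u2", "email", 8),
]
profile = engagement_profile(events)
```

A campaign scheduler could then send u1 a push notification around 20:00 and u2 an email around 08:00, filled with that user's recommended products.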
CHALLENGES
Scale of data - 60M DAU and data in petabytes, so cost is a factor
Edge Case - We blocked the sensitive categories during Ramadan and still ended up showing a magazine with nudity, which led to bashing on social media.
Seasonality - Festivals, Back to School
Measurement - Have the goal, and how you will measure it, in mind and discuss it with the team.
Variance - If you don't include new data, you will get into a spiral. Always have at least 30% variance.
C2C Marketplace - There is no standardization here.
Cultural aspects - Language, Trends, Lifestyle, Spending Power
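The "at least 30% variance" point can be read as reserving part of every recommendation slate for items the model would not have picked, so that new data keeps flowing in. That reading is an interpretation, and the sketch below is one hypothetical way to do it: fill 70% of the slots from the model's ranking and 30% from a pool of fresh items.

```python
import random


def mix_with_exploration(ranked, fresh, slots=10, explore_share=0.3, seed=None):
    """Fill `slots` positions: most from the model's ranking, the rest sampled from `fresh`."""
    rng = random.Random(seed)
    n_explore = min(len(fresh), round(slots * explore_share))
    explore = rng.sample(fresh, n_explore)
    # Take the model's top picks, skipping anything already chosen for exploration.
    exploit = [item for item in ranked if item not in explore][: slots - n_explore]
    result = exploit + explore
    rng.shuffle(result)  # avoid always showing exploration items last
    return result


# Hypothetical inputs: model-ranked items and a pool of new or rarely shown items.
ranked = [f"top_{i}" for i in range(20)]
fresh = [f"new_{i}" for i in range(5)]
feed = mix_with_exploration(ranked, fresh, slots=10, explore_share=0.3, seed=7)
```

Clicks and purchases on the exploration items become training data the model would otherwise never see, which is what breaks the spiral.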
UNDERSTAND THE PROBLEM
Credits - Udacity
EVALUATE THE DATA
Five Vs of Data -
Volume - Broadly speaking, the amount of data that is being produced over any given unit of time
Variety - The level of deviation within your data, which can have both positive and negative effects depending on what it is you're hoping to achieve
Velocity - A term referring to how quickly new data is produced. Velocity can also allude to the concept of drift, or how quickly the data underlying a model can change over time
Veracity - The accuracy of data that is being collected, a trait which can be affected by faulty inputs, poor organization, or a variety of other factors
Value - A holistic measure based on all other underlying characteristics of data and rooted in how likely the data is to help you reach your desired end state
EVALUATE THE PROBLEM
DEPLOY SMALL, MEASURE AND OPTIMIZE
Control Group
Goal and Metrics
Feedback Loop
Improve
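A small sketch of the control group and metric steps, assuming users are split by hashing their ID and the metric is View -> Purchase conversion rate. The salt, the 10% control size and the example counts are invented for illustration and are not figures from the talk.

```python
import hashlib


def in_control_group(user_id, control_percent=10, salt="reco-v1"):
    """Deterministically place about `control_percent`% of users in the control group."""
    digest = hashlib.sha256(f"{salt}:{user_id}".encode()).hexdigest()
    return int(digest, 16) % 100 < control_percent


def conversion_rate(purchases, views):
    """View -> Purchase conversion rate."""
    return purchases / views if views else 0.0


def relative_lift(treatment_rate, control_rate):
    """Relative improvement of treatment over control, e.g. 0.25 means +25%."""
    if control_rate == 0:
        raise ValueError("control rate must be non-zero to compute lift")
    return (treatment_rate - control_rate) / control_rate


# Made-up counts: control users see no recommendations, treatment users do.
control = conversion_rate(purchases=400, views=10_000)    # 4.0%
treatment = conversion_rate(purchases=500, views=10_000)  # 5.0%
lift = relative_lift(treatment, control)
```

Hashing keeps a user in the same group across sessions without storing an assignment table, and changing the salt reshuffles the groups for the next experiment.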
Thank You
prashantmahajan.com
https://www.linkedin.com/in/prashantmahajan31/