SlideShare a Scribd company logo
© 2017 IBM Corporation1
Building Data Science Ecosystems
for Smart City and Smart Commerce
Dr. Alex Liu
Chief Data Scientist
Analytics Services
© 2017 IBM Corporation2
Alex Liu Introduction
▪ Chief Data Scientist – Analytics Services at IBM
▪ A Data Scientist Thought Leader
▪ Chief Data Scientist for a few corporations
before joined IBM
▪ Taught advanced data analytics for the
University of South California and the University
of California at Irvine
▪ Consulted for the United Nations, Ingram Micro
…
▪ M.S. and Ph.D. from Stanford University
© 2017 IBM Corporation3
Data Science: Turning Data into VALUE With Models
Data Science produces insights/values via
a complicated processes
a big set of tools
3
BigInsights
(HDFS)
Cloudant
(DBaaS)
dashDB
(Analytics)
Swift
(Object
Storage
)
SQDB
(Managed
DB2)
© 2017 IBM Corporation4
Data Science projects return very valuable results
but a lot failed
▪Netflix, for example, integrates data science
into each part of their business; they
estimate a billion dollars in incremental
value from their personalization and
recommendation alone.
▪Knight Capital Group, for instance, lost
$440 million in 45 minutes after a mistake in
updating a model (New York times).
▪Gartner estimated that 60% of big data
projects fail in 2016, and in 2017.
© 2017 IBM Corporation5
Reasons why data science projects failed
▪Wrong data
▪Wrong question
▪Week stakeholders commitment
▪Lack of diverse expertise per CIO DIVE
▪Many research leads to conclude mismatch among question, data,
model, researchers, tools … as main reason
▪Due to bad coordination of parts involved
© 2017 IBM Corporation6
WHY - Data Science very complicated
Very Complicated enough just for Model Building Stage
• More than 50 different models: SVM, Neural Net, Decision Trees/Forests, Naïve Bayes,
Regression, SMO, k-nearest Neighbor, Clustering, Rules, …
• Combinatorially explosive number of parameter choices per algorithm: kernel type, pruning
strategy, number of trees in a forest, learning rate, …
• Wide variation in performance across different algorithm implementations (e.g., SPSS vs Python
vs WEKA vs SPARK …)
• User-Defined algorithms
• Substantial cost in user and compute time
• User spends time on trying new combinations and parameters
• Computational cost for training a single SVM can exceed 24h
• Selection commonly based on data scientist bias
• Each additional pipeline stage increases complexity dramatically!
© 2017 IBM Corporation7
WHY - challenges for managing data scientists
▪High turn over rate of data scientists (average tenure 18 months)
▪Ever developing new technologies to master
▪Lack of training for data scientists
7
Data Prep
Modeling
DeploymentEvaluation
Exploration
© 2017 IBM Corporation
A data science ecosystem approach
A data science ECOSYSTEM has three basic elements
1) Data portal, 2) Data Science platform, 3) Data Science community
© 2017 IBM Corporation9
The defining characteristics of an ecosystem - mutuality & orchestration
Markets comprise entities that operate
out of individual self-interest
Ecosystems comprise entities that operate
out of orchestrated, mutual shared-interest
A set of individuals or organizations who
exchange products or services within an
environment governed by the laws of supply
and demand
A set of individuals or organizations who
formally or informally operate together to
produce something of greater value for the
mutual benefit of the ecosystem as a whole
Ecosystems exists because operating in an orchestrated environment, participants
can deliver more value within the ecosystem acting together than acting alone
© 2017 IBM Corporation10
▪ Value capture reflects a networked, dynamic,
everyone-to-everyone process of exchange
▪ Ecosystems produce more value as a whole,
than the sum of the individual participants acting
independently
Value creation in a value chain
▪ Value creation is incremental as organizations
cover costs plus some return on assets
▪ Value capture reflects an additive, sequential
process of exchange
Value creation in traditional data science tends to be linear;
value creation in ecosystems tends to be networked …
Cost
plus
return
Cost
plus
Cost
plus
Cost
plus
Cost
plus
Value creation in an ecosystem
© 2017 IBM Corporation11
Ecosystems can yield substantial benefits
New capabilities Improved access Improved Agility
Increase Success Ratio of Data Science Projects
Embrace ecosystems’ strategic potential
Ecosystems enable organizations
to access critical capabilities that
they would otherwise have
difficulty obtaining
Ecosystems support greater
access to new or different
resources such as new talents,
new tools, new data sets
Ecosystems support quick
creation of new types of products,
with different combinations of
organizations and assets
© 2017 IBM Corporation12
1) An ecosystem for city data science
101
010
101
Platform
~ IBM DSX
Events Data Social Media
Analytical
Insights for
Smart Cities
Connecting all
the data
scientists from
a DS
community
Applications
Optimizing Operations Solutions
IoT Data
© 2017 IBM Corporation13
RMDS Communities at IBM Glendale
▪Pasadena/Glendale Meetup Community
▪Local face to face community – more than 1100 members
▪https://www.meetup.com/RMDS_LA/
▪https://www.linkedin.com/groups/1895501 has 29K participants
2 former CDOs and the current CDO are members, also presented their work here.
Aim to create an environment for utilizing big data analytics to make smart cities
© 2017 IBM Corporation14
105,000+
collections
349 citizen apps
500,000 data
resources
175 agencies
450 APIs
Source: City of LA Mayor’s Tech Advisor Presentation
at RMDS Meetup.
Data.gov
citizen data science ecosystem with
open data
© 2017 IBM Corporation15
IBM Data Science Experience
Community Open Source IBM Added Value
Powered by IBM Watson Data Platform
• Find tutorials and datasets
• Connect with Data Scientists
• Ask questions
• Read articles and papers
• Fork and share projects
• Code in Scala/Python/R/SQL
• Jupyter Notebooks
• RStudio IDE and Shiny
• Apache Spark
• Your favorite libraries
• Watson Machine Learning
• SPSS Modeler Canvas
• Advanced Visualizations
• Projects and Version Control
• Managed Spark Service
* Closed beta
a data science platform – the IBM Data Science Experience
- now IBM Watson Studio Local
https://datascience.ibm.com/
IBM Confidential16
© 2017 IBM Corporation17
Overview
IBM Data Science Experience (DSX – now Watson Studio Local) is a social environment
where you can collaborate to solve data challenges with the best tools and latest expertise.
▪ Community
▪ Projects
▪ Notebooks
▪ Data Asset
▪ Model Management
▪ Admin console
Forrester Research Ranks IBM as a Leader in
Multimodal Predictive Analytics and Machine Learning
IBM puts AI to work. IBM Watson is a vast umbrella of
technologies and solutions, one of which is Watson Studio, a
PAML solution. Watson Studio was designed from the ground
up to aesthetically blend SPSS-inspired workflow capabilities
with open source machine learning libraries and notebook-
based interfaces.
It is designed for all collaborators — business stakeholders, data
engineers, data scientists, and app developers — who are key
to making machine learning models surface into production
applications. Watson Studio offers easy integrated access to
IBM Cloud pretrained machine learning models such as Visual
Recognition, Watson Natural Language Classifier, and many
others.
It is a perfectly balanced PAML solution for enterprise data
science teams that want the productivity of visual tools and
access to the latest open source via a notebook-based coding
interface.
Source: “The Forrester WaveTM: Multimodal Predictive Analytics And Machine
Learning Solutions, Q3 2018”, Forrester Research, September 2018
Los Angeles City as the Best Using Data in the United States
19
EX2 – IBM Weather Data
1km Visible (GOES-R will be even better)
http://www.ibm.com/weather
EX: Weather Data Serving Retails
Weather Data + WATSON Studio + RMDS Community
A data science ecosystem with weather data
101
010
101
Platform
~ IBM DSX
Weather Data Transaction
Analytical
Insights for
Smart
Commerces
Connecting all
the data
scientists from
a DS
community
Applications
Optimizing Operations Solutions
IoT Data
Five steps for building successful data science ecosystems
Know your
ecosystem and
identify new
opportunities
Recognize your
capabilities and
identify your gaps
Identify ecosystem
value and how you
might capture it
Make connections
in pursuit of your
objectives
Measure your
success and decide
next steps
▪ Understand your value chain
▪ Understand your ecosystem
▪ Identify and prioritize
ecosystem value
pools
▪ Understand your capabilities
relative to your business
model
▪ Determine what to invest in
and what to partner for
▪ Choose the right partner
▪ Engage in the right way
▪ Define measurement
model and collect data
▪ Refine business model
and ecosystem
partnerships
1
2
34
5
IBM Analytics University 2018
Notices and disclaimers
Copyright © 2018 by International Business Machines Corporation (IBM).
No part of this document may be reproduced or transmitted in any form
without written permission from IBM.
U.S. Government Users Restricted Rights — use, duplication or disclosure
restricted by GSA ADP Schedule Contract with IBM.
Information in these presentations (including information relating to products
that have not yet been announced by IBM) has been reviewed for accuracy as
of the date of initial publication and could include unintentional technical or
typographical errors. IBM shall have no responsibility to update this
information. This document is distributed “as is” without any warranty,
either express or implied. In no event shall IBM be liable for any damage
arising from the use of this information, including but not limited to, loss of
data, business interruption, loss of profit or loss of opportunity.
IBM products and services are warranted according to the terms and
conditions of the agreements under which they are provided.
IBM products are manufactured from new parts or new and used parts.
In some cases, a product may not be new and may have been previously
installed. Regardless, our warranty terms apply.”
Any statements regarding IBM's future direction, intent or product plans
are subject to change or withdrawal without notice.
Performance data contained herein was generally obtained in a controlled,
isolated environments. Customer examples are presented
as illustrations of how those customers have used IBM products and
the results they may have achieved. Actual performance, cost, savings or other
results in other operating environments may vary.
References in this document to IBM products, programs, or services does not
imply that IBM intends to make such products, programs or services available
in all countries in which IBM operates or does business.
Workshops, sessions and associated materials may have been prepared by
independent session speakers, and do not necessarily reflect the
views of IBM. All materials and discussions are provided for informational
purposes only, and are neither intended to, nor shall constitute legal or other
guidance or advice to any individual participant or their specific situation.
It is the customer’s responsibility to insure its own compliance with legal
requirements and to obtain advice of competent legal counsel as to
the identification and interpretation of any relevant laws and regulatory
requirements that may affect the customer’s business and any actions
the customer may need to take to comply with such laws. IBM does not
provide legal advice or represent or warrant that its services or products will
ensure that the customer is in compliance with any law.
IBM Analytics University 2018
Notices and disclaimers
continued
Information concerning non-IBM products was obtained from the suppliers
of those products, their published announcements or other publicly available
sources. IBM has not tested those products in connection with this
publication and cannot confirm the accuracy of performance, compatibility
or any other claims related to non-IBM products. Questions on the
capabilities of non-IBM products should be addressed to the suppliers of
those products. IBM does not warrant the quality of any third-party
products, or the ability of any such third-party products to interoperate with
IBM’s products. IBM expressly disclaims all warranties, expressed or
implied, including but not limited to, the implied warranties of
merchantability and fitness for a particular, purpose.
The provision of the information contained herein is not intended to, and
does not, grant any right or license under any IBM patents, copyrights,
trademarks or other intellectual property right.
IBM, the IBM logo, ibm.com, Aspera®, Bluemix, Blueworks Live, CICS,
Clearcase, Cognos®, DOORS®, Emptoris®, Enterprise Document Management
System™, FASP®, FileNet®, Global Business Services®,
Global Technology Services®, IBM ExperienceOne™, IBM SmartCloud®, IBM
Social Business®, Information on Demand, ILOG, Maximo®,
MQIntegrator®, MQSeries®, Netcool®, OMEGAMON, OpenPower,
PureAnalytics™, PureApplication®, pureCluster™, PureCoverage®, PureData®,
PureExperience®, PureFlex®, pureQuery®, pureScale®, PureSystems®, QRadar®,
Rational®, Rhapsody®, Smarter Commerce®, SoDA, SPSS, Sterling Commerce®,
StoredIQ, Tealeaf®, Tivoli® Trusteer®, Unica®, urban{code}®, Watson,
WebSphere®, Worklight®, X-Force® and System z® Z/OS, are trademarks of
International Business Machines Corporation, registered in many
jurisdictions worldwide. Other product and service names might
be trademarks of IBM or other companies. A current list of IBM trademarks is
available on the Web at "Copyright and trademark information" at:
www.ibm.com/legal/copytrade.shtml.
Ad

More Related Content

What's hot (20)

Everything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data WarehouseEverything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data Warehouse
mark madsen
 
Bigdata analytics
Bigdata analyticsBigdata analytics
Bigdata analytics
SwarnaLatha177
 
Cloud computing & big data for service innovation & learning
Cloud computing & big data for service innovation & learningCloud computing & big data for service innovation & learning
Cloud computing & big data for service innovation & learning
2016
 
Wikipedia (DBpedia): Crowdsourced Data Curation
Wikipedia (DBpedia): Crowdsourced Data CurationWikipedia (DBpedia): Crowdsourced Data Curation
Wikipedia (DBpedia): Crowdsourced Data Curation
Edward Curry
 
Dell hans timmerman v1.1
Dell hans timmerman v1.1Dell hans timmerman v1.1
Dell hans timmerman v1.1
BigDataExpo
 
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
Santiago Cabrera-Naranjo
 
Govern and Protect Your End User Information
Govern and Protect Your End User InformationGovern and Protect Your End User Information
Govern and Protect Your End User Information
Denodo
 
CRTC Cloud- Scott Sadler
CRTC Cloud- Scott SadlerCRTC Cloud- Scott Sadler
CRTC Cloud- Scott Sadler
KrisValerio
 
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Data Con LA
 
Big Data World Forum
Big Data World ForumBig Data World Forum
Big Data World Forum
bigdatawf
 
Smarter Planet, Smart Work
Smarter Planet, Smart WorkSmarter Planet, Smart Work
Smarter Planet, Smart Work
IBMAsean
 
Bigdataissueschallengestoolsngoodpractices 141130054740-conversion-gate01
Bigdataissueschallengestoolsngoodpractices 141130054740-conversion-gate01Bigdataissueschallengestoolsngoodpractices 141130054740-conversion-gate01
Bigdataissueschallengestoolsngoodpractices 141130054740-conversion-gate01
Soujanya V
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
Denodo
 
Dealing with Semantic Heterogeneity in Real-Time Information
Dealing with Semantic Heterogeneity in Real-Time InformationDealing with Semantic Heterogeneity in Real-Time Information
Dealing with Semantic Heterogeneity in Real-Time Information
Edward Curry
 
Approximate Semantic Matching of Heterogeneous Events
Approximate Semantic Matching of Heterogeneous EventsApproximate Semantic Matching of Heterogeneous Events
Approximate Semantic Matching of Heterogeneous Events
Edward Curry
 
Keynote Panel: Data Fabric - Why Should Organizations implement a Logical and...
Keynote Panel: Data Fabric - Why Should Organizations implement a Logical and...Keynote Panel: Data Fabric - Why Should Organizations implement a Logical and...
Keynote Panel: Data Fabric - Why Should Organizations implement a Logical and...
Denodo
 
BISMART Bihealth. Microsoft Business Intelligence in health
BISMART Bihealth. Microsoft Business Intelligence in healthBISMART Bihealth. Microsoft Business Intelligence in health
BISMART Bihealth. Microsoft Business Intelligence in health
albertisern
 
Big data analytics, research report
Big data analytics, research reportBig data analytics, research report
Big data analytics, research report
JULIO GONZALEZ SANZ
 
IoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDILIoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDIL
Till Riedel
 
Research paper on big data and hadoop
Research paper on big data and hadoopResearch paper on big data and hadoop
Research paper on big data and hadoop
Shree M.L.Kakadiya MCA mahila college, Amreli
 
Everything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data WarehouseEverything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data Warehouse
mark madsen
 
Cloud computing & big data for service innovation & learning
Cloud computing & big data for service innovation & learningCloud computing & big data for service innovation & learning
Cloud computing & big data for service innovation & learning
2016
 
Wikipedia (DBpedia): Crowdsourced Data Curation
Wikipedia (DBpedia): Crowdsourced Data CurationWikipedia (DBpedia): Crowdsourced Data Curation
Wikipedia (DBpedia): Crowdsourced Data Curation
Edward Curry
 
Dell hans timmerman v1.1
Dell hans timmerman v1.1Dell hans timmerman v1.1
Dell hans timmerman v1.1
BigDataExpo
 
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
Santiago Cabrera-Naranjo
 
Govern and Protect Your End User Information
Govern and Protect Your End User InformationGovern and Protect Your End User Information
Govern and Protect Your End User Information
Denodo
 
CRTC Cloud- Scott Sadler
CRTC Cloud- Scott SadlerCRTC Cloud- Scott Sadler
CRTC Cloud- Scott Sadler
KrisValerio
 
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Modernizing the Analytics and Data Science Lifecycle for the Scalable Enterpr...
Data Con LA
 
Big Data World Forum
Big Data World ForumBig Data World Forum
Big Data World Forum
bigdatawf
 
Smarter Planet, Smart Work
Smarter Planet, Smart WorkSmarter Planet, Smart Work
Smarter Planet, Smart Work
IBMAsean
 
Bigdataissueschallengestoolsngoodpractices 141130054740-conversion-gate01
Bigdataissueschallengestoolsngoodpractices 141130054740-conversion-gate01Bigdataissueschallengestoolsngoodpractices 141130054740-conversion-gate01
Bigdataissueschallengestoolsngoodpractices 141130054740-conversion-gate01
Soujanya V
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
Denodo
 
Dealing with Semantic Heterogeneity in Real-Time Information
Dealing with Semantic Heterogeneity in Real-Time InformationDealing with Semantic Heterogeneity in Real-Time Information
Dealing with Semantic Heterogeneity in Real-Time Information
Edward Curry
 
Approximate Semantic Matching of Heterogeneous Events
Approximate Semantic Matching of Heterogeneous EventsApproximate Semantic Matching of Heterogeneous Events
Approximate Semantic Matching of Heterogeneous Events
Edward Curry
 
Keynote Panel: Data Fabric - Why Should Organizations implement a Logical and...
Keynote Panel: Data Fabric - Why Should Organizations implement a Logical and...Keynote Panel: Data Fabric - Why Should Organizations implement a Logical and...
Keynote Panel: Data Fabric - Why Should Organizations implement a Logical and...
Denodo
 
BISMART Bihealth. Microsoft Business Intelligence in health
BISMART Bihealth. Microsoft Business Intelligence in healthBISMART Bihealth. Microsoft Business Intelligence in health
BISMART Bihealth. Microsoft Business Intelligence in health
albertisern
 
Big data analytics, research report
Big data analytics, research reportBig data analytics, research report
Big data analytics, research report
JULIO GONZALEZ SANZ
 
IoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDILIoT 2014 Value Creation Workshop: SDIL
IoT 2014 Value Creation Workshop: SDIL
Till Riedel
 

Similar to Building Data Science Ecosystems for Smart Cities and Smart Commerce (20)

An Ecosystem Approach to Data Science
An Ecosystem Approach to Data ScienceAn Ecosystem Approach to Data Science
An Ecosystem Approach to Data Science
Alex Liu
 
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
Alex Liu
 
Libera la potenza del Machine Learning
Libera la potenza del Machine LearningLibera la potenza del Machine Learning
Libera la potenza del Machine Learning
Jürgen Ambrosi
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
Karan Sachdeva
 
Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19
Prof.Balakrishnan S
 
A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)
Denodo
 
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data VirtualizationDAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
Denodo
 
Architecting Disruption - Rethink business, technology and data to build a pl...
Architecting Disruption - Rethink business, technology and data to build a pl...Architecting Disruption - Rethink business, technology and data to build a pl...
Architecting Disruption - Rethink business, technology and data to build a pl...
ssuser373742
 
A journey to faster, repeatable data commercialization
A journey to faster, repeatable data commercializationA journey to faster, repeatable data commercialization
A journey to faster, repeatable data commercialization
Institute of Contemporary Sciences
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
Denodo
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Matt Stubbs
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Matt Stubbs
 
The new dominant companies are running on data
The new dominant companies are running on data The new dominant companies are running on data
The new dominant companies are running on data
SnapLogic
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Denodo
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
Denodo
 
Qo Introduction V2
Qo Introduction V2Qo Introduction V2
Qo Introduction V2
Joe_F
 
Machine Learning for z/OS
Machine Learning for z/OSMachine Learning for z/OS
Machine Learning for z/OS
Cuneyt Goksu
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
Denodo
 
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Denodo
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Nathan Bijnens
 
An Ecosystem Approach to Data Science
An Ecosystem Approach to Data ScienceAn Ecosystem Approach to Data Science
An Ecosystem Approach to Data Science
Alex Liu
 
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
Alex Liu
 
Libera la potenza del Machine Learning
Libera la potenza del Machine LearningLibera la potenza del Machine Learning
Libera la potenza del Machine Learning
Jürgen Ambrosi
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
Karan Sachdeva
 
Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19
Prof.Balakrishnan S
 
A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)A Logical Architecture is Always a Flexible Architecture (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)
Denodo
 
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data VirtualizationDAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
Denodo
 
Architecting Disruption - Rethink business, technology and data to build a pl...
Architecting Disruption - Rethink business, technology and data to build a pl...Architecting Disruption - Rethink business, technology and data to build a pl...
Architecting Disruption - Rethink business, technology and data to build a pl...
ssuser373742
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
Denodo
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Matt Stubbs
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Matt Stubbs
 
The new dominant companies are running on data
The new dominant companies are running on data The new dominant companies are running on data
The new dominant companies are running on data
SnapLogic
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Denodo
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
Denodo
 
Qo Introduction V2
Qo Introduction V2Qo Introduction V2
Qo Introduction V2
Joe_F
 
Machine Learning for z/OS
Machine Learning for z/OSMachine Learning for z/OS
Machine Learning for z/OS
Cuneyt Goksu
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
Denodo
 
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Data Fabric - Why Should Organizations Implement a Logical and Not a Physical...
Denodo
 
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)Data Mesh in Azure using Cloud Scale Analytics (WAF)
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Nathan Bijnens
 
Ad

Recently uploaded (20)

Conic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptxConic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptx
taiwanesechetan
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptxmd-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
fatimalazaar2004
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
C++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptxC++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptx
aquibnoor22079
 
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your CompetitorsAI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
Contify
 
How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136How to join illuminati Agent in uganda call+256776963507/0741506136
How to join illuminati Agent in uganda call+256776963507/0741506136
illuminati Agent uganda call+256776963507/0741506136
 
VKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptxVKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptx
Vinod Srivastava
 
Developing Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response ApplicationsDeveloping Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response Applications
VICTOR MAESTRE RAMIREZ
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
FPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptxFPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptx
ssuser4ef83d
 
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptxPerencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
PareaRusan
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag
fardin123rahman07
 
03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia03 Daniel 2-notes.ppt seminario escatologia
03 Daniel 2-notes.ppt seminario escatologia
Alexander Romero Arosquipa
 
Conic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptxConic Sectionfaggavahabaayhahahahahs.pptx
Conic Sectionfaggavahabaayhahahahahs.pptx
taiwanesechetan
 
Principles of information security Chapter 5.ppt
Principles of information security Chapter 5.pptPrinciples of information security Chapter 5.ppt
Principles of information security Chapter 5.ppt
EstherBaguma
 
Geometry maths presentation for begginers
Geometry maths presentation for begginersGeometry maths presentation for begginers
Geometry maths presentation for begginers
zrjacob283
 
DPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdfDPR_Expert_Recruitment_notice_Revised.pdf
DPR_Expert_Recruitment_notice_Revised.pdf
inmishra17121973
 
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Day 1 - Lab 1 Reconnaissance Scanning with NMAP, Vulnerability Assessment wit...
Abodahab
 
Flip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptxFlip flop presenation-Presented By Mubahir khan.pptx
Flip flop presenation-Presented By Mubahir khan.pptx
mubashirkhan45461
 
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptxmd-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
md-presentHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHation.pptx
fatimalazaar2004
 
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
Safety Innovation in Mt. Vernon A Westchester County Model for New Rochelle a...
James Francis Paradigm Asset Management
 
chapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptxchapter 4 Variability statistical research .pptx
chapter 4 Variability statistical research .pptx
justinebandajbn
 
C++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptxC++_OOPs_DSA1_Presentation_Template.pptx
C++_OOPs_DSA1_Presentation_Template.pptx
aquibnoor22079
 
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your CompetitorsAI Competitor Analysis: How to Monitor and Outperform Your Competitors
AI Competitor Analysis: How to Monitor and Outperform Your Competitors
Contify
 
VKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptxVKS-Python-FIe Handling text CSV Binary.pptx
VKS-Python-FIe Handling text CSV Binary.pptx
Vinod Srivastava
 
Developing Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response ApplicationsDeveloping Security Orchestration, Automation, and Response Applications
Developing Security Orchestration, Automation, and Response Applications
VICTOR MAESTRE RAMIREZ
 
Deloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit contextDeloitte Analytics - Applying Process Mining in an audit context
Deloitte Analytics - Applying Process Mining in an audit context
Process mining Evangelist
 
FPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptxFPET_Implementation_2_MA to 360 Engage Direct.pptx
FPET_Implementation_2_MA to 360 Engage Direct.pptx
ssuser4ef83d
 
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptxPerencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
Perencanaan Pengendalian-Proyek-Konstruksi-MS-PROJECT.pptx
PareaRusan
 
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdfIAS-slides2-ia-aaaaaaaaaaain-business.pdf
IAS-slides2-ia-aaaaaaaaaaain-business.pdf
mcgardenlevi9
 
183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag183409-christina-rossetti.pdfdsfsdasggsag
183409-christina-rossetti.pdfdsfsdasggsag
fardin123rahman07
 
Ad

Building Data Science Ecosystems for Smart Cities and Smart Commerce

  • 1. © 2017 IBM Corporation1 Building Data Science Ecosystems for Smart City and Smart Commerce Dr. Alex Liu Chief Data Scientist Analytics Services
  • 2. © 2017 IBM Corporation2 Alex Liu Introduction ▪ Chief Data Scientist – Analytics Services at IBM ▪ A Data Scientist Thought Leader ▪ Chief Data Scientist for a few corporations before joined IBM ▪ Taught advanced data analytics for the University of South California and the University of California at Irvine ▪ Consulted for the United Nations, Ingram Micro … ▪ M.S. and Ph.D. from Stanford University
  • 3. © 2017 IBM Corporation3 Data Science: Turning Data into VALUE With Models Data Science produces insights/values via a complicated processes a big set of tools 3 BigInsights (HDFS) Cloudant (DBaaS) dashDB (Analytics) Swift (Object Storage ) SQDB (Managed DB2)
  • 4. © 2017 IBM Corporation4 Data Science projects return very valuable results but a lot failed ▪Netflix, for example, integrates data science into each part of their business; they estimate a billion dollars in incremental value from their personalization and recommendation alone. ▪Knight Capital Group, for instance, lost $440 million in 45 minutes after a mistake in updating a model (New York times). ▪Gartner estimated that 60% of big data projects fail in 2016, and in 2017.
  • 5. © 2017 IBM Corporation5 Reasons why data science projects failed ▪Wrong data ▪Wrong question ▪Week stakeholders commitment ▪Lack of diverse expertise per CIO DIVE ▪Many research leads to conclude mismatch among question, data, model, researchers, tools … as main reason ▪Due to bad coordination of parts involved
  • 6. © 2017 IBM Corporation6 WHY - Data Science very complicated Very Complicated enough just for Model Building Stage • More than 50 different models: SVM, Neural Net, Decision Trees/Forests, Naïve Bayes, Regression, SMO, k-nearest Neighbor, Clustering, Rules, … • Combinatorially explosive number of parameter choices per algorithm: kernel type, pruning strategy, number of trees in a forest, learning rate, … • Wide variation in performance across different algorithm implementations (e.g., SPSS vs Python vs WEKA vs SPARK …) • User-Defined algorithms • Substantial cost in user and compute time • User spends time on trying new combinations and parameters • Computational cost for training a single SVM can exceed 24h • Selection commonly based on data scientist bias • Each additional pipeline stage increases complexity dramatically!
  • 7. © 2017 IBM Corporation7 WHY - challenges for managing data scientists ▪High turn over rate of data scientists (average tenure 18 months) ▪Ever developing new technologies to master ▪Lack of training for data scientists 7 Data Prep Modeling DeploymentEvaluation Exploration
  • 8. © 2017 IBM Corporation A data science ecosystem approach A data science ECOSYSTEM has three basic elements 1) Data portal, 2) Data Science platform, 3) Data Science community
  • 9. © 2017 IBM Corporation9 The defining characteristics of an ecosystem - mutuality & orchestration Markets comprise entities that operate out of individual self-interest Ecosystems comprise entities that operate out of orchestrated, mutual shared-interest A set of individuals or organizations who exchange products or services within an environment governed by the laws of supply and demand A set of individuals or organizations who formally or informally operate together to produce something of greater value for the mutual benefit of the ecosystem as a whole Ecosystems exists because operating in an orchestrated environment, participants can deliver more value within the ecosystem acting together than acting alone
  • 10. © 2017 IBM Corporation10 ▪ Value capture reflects a networked, dynamic, everyone-to-everyone process of exchange ▪ Ecosystems produce more value as a whole, than the sum of the individual participants acting independently Value creation in a value chain ▪ Value creation is incremental as organizations cover costs plus some return on assets ▪ Value capture reflects an additive, sequential process of exchange Value creation in traditional data science tends to be linear; value creation in ecosystems tends to be networked … Cost plus return Cost plus Cost plus Cost plus Cost plus Value creation in an ecosystem
  • 11. © 2017 IBM Corporation11 Ecosystems can yield substantial benefits New capabilities Improved access Improved Agility Increase Success Ratio of Data Science Projects Embrace ecosystems’ strategic potential Ecosystems enable organizations to access critical capabilities that they would otherwise have difficulty obtaining Ecosystems support greater access to new or different resources such as new talents, new tools, new data sets Ecosystems support quick creation of new types of products, with different combinations of organizations and assets
  • 12. © 2017 IBM Corporation12 1) An ecosystem for city data science 101 010 101 Platform ~ IBM DSX Events Data Social Media Analytical Insights for Smart Cities Connecting all the data scientists from a DS community Applications Optimizing Operations Solutions IoT Data
  • 13. © 2017 IBM Corporation13 RMDS Communities at IBM Glendale ▪Pasadena/Glendale Meetup Community ▪Local face to face community – more than 1100 members ▪https://www.meetup.com/RMDS_LA/ ▪https://www.linkedin.com/groups/1895501 has 29K participants 2 former CDOs and the current CDO are members, also presented their work here. Aim to create an environment for utilizing big data analytics to make smart cities
  • 14. © 2017 IBM Corporation14 105,000+ collections 349 citizen apps 500,000 data resources 175 agencies 450 APIs Source: City of LA Mayor’s Tech Advisor Presentation at RMDS Meetup. Data.gov citizen data science ecosystem with open data
  • 15. © 2017 IBM Corporation15 IBM Data Science Experience Community Open Source IBM Added Value Powered by IBM Watson Data Platform • Find tutorials and datasets • Connect with Data Scientists • Ask questions • Read articles and papers • Fork and share projects • Code in Scala/Python/R/SQL • Jupyter Notebooks • RStudio IDE and Shiny • Apache Spark • Your favorite libraries • Watson Machine Learning • SPSS Modeler Canvas • Advanced Visualizations • Projects and Version Control • Managed Spark Service * Closed beta a data science platform – the IBM Data Science Experience - now IBM Watson Studio Local https://datascience.ibm.com/
  • 17. © 2017 IBM Corporation17 Overview IBM Data Science Experience (DSX – now Watson Studio Local) is a social environment where you can collaborate to solve data challenges with the best tools and latest expertise. ▪ Community ▪ Projects ▪ Notebooks ▪ Data Asset ▪ Model Management ▪ Admin console
  • 18. Forrester Research Ranks IBM as a Leader in Multimodal Predictive Analytics and Machine Learning IBM puts AI to work. IBM Watson is a vast umbrella of technologies and solutions, one of which is Watson Studio, a PAML solution. Watson Studio was designed from the ground up to aesthetically blend SPSS-inspired workflow capabilities with open source machine learning libraries and notebook- based interfaces. It is designed for all collaborators — business stakeholders, data engineers, data scientists, and app developers — who are key to making machine learning models surface into production applications. Watson Studio offers easy integrated access to IBM Cloud pretrained machine learning models such as Visual Recognition, Watson Natural Language Classifier, and many others. It is a perfectly balanced PAML solution for enterprise data science teams that want the productivity of visual tools and access to the latest open source via a notebook-based coding interface. Source: “The Forrester WaveTM: Multimodal Predictive Analytics And Machine Learning Solutions, Q3 2018”, Forrester Research, September 2018
  • 19. Los Angeles City as the Best Using Data in the United States 19
  • 20. EX2 – IBM Weather Data 1km Visible (GOES-R will be even better) http://www.ibm.com/weather
  • 21. EX: Weather Data Serving Retails Weather Data + WATSON Studio + RMDS Community A data science ecosystem with weather data 101 010 101 Platform ~ IBM DSX Weather Data Transaction Analytical Insights for Smart Commerces Connecting all the data scientists from a DS community Applications Optimizing Operations Solutions IoT Data
  • 22. Five steps for building successful data science ecosystems Know your ecosystem and identify new opportunities Recognize your capabilities and identify your gaps Identify ecosystem value and how you might capture it Make connections in pursuit of your objectives Measure your success and decide next steps ▪ Understand your value chain ▪ Understand your ecosystem ▪ Identify and prioritize ecosystem value pools ▪ Understand your capabilities relative to your business model ▪ Determine what to invest in and what to partner for ▪ Choose the right partner ▪ Engage in the right way ▪ Define measurement model and collect data ▪ Refine business model and ecosystem partnerships 1 2 34 5
  • 23. IBM Analytics University 2018 Notices and disclaimers Copyright © 2018 by International Business Machines Corporation (IBM). No part of this document may be reproduced or transmitted in any form without written permission from IBM. U.S. Government Users Restricted Rights — use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM. Information in these presentations (including information relating to products that have not yet been announced by IBM) has been reviewed for accuracy as of the date of initial publication and could include unintentional technical or typographical errors. IBM shall have no responsibility to update this information. This document is distributed “as is” without any warranty, either express or implied. In no event shall IBM be liable for any damage arising from the use of this information, including but not limited to, loss of data, business interruption, loss of profit or loss of opportunity. IBM products and services are warranted according to the terms and conditions of the agreements under which they are provided. IBM products are manufactured from new parts or new and used parts. In some cases, a product may not be new and may have been previously installed. Regardless, our warranty terms apply.” Any statements regarding IBM's future direction, intent or product plans are subject to change or withdrawal without notice. Performance data contained herein was generally obtained in a controlled, isolated environments. Customer examples are presented as illustrations of how those customers have used IBM products and the results they may have achieved. Actual performance, cost, savings or other results in other operating environments may vary. References in this document to IBM products, programs, or services does not imply that IBM intends to make such products, programs or services available in all countries in which IBM operates or does business. Workshops, sessions and associated materials may have been prepared by independent session speakers, and do not necessarily reflect the views of IBM. All materials and discussions are provided for informational purposes only, and are neither intended to, nor shall constitute legal or other guidance or advice to any individual participant or their specific situation. It is the customer’s responsibility to insure its own compliance with legal requirements and to obtain advice of competent legal counsel as to the identification and interpretation of any relevant laws and regulatory requirements that may affect the customer’s business and any actions the customer may need to take to comply with such laws. IBM does not provide legal advice or represent or warrant that its services or products will ensure that the customer is in compliance with any law.
  • 24. IBM Analytics University 2018 Notices and disclaimers continued Information concerning non-IBM products was obtained from the suppliers of those products, their published announcements or other publicly available sources. IBM has not tested those products in connection with this publication and cannot confirm the accuracy of performance, compatibility or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. IBM does not warrant the quality of any third-party products, or the ability of any such third-party products to interoperate with IBM’s products. IBM expressly disclaims all warranties, expressed or implied, including but not limited to, the implied warranties of merchantability and fitness for a particular, purpose. The provision of the information contained herein is not intended to, and does not, grant any right or license under any IBM patents, copyrights, trademarks or other intellectual property right. IBM, the IBM logo, ibm.com, Aspera®, Bluemix, Blueworks Live, CICS, Clearcase, Cognos®, DOORS®, Emptoris®, Enterprise Document Management System™, FASP®, FileNet®, Global Business Services®, Global Technology Services®, IBM ExperienceOne™, IBM SmartCloud®, IBM Social Business®, Information on Demand, ILOG, Maximo®, MQIntegrator®, MQSeries®, Netcool®, OMEGAMON, OpenPower, PureAnalytics™, PureApplication®, pureCluster™, PureCoverage®, PureData®, PureExperience®, PureFlex®, pureQuery®, pureScale®, PureSystems®, QRadar®, Rational®, Rhapsody®, Smarter Commerce®, SoDA, SPSS, Sterling Commerce®, StoredIQ, Tealeaf®, Tivoli® Trusteer®, Unica®, urban{code}®, Watson, WebSphere®, Worklight®, X-Force® and System z® Z/OS, are trademarks of International Business Machines Corporation, registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at: www.ibm.com/legal/copytrade.shtml.