Data Market Evolution,
a future shaped by FAIR
on Thurs, 20th Feb 2020 at 16:00 GMT
Hosted by: Ian Harrow, Pistoia Alliance
Speaker: Matan Burstin, Clarivate Analytics
FAIR/OM projects Community of Interest webinar series
This webinar is being recorded
Audience Q&A
Please use the questions box
©PistoiaAlliance
Matan Burstin
• Matan Burstin is Director, R&D Data Science and Informatics at
Clarivate Analytics. As Data management practice leader, he
leads a global multi-disciplinary team of exceptional
consultants, servicing the Life sciences industry and providing
solutions which combine knowledge management, software
development and machine learning.
• Matan has 10 years of leadership experience, guiding R&D,
engineering, product and operations teams.
• In his previous roles, he led the data operations department
and data products management at a leading global labour
market analytics company, managed an R&D team at Boston
Children's Hospital under Computational Health Informatics
Program (CHIP) and developed LabCorp’s first Next
Generation Sequencing line of clinical products, supporting
R&D bioinformatics and informatics.
©PistoiaAlliance
FAIRification
https://libereurope.eu/wp-content/uploads/2017/12/LIBER-FAIR-Data.pdf
https://www.nature.com/articles/sdata201618
©PistoiaAlliance
Challenges: identifiers
BioIT 2019 - Database of host-microbial interactions (DoMI)
©PistoiaAlliance
Challenges: identifiers
https://david.ncifcrf.gov/helps/knowledgebase/DAVID_gene.html
©PistoiaAlliance
Challenges: identifiers
https://www.uniprot.org/help/different_protein_gene_names
©PistoiaAlliance
Challenges: identifiers
https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-12-S4-S4
http://annovar.openbioinformatics.org/en/latest/articles/dbSNP/
©PistoiaAlliance
Challenges: identifiers
©PistoiaAlliance
Challenges: accessibility and form
http://discover.clarivate.com/CancerAIChallenge
©PistoiaAlliance
Data market evolution
Example: The Music industry
• In 2000 the recorded music
industry was worth $40b
• In 2010 it is worth $25b (- 40%)
This is not the first time the industry
goes through changes:
• To 1500: music is mostly oral and
social. It was free!
• 1500 – 1900: music becomes
written and complex. Tied to a
medium, it becomes a
commodity for sale. The
aristocracy and the church pay for
it (live pieces commissioned)
• 1900 – today: music becomes
recorded. Accessible to the
entire population. New
intermediaries appear: music
labels.
True Value
• The song, not the album
• Single source of music easily accessible on demand (iTunes, Spotify, p2p)
• Ease of use (the new turntable is itunes) and discovery (Pandora)
• Multiplatform: Follow the user on all devices (iTunes, iPhone, car, etc.)
• New curators: Sharing (Facebook/Spotify, Apple Ping)
• Recorded music: artist keep 10-15%
• Live music: artist keep 60-70%
Content< Content Distribution
©PistoiaAlliance
Data market evolution
https://www.pewresearch.org/internet/2010/03/01/part-2-how-people-use-the-news-and-feel-about-the-news/
Content< Context
Example: News
©PistoiaAlliance
Data market evolution
https://mikehohnen.com/the-big-shift-products-to-services/
https://fsd.servicemax.com/2017/03/02/how-industrial-companies-are-making-the-shift-to-services/
https://www.forbes.com/sites/strategyand/2016/11/01/the-big-shift-to-software-and-services-innovation/#b169db537071
Data product < Data Service
©PistoiaAlliance
Going CaaS
©PistoiaAlliance
Going CaaS by FAIR
Findable Accessible Interoperable Reusable
©PistoiaAlliance
Going CaaS by FAIR @Clarivate
©PistoiaAlliance
Future of data
https://www.itransition.com/blog/the-future-of-big-data
©PistoiaAlliance
Future of data
©PistoiaAlliance
Future of data
• Rise of “actionable data” – the missing link between big data and business value
– Real time data stream (30% of all data by 2025)
– Processed, structured data
– Personalized
– Ready for analysis
“The overwhelming size of big data may create additional challenges in the future,
including data privacy and security risks, shortage of data professionals, and
difficulties in data storage and processing.”
https://www.pewresearch.org/internet/2010/03/01/part-2-how-people-use-the-news-and-feel-about-the-news/
https://data-economy.com/the-make-of-a-15bn-mega-business-ntt-ltds-ceo-jason-goodall-on-bringing-31-companies-together/
https://www.marketwatch.com/press-release/data-analytics-market-2019-by-industry-size-estimation-industry-share-future-demand-dynamics-drivers-research-methodology-by-2023-2019-05-03
©PistoiaAlliance
Audience Q&A
Please use the questions box
Knowledge graphs and semantic models
for drug discovery and healthcare
Join us for the next FAIR/OM CoI webinar:
Speaker: Ilaria Maresi, The Hyve
Thursday, 14th May at 16:00 GMT
info@pistoiaalliance.org @pistoiaalliance www.pistoiaalliance.org
Thanks for your attention

Data market evolution, a future shaped by FAIR