0% found this document useful (0 votes)

20 views18 pages

Swetabh Pathak

The document discusses the transition from AI models to deployed AI in life sciences, highlighting the importance of relevant use cases and high-quality data for successful implementation. It identifies common barriers to AI adoption, such as lack of expertise and training data, and presents case studies demonstrating how Elucidata's solutions can accelerate AI deployment and improve R&D productivity. The emphasis is on the value of data over tools, advocating for data-centric approaches to enhance AI capabilities in drug discovery and development.

Uploaded by

pal.spandan.99

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views18 pages

Swetabh Pathak

Uploaded by

pal.spandan.99

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

Navigating the Transition:

From Models to Deployed AI

Swetabh Pathak
CTO & Co-Founder, Elucidata
Emerging Leaders for AI in Life Sciences R&D
..with Differentiated Technology & Expertise

#1 30+ customer proof points across

discovery, development and trials

#2 Cloud-first data platform and ML that can

seamlessly complement your ecosystem.

#3 With a team that can quickly assimilate

domain expertise into high impact services.
AI Shows Promise, But Opinions Vary..

AI is currently only
It's too early to used for solving AI is a new hype -
AI is the future - it Al is somewhat
recognize the true simple investors buy into
will help us explore valuable.
impact of Al - we will problems. The the
areas that have never In our work, AI has
only be able to see InSilico desire to be hip, cash
been explored before. helped make a lot of
the true impact once screen would only is raised, Pharma cos
One day AI will help molecules
we can see the have do deals to be in the
us understand biology synthesizable faster &
productivity over had a 4% failure rate, news. There's lots of
so deeply that we can cheaper.
time. even without AI. noise in this field but
form new scientific
it has not been
laws and drug design
proven yet.
principles.
Computational
Head of Data and Translational Deputy Director,
Biologist, Chief Executive,
Platform Strategy, Scientist, Academia Global Health, Non-
Research Institute Data Consortium
‘AI-first’ biotech profit Organisation

Innovators Early Adopters Early Majority Late Majority Laggards

(2.5%) (13.5%) (34%) (34%) (16.5%)

Source:BCG Report on AI, 2023

Biggest Barriers to Broader AI Adoption

Limiting Belief 1 Limiting Belief 2 Limiting Belief 3

‘We don’t have ‘We just couldn’t ‘We don’t have

a relevant use get enough the expertise or
case for AI’ training data to the millions of
solve the dollars needed
problem we’re to build a team’
working on’

Translational Leader, Computational CSO, Early Stage

Mid-Stage Biologist, Research Therapeutics
Pharmaceutical Institute
Company
Limiting Belief 1

“We don’t have a relevant use

case for AI”
Good AI use cases are not rare events

Access to ‘high quality and relevant’ data

Human supervision is available & possible

What do good
use-cases look Biological rules can be framed

like?
Hypothesis generation rather than testing

Explainability is not necessary

Right use case is the biggest predictor of success

‘An Early Stage Therapeutics company wanted to develop and train Classifier Models
to segment patients in AML’

2+ Targets
Identified
and
Validated

10k Public Multi- Patient Prioritized list of

Segmented Cohorts
omics samples Stratification Model gene targets

AI was used to assist domain experts, to solve a well-defined problem with

clear outcomes: ‘Segment the patient cohorts in AML as per their prognosis’
LLMs were Deployed to Advance Target Identification

8
Limiting Belief 2

“We just couldn’t get enough

training data to solve the
problem we’re working on”
Make the Shift to Data-Centric AI

Fine-tune existing models with high quality and relevant data. Especially useful for predicting long-
tail problems with limited data points (<10,000)

Use Cases

Cell Type
Annotation
Pre-training Data

Available Single Cell Fine-tuned with task Dataset Integration

Gene Expression Fed into Foundational
specific, high quality
profiles (33 million) Model
data Gene Perturbation
Response
Prediction

Gene Regulatory
Network Inference
How do we define High Quality Data?

● Up-to-date with patient information, domain specific

● Metadata annotations, processing & ontologies are custom to
Highest
the use case / task
Quality Data
● QC-ed by experts

● Annotated with critical metadata

● Relevant to the biological domain
Context Specific,
Relevant (scGPT,
GenePT)

● Ingested & transformed into Machine readable

formats Machine Curated from Public Sources
● Structured into tabular files (CSV, JSON) (BERT, GPT)

11
How well does scGPT perform after Fine-Tuning with High Quality Data?

scGPT can perform reference based cell type annotation in a zero shot setting.
However, fine-tuning with high quality data improves model performance by 20% (avg)

Experimental Design

● Training Dataset: 25k immune

cells from HCA to fine-tune

● Testing Dataset: 13k immune

cells from Tabula Sapiens

● All datasets were cleaned and

linked with Elucidata’s
Harmonization Engine.
Elucidata’s Harmonization Engine
Cleans the Data you Need

50 Million
10X FASTER Samples harmonized to support use cases
curation with in drug discovery, development & trials
LLM powered
annotation tools Techn
ology

25+ Data Types

Supported including RWE, Omics and
Clinical
Proces
People
s

100+ Experts
30+
99.99% Accurate
In curation, NLP, data
Data delivered with Data Pipelines built and maintained on
engineering, & Elucidata infrastructure to process data
robust QA/QC
bioinformatics
Limiting Belief 3

“We don’t have the expertise

or millions needed to build a
team”
Case Study: Building Production-ready AMDET Pipelines

SCENARIO

‘This mid-cap pharmaceutical company wanted to develop an end-to-end ADMET prediction pipeline
that would support 5 lead development programs across neurology and oncology’.

THEIR NEEDS
● Collect and prepare all the data generated across assays in a meaningful and scalable way.
● Productionize existing models on the cloud, so that they can run at scale.
● Develop an ML-ops infrastructure to manage the data & models across stakeholders, multiple
sites and different types of users.

Doing this in-house needs a team of 7 FTEs and cloud resources.

Costs drum upto ~$2 Million and projects could take 1+ years to kick start.
Scaling up AI in production, Set up in 1.5 Months

Compound Screening & Evaluation Accelerated by 2X with Production-Ready ADMET pipeline

Polly

Load & Split Data, Feature Selection

Train & Fine-Tune Models

Track Experiments, Select Models,

Version

Relevant Public Harmonization Validated Model to Dashboard

and In-house Predict endpoints Predicted Endpoints
Engine
Datasets Ingest, process, annotate

Add inputs & Run Workflow

ADMET Model Deployment Workflow

Significant R&D Productivity Unlocked

Projects could be kickstarted within a month, at 4X Lower Costs.

Productivity Areas Improvement with Elucidata Rationale

● Dedicated team to perform searches

Data Acquisition 4X Faster
in Public Databases

Data Preparation, ● LLM-powered Harmonization reduced

4X Faster
Annotation, QC manual effort in data preparation.

● Key bottleneck steps in the process

Model Development /
Reduced by 30% (ingestion, cleaning, ML Model
Deployment Cycle
Deployment, Versioning) automated.
Any Questions?

Reach out at elucidata.io to know more!

"The value is in the data, it is not in
the tools. That is the one thing, it’s a
bit of a hobby horse for me. One
Polly Platform &
Services thing always point to in these
discussions around data, don’t
underestimate the amount of time
ML-ready and value in doing what is really
Data Access often difficult and not so rewarding
directly work, like cleaning data sets
isn’t always fun, but it is often the
most valuable thing you can do."

ML
Initiatives Dr. Jeffrey Reid,
Regeneron's Chief Data Officer

Chapter 3 Supervision of Instruction
100% (2)
Chapter 3 Supervision of Instruction
22 pages
Industry Analysis Report On FMCG Sector
100% (7)
Industry Analysis Report On FMCG Sector
78 pages
Kerry H. Robinson - Innocence, Knowledge and The Construction of Childhood - The Contradictory Nature of Sexuality and Censorship in Children's Contemporary Lives-Routledge (2013)
No ratings yet
Kerry H. Robinson - Innocence, Knowledge and The Construction of Childhood - The Contradictory Nature of Sexuality and Censorship in Children's Contemporary Lives-Routledge (2013)
183 pages
Product Marketing Assignment (1)
No ratings yet
Product Marketing Assignment (1)
3 pages
Making Most of AI in Medtech
No ratings yet
Making Most of AI in Medtech
7 pages
Deck3 .pdf
No ratings yet
Deck3 .pdf
18 pages
Ilovepdf Merged-2
No ratings yet
Ilovepdf Merged-2
77 pages
Ilovepdf Merged 7
No ratings yet
Ilovepdf Merged 7
356 pages
Generative AI in The Pharmaceutical Industry - McKinsey
No ratings yet
Generative AI in The Pharmaceutical Industry - McKinsey
24 pages
Generative Ai in The Pharmaceutical Industry Moving From Hype To Reality
No ratings yet
Generative Ai in The Pharmaceutical Industry Moving From Hype To Reality
25 pages
Arti Ficial Intelligence For Pharma: Time For Internal Investment
No ratings yet
Arti Ficial Intelligence For Pharma: Time For Internal Investment
4 pages
Machine learning-based clinical decision support for lab data
No ratings yet
Machine learning-based clinical decision support for lab data
31 pages
AIHC-U2
No ratings yet
AIHC-U2
29 pages
Recursion_pitchDeck
No ratings yet
Recursion_pitchDeck
179 pages
how-artificial-intelligence-can-power-clinical-development
No ratings yet
how-artificial-intelligence-can-power-clinical-development
9 pages
generative-ai-in-the-pharmaceutical-industry-moving-from-hype-to-reality
No ratings yet
generative-ai-in-the-pharmaceutical-industry-moving-from-hype-to-reality
25 pages
Hackathon PPT Template
No ratings yet
Hackathon PPT Template
21 pages
AI AND MACHINE LEARNING IN HEALTHCARE[1]
100% (1)
AI AND MACHINE LEARNING IN HEALTHCARE[1]
5 pages
Revolutionizing-Drug-Development-Artefact-Whitepaper
No ratings yet
Revolutionizing-Drug-Development-Artefact-Whitepaper
17 pages
A Practical Framework For ArtificialIntelligence Product Development in Healthcare
No ratings yet
A Practical Framework For ArtificialIntelligence Product Development in Healthcare
14 pages
From Hype To Health Whitepaper
No ratings yet
From Hype To Health Whitepaper
9 pages
AI in Healthcare Final
No ratings yet
AI in Healthcare Final
20 pages
MachineLearning Ass
No ratings yet
MachineLearning Ass
12 pages
AI-in-Healthcare
No ratings yet
AI-in-Healthcare
20 pages
Case Study and Ethics
No ratings yet
Case Study and Ethics
5 pages
AiinHealth
No ratings yet
AiinHealth
16 pages
Accelerating-Ai-Ml-Adoption-In-Biopharma 1120
No ratings yet
Accelerating-Ai-Ml-Adoption-In-Biopharma 1120
15 pages
AI-Powered Healthcare Diagnostics
No ratings yet
AI-Powered Healthcare Diagnostics
31 pages
Generative Ai in The Pharmaceutical Industry Moving From Hype To Reality VF
No ratings yet
Generative Ai in The Pharmaceutical Industry Moving From Hype To Reality VF
25 pages
Low CG
No ratings yet
Low CG
6 pages
Artificial Intelligence in Genomics
No ratings yet
Artificial Intelligence in Genomics
25 pages
AI Techniques for Healthcare
No ratings yet
AI Techniques for Healthcare
16 pages
Comprehensive Beginner’s Guide to Google’s Generative AI Studio for Non-technical Executives
From Everand
Comprehensive Beginner’s Guide to Google’s Generative AI Studio for Non-technical Executives
CertSquad Professional Trainers
No ratings yet
Deep Learning Biomedicine
No ratings yet
Deep Learning Biomedicine
28 pages
AIRevolutionizingDiseaseDiagnosis3669c1187cab5ae6
No ratings yet
AIRevolutionizingDiseaseDiagnosis3669c1187cab5ae6
11 pages
Workshop Report
No ratings yet
Workshop Report
32 pages
Aidd Bii
No ratings yet
Aidd Bii
132 pages
Application of AI for Health Care
No ratings yet
Application of AI for Health Care
4 pages
s41587-023-01789-6
No ratings yet
s41587-023-01789-6
2 pages
Ch6
No ratings yet
Ch6
22 pages
1. Revolutionizing Healthcare- An Introduction to Artificial Intelligence
No ratings yet
1. Revolutionizing Healthcare- An Introduction to Artificial Intelligence
21 pages
Healthcare Events Healthcare Insights Companies & Products Marketing Solutions About Us
No ratings yet
Healthcare Events Healthcare Insights Companies & Products Marketing Solutions About Us
4 pages
Knowledge Guided Data Centric AI in Healthcare 1685207849
No ratings yet
Knowledge Guided Data Centric AI in Healthcare 1685207849
21 pages
Ilovepdf Merged-4
No ratings yet
Ilovepdf Merged-4
69 pages
1 s2.0 S0010482521001189 Main
No ratings yet
1 s2.0 S0010482521001189 Main
15 pages
AI + Prompt Engineer_ Module-1_Hands-On-4
No ratings yet
AI + Prompt Engineer_ Module-1_Hands-On-4
11 pages
Agyapong Sampson
No ratings yet
Agyapong Sampson
11 pages
health AI
No ratings yet
health AI
45 pages
High-tech Business General Report
No ratings yet
High-tech Business General Report
20 pages
ISB DT Required Assignment 9.2
No ratings yet
ISB DT Required Assignment 9.2
3 pages
Computer Science & Engineering: Experiment 3.2
No ratings yet
Computer Science & Engineering: Experiment 3.2
3 pages
AI Activity (Group 2)
No ratings yet
AI Activity (Group 2)
11 pages
What Is Artificial Intelligence in Medicine
No ratings yet
What Is Artificial Intelligence in Medicine
5 pages
Ethical Framework For Harnessing The Power of AI I
No ratings yet
Ethical Framework For Harnessing The Power of AI I
32 pages
AI applications in Health industry
No ratings yet
AI applications in Health industry
12 pages
13. Smart-solutions- Unleashing AI's potential in revolutionizing disease diagnosis
No ratings yet
13. Smart-solutions- Unleashing AI's potential in revolutionizing disease diagnosis
14 pages
Homework – AI for Business Strategy
No ratings yet
Homework – AI for Business Strategy
9 pages
AI on Modern Health Care
No ratings yet
AI on Modern Health Care
3 pages
XAI Framework For Cardiovascular Disease
No ratings yet
XAI Framework For Cardiovascular Disease
30 pages
Introduction To AI in The Medical Industry
100% (1)
Introduction To AI in The Medical Industry
8 pages
AI in Medicine Is Overhyped - Scientific American
No ratings yet
AI in Medicine Is Overhyped - Scientific American
4 pages
In Sitro
No ratings yet
In Sitro
56 pages
Artificial Intelligence and Machine Learning in Healthcare
No ratings yet
Artificial Intelligence and Machine Learning in Healthcare
4 pages
Research Plan
No ratings yet
Research Plan
16 pages
Sociology as Applied to Health and Medicine Graham Scambler - The ebook in PDF format with all chapters is ready for download
100% (4)
Sociology as Applied to Health and Medicine Graham Scambler - The ebook in PDF format with all chapters is ready for download
68 pages
Manufacturing Technology (ME 361) - Lecture 20: Engineering Metrology
No ratings yet
Manufacturing Technology (ME 361) - Lecture 20: Engineering Metrology
35 pages
About Mullakadu
No ratings yet
About Mullakadu
2 pages
Impulse Kick-Off: Innovation in The By-Product Supply Chain of Citrus in Mediterranean Area
No ratings yet
Impulse Kick-Off: Innovation in The By-Product Supply Chain of Citrus in Mediterranean Area
3 pages
Rediscovering SWOT 'S Integrative Nature: A New Understanding of An Old Framework
No ratings yet
Rediscovering SWOT 'S Integrative Nature: A New Understanding of An Old Framework
17 pages
Survey Occult Galbreath
No ratings yet
Survey Occult Galbreath
29 pages
Final Acknowledgement
No ratings yet
Final Acknowledgement
6 pages
Week 3 Formulating Research Questions and Scientific Hypotheses
No ratings yet
Week 3 Formulating Research Questions and Scientific Hypotheses
70 pages
Repair Retrofit And Inspection Of Building Exterior Wall Systems Astm Special Technical Publication 1493 Paul G Johnson instant download
No ratings yet
Repair Retrofit And Inspection Of Building Exterior Wall Systems Astm Special Technical Publication 1493 Paul G Johnson instant download
80 pages
Biostatistics, Scope and Objectives 2
No ratings yet
Biostatistics, Scope and Objectives 2
54 pages
Icfai University: Department of Commerce
No ratings yet
Icfai University: Department of Commerce
3 pages
Fourth European Working Conditions Survey
No ratings yet
Fourth European Working Conditions Survey
140 pages
Jacques Vallee, Claude Poher. Basic Patterns in UFO Observations (AiAA, 1975)
No ratings yet
Jacques Vallee, Claude Poher. Basic Patterns in UFO Observations (AiAA, 1975)
14 pages
Library Manual: Symbiosis Institute of Management Studies (SIMS)
No ratings yet
Library Manual: Symbiosis Institute of Management Studies (SIMS)
13 pages
Taking Education Really Seriously Four Years Hard Labour Michael Fielding download
100% (1)
Taking Education Really Seriously Four Years Hard Labour Michael Fielding download
48 pages
Gender Identity and Erotic Preference in Males: Kurt Freund and Ray Blanchard, Centre For Addiction and
No ratings yet
Gender Identity and Erotic Preference in Males: Kurt Freund and Ray Blanchard, Centre For Addiction and
4 pages
CHAPTER I To 2 Practical Researc 1 Danmark C - Dimasacat
No ratings yet
CHAPTER I To 2 Practical Researc 1 Danmark C - Dimasacat
17 pages
Formative Assessment
No ratings yet
Formative Assessment
14 pages
Encyclopedia of Social Science Research Methods
100% (3)
Encyclopedia of Social Science Research Methods
580 pages
Siu (2007) "Grounded Displacement"
No ratings yet
Siu (2007) "Grounded Displacement"
22 pages
Test of Significant Difference
No ratings yet
Test of Significant Difference
11 pages
The Awareness Level of The Safety and He
No ratings yet
The Awareness Level of The Safety and He
8 pages
Cunha 2019
No ratings yet
Cunha 2019
13 pages
EDCI 672 Definition & Revised Definition of ID Expertise Discussion
No ratings yet
EDCI 672 Definition & Revised Definition of ID Expertise Discussion
2 pages
Synthesis
No ratings yet
Synthesis
2 pages
Stock Market Prediction Using Reinforcement Learning With Sentiment Analysis
No ratings yet
Stock Market Prediction Using Reinforcement Learning With Sentiment Analysis
20 pages