SlideShare a Scribd company logo
© 2023 Snowflake Inc. All Rights Reserved
© 2023 Snowflake Inc. All Rights Reserved
Santona Tuli | Director of Data
Alexander Adam | Manager, Cloud Platform Engineering
February 23, 2023
Powering Centene’s real-time
enterprise data analytics
with Upsolver
© 2023 Snowflake Inc. All Rights Reserved
Manager, Cloud Platform Engineering
Santona Tuli
Director of Data
Upsolver
Alexander Adam
Centene Corporation
© 2023 Snowflake Inc. All Rights Reserved
AGENDA • Intro to Upsolver
High quality, observable data ingestion at scale
• Intro to Centene
Largest Medicaid managed care organization
• Centene data architecture
A perfect match for Upsolver and Snowflake
• Why Centene chose Upsolver
Benefits of managing data quality at ingestion
© 2023 Snowflake Inc. All Rights Reserved
About Upsolver
© 2023 Snowflake Inc. All Rights Reserved
Data quality at ingestion
A cloud data ingestion platform, purpose-built to overcome the quality
challenges posed by complex data (streams, files, CDC).
Certifications Partnerships
© 2023 Snowflake Inc. All Rights Reserved
High quality, observable data at work
Over 10,000 production pipelines
Processing petabytes per day
Across multiple industries in enterprise, midsize, and startups
© 2023 Snowflake Inc. All Rights Reserved
The compounding cost of quality issues
Ingestion Transformation Consumption
$
DISASTER
DIFFICULT FIX
SIMPLE FIX Analytics engineer rebuilds
multiple pipelines and
downstream dependencies
Data engineer fixes a
single ingestion job
Stale, incorrect, or
corrupt data used in
numerous analytics
and ML use cases
$
$
Incident
occurs
Data lifecycle
Quality
Issue
Impact
© 2023 Snowflake Inc. All Rights Reserved
Source data may arrive out-of-order
and include duplicates
Transform models need to be really
complex to handle different cases
Sources & targets use different data
types and naming conventions
Dependent models break,
columns disappear or are repeated,
data are cast incorrectly
Source systems’ outputs
change without warning
Unexpected or empty values in
important fields ruin analytics
Root cause Downstream impact
Issues at ingestion wreck havoc later
© 2023 Snowflake Inc. All Rights Reserved
ExactSync
● exactly-once delivery
● strong data ordering
Transformations can focus on data,
rather than error-handling
Auto schema evolution
Resolves type conversion and
column naming violations
No schema-related breakage or
manual evolution work
Real-time data observability
Continuous statistics per field for
monitoring and retrospection
Quick detection dramatically
reduces data issues and cost to fix
Upsolver solution Benefits
How can we prevent it?
© 2023 Snowflake Inc. All Rights Reserved
Significant increase in data volume
Significant number of NULLs in important fields
Haven’t been updated recently
Newly added columns
Not what I expected
Outage!! - unexpected drop in events
Real-time data observability
© 2023 Snowflake Inc. All Rights Reserved
1
2
3
Getting started with
© 2023 Snowflake Inc. All Rights Reserved
Quality
Observability
Simpler models,
fresher data
Model
Sources Ingest Stage Transform Deliver
Reference architecture
© 2023 Snowflake Inc. All Rights Reserved
Lakehouse
Data
warehouse
Operational
databases
dbt CLI SDK
SQL
GUI
ExactSync Auto Schema
Evolution
Real-time Data
Observability
Streams
Operational
Databases
Object Stores
Upsolver in the stack
© 2023 Snowflake Inc. All Rights Reserved
Upsolver in any stack
© 2023 Snowflake Inc. All Rights Reserved
TESTIMONIALS
Upsolver plays a crucial part in our core
data infrastructure, and the team has
proven to be a reliable partner that’s
been committed to our success from
day one.
Amit Attias | CTO, Bigabid
With Upsolver, I could handle massive
amounts of streaming data and see
real results, in a fraction of the time I
thought it would take.
Guy Levy-Yurista | ex CSO, Sisense
Upsolver is solving a problem
at scale that I don’t want to
think about. The fact that
it just works and people
don’t complain is a win.
Yuji Xie | Analytics Lead, Gem
Upsolver has saved thousands of engineering
hours and significantly reduced total cost of
ownership, enabling us to invest in our
hypergrowth rather than data pipelines.
Seva Feldman | VP of R&D, Ironsource (Unity)
© 2023 Snowflake Inc. All Rights Reserved
Upsolver is like the “easy button” for Snowflake.
We ingest data from our Kafka streams, process it
as necessary for different use cases, and deliver it,
all while observing how our schema and data are
changing in real time.
Alexander Adam
Cloud Platform Engineering Manager, Centene
© 2023 Snowflake Inc. All Rights Reserved
Meet our client
Centene Corporation
© 2023 Snowflake Inc. All Rights Reserved
Centene Corporation in a snapshot
• Purpose: Transforming the health of the
community, one person at a time.
• Mission: Better health outcomes at lower costs.
Centene provides access to high-quality
healthcare, innovative programs and a wide range
of health solutions that help families and
individuals get well, stay well, and be well.
• By the numbers:
○ 74,300 employees
○ Covering all 50 States
○ 27.1 Million Members
○ Fortune #26
© 2023 Snowflake Inc. All Rights Reserved
How Centene uses data
We use data to help provide care and solutions for our members.
Our data lake receives data from many different source systems related to:
Claims, Member, Provider, Authorization, Corporate Operations, Health Plan, etc.
Different business domain teams leverage data to improve an array of solutions, such as:
• Processing authorizations on the latest information about a member or their care.
• Validating prescriptions by combining pharmacy system data and member data.
We have a strict cloud security model so we can protect our member data.
© 2023 Snowflake Inc. All Rights Reserved
Data platform team
Our group’s goal is to ingest data from source systems and ship relevant data to
domain-driven data stores, depending on different criteria and SLAs.
• Information in the source systems get updated throughout the day.
• Data get streamed through Kafka, processed by Upsolver, and pushed into Snowflake.
• Data from different sources are transformed and combined into larger,
domain-informed data sets for downstream operations by business partners.
Transform
Extract Load Operate
© 2023 Snowflake Inc. All Rights Reserved
Platform enablement
Moving from on-prem to cloud, simply adding headcount to the platform team didn’t scale:
• Our domain-driven use cases are highly specialized.
• With our previous setup, it was taking too long to onboard and train data engineers.
We needed:
• A platform that allows different teams to self-serve their relevant data.
• An enablement team for setting up processes and establishing usage patterns,
providing templates, documentation, and how-to guides on data flow patterns.
• A solution for data platform engineers, domain data engineers, and data analysts.
We found a simple solution in Upsolver → “the easy button”
© 2023 Snowflake Inc. All Rights Reserved
Data ingestion for Snowflake at Centene
© 2023 Snowflake Inc. All Rights Reserved
Why Centene chose Upsolver
Simplification of workflows
• Quick way to ingest and flatten JSON data, with
metrics on schema drift.
• Fan out source feed to multiple Snowflake databases
with different transformation requirements.
• Eliminated the development complexities of
hand-coding Glue/PySpark jobs.
• Avoid pitfalls such as rouge Glue jobs that may
unpredictably drive up costs.
• Require less access provisioning to AWS account.
© 2023 Snowflake Inc. All Rights Reserved
Why Centene chose Upsolver
Putting quality, governance, and security first
• Data never leaves Centene’s cloud infrastructure.
• We can build monitoring based on the metadata found
within the system tables in Upsolver.
• Clusters auto-scaling significantly reduces costs.
• Enables cost attribution to teams based on usage.
© 2023 Snowflake Inc. All Rights Reserved
How Upsolver stood out
In one word, simplicity.
• Ease of setting up ingestion from Kafka to Snowflake.
• Creates a responsive staging table.
• Live metrics and metadata on fields as data flow.
Added more than 100 data pipelines
Onboarded 45 active users
… in two months
© 2023 Snowflake Inc. All Rights Reserved
Business impact in a short time
• With Upsolver, cloud data pipeline creation went from quarters to a single sprint.
• A platform for subject matter expert data engineers to self-serve their data through
the lake to EDWs that abstracts the complexities of creating an ETL pipeline.
• A lot less overhead on infrastructure management compared to other tools, while still
providing flexibility and customization of infrastructure.
Time
Cost
Ops
© 2023 Snowflake Inc. All Rights Reserved
Q&A
© 2023 Snowflake Inc. All Rights Reserved
THANK YOU!
Ad

More Related Content

Similar to Upsolver+Snowflake_at_____________Centene.pdf (20)

Accelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data VirtualizationAccelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data Virtualization
Denodo
 
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout SessionAccenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Tom Laszewski
 
Customer Presentation- AVEVA Asset Information Management .pptx
Customer Presentation- AVEVA Asset Information Management .pptxCustomer Presentation- AVEVA Asset Information Management .pptx
Customer Presentation- AVEVA Asset Information Management .pptx
SonerSoycercel
 
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
DataStax Academy
 
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Cloudera, Inc.
 
Cloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made EasyCloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made Easy
Cloudera, Inc.
 
High-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache ImpalaHigh-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache Impala
Cloudera, Inc.
 
Get More from your Data: Accelerate Time-to-Value and Reduce TCO with Conflue...
Get More from your Data: Accelerate Time-to-Value and Reduce TCO with Conflue...Get More from your Data: Accelerate Time-to-Value and Reduce TCO with Conflue...
Get More from your Data: Accelerate Time-to-Value and Reduce TCO with Conflue...
HostedbyConfluent
 
Whitepaper cloud 2016
Whitepaper cloud 2016Whitepaper cloud 2016
Whitepaper cloud 2016
Kaizenlogcom
 
New ways to apply infrastructure data for better business outcomes
New ways to apply infrastructure data for better business outcomesNew ways to apply infrastructure data for better business outcomes
New ways to apply infrastructure data for better business outcomes
accenture
 
Using Mainframe Data in the Cloud: Design Once, Deploy Anywhere in a Hybrid W...
Using Mainframe Data in the Cloud: Design Once, Deploy Anywhere in a Hybrid W...Using Mainframe Data in the Cloud: Design Once, Deploy Anywhere in a Hybrid W...
Using Mainframe Data in the Cloud: Design Once, Deploy Anywhere in a Hybrid W...
Precisely
 
Migrating thousands of workloads to AWS at enterprise scale
Migrating thousands of workloads to AWS at enterprise scaleMigrating thousands of workloads to AWS at enterprise scale
Migrating thousands of workloads to AWS at enterprise scale
Tom Laszewski
 
Applying systems thinking to AWS enterprise application migration
Applying systems thinking to AWS enterprise application migrationApplying systems thinking to AWS enterprise application migration
Applying systems thinking to AWS enterprise application migration
Kacy Clarke
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
Eric Kavanagh
 
Future of Making Things
Future of Making ThingsFuture of Making Things
Future of Making Things
JC Davis
 
Cloud and Analytics - From Platforms to an Ecosystem
Cloud and Analytics - From Platforms to an EcosystemCloud and Analytics - From Platforms to an Ecosystem
Cloud and Analytics - From Platforms to an Ecosystem
Databricks
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
Cloudera, Inc.
 
AssetNet
AssetNetAssetNet
AssetNet
Ken Hall
 
AssetNet
AssetNetAssetNet
AssetNet
Ken Hall
 
Cloud and Analytics -- 2020 sparksummit
Cloud and Analytics -- 2020 sparksummitCloud and Analytics -- 2020 sparksummit
Cloud and Analytics -- 2020 sparksummit
Ming Yuan
 
Accelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data VirtualizationAccelerate Cloud Migrations and Architecture with Data Virtualization
Accelerate Cloud Migrations and Architecture with Data Virtualization
Denodo
 
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout SessionAccenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Accenture 2014 AWS re:Invent Enterprise Migration Breakout Session
Tom Laszewski
 
Customer Presentation- AVEVA Asset Information Management .pptx
Customer Presentation- AVEVA Asset Information Management .pptxCustomer Presentation- AVEVA Asset Information Management .pptx
Customer Presentation- AVEVA Asset Information Management .pptx
SonerSoycercel
 
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
Cassandra Summit 2014: Internet of Complex Things Analytics with Apache Cassa...
DataStax Academy
 
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Cloudera, Inc.
 
Cloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made EasyCloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made Easy
Cloudera, Inc.
 
High-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache ImpalaHigh-Performance Analytics in the Cloud with Apache Impala
High-Performance Analytics in the Cloud with Apache Impala
Cloudera, Inc.
 
Get More from your Data: Accelerate Time-to-Value and Reduce TCO with Conflue...
Get More from your Data: Accelerate Time-to-Value and Reduce TCO with Conflue...Get More from your Data: Accelerate Time-to-Value and Reduce TCO with Conflue...
Get More from your Data: Accelerate Time-to-Value and Reduce TCO with Conflue...
HostedbyConfluent
 
Whitepaper cloud 2016
Whitepaper cloud 2016Whitepaper cloud 2016
Whitepaper cloud 2016
Kaizenlogcom
 
New ways to apply infrastructure data for better business outcomes
New ways to apply infrastructure data for better business outcomesNew ways to apply infrastructure data for better business outcomes
New ways to apply infrastructure data for better business outcomes
accenture
 
Using Mainframe Data in the Cloud: Design Once, Deploy Anywhere in a Hybrid W...
Using Mainframe Data in the Cloud: Design Once, Deploy Anywhere in a Hybrid W...Using Mainframe Data in the Cloud: Design Once, Deploy Anywhere in a Hybrid W...
Using Mainframe Data in the Cloud: Design Once, Deploy Anywhere in a Hybrid W...
Precisely
 
Migrating thousands of workloads to AWS at enterprise scale
Migrating thousands of workloads to AWS at enterprise scaleMigrating thousands of workloads to AWS at enterprise scale
Migrating thousands of workloads to AWS at enterprise scale
Tom Laszewski
 
Applying systems thinking to AWS enterprise application migration
Applying systems thinking to AWS enterprise application migrationApplying systems thinking to AWS enterprise application migration
Applying systems thinking to AWS enterprise application migration
Kacy Clarke
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
Eric Kavanagh
 
Future of Making Things
Future of Making ThingsFuture of Making Things
Future of Making Things
JC Davis
 
Cloud and Analytics - From Platforms to an Ecosystem
Cloud and Analytics - From Platforms to an EcosystemCloud and Analytics - From Platforms to an Ecosystem
Cloud and Analytics - From Platforms to an Ecosystem
Databricks
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
Cloudera, Inc.
 
Cloud and Analytics -- 2020 sparksummit
Cloud and Analytics -- 2020 sparksummitCloud and Analytics -- 2020 sparksummit
Cloud and Analytics -- 2020 sparksummit
Ming Yuan
 

Recently uploaded (20)

IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptxSpecial Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
shyamraj55
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-UmgebungenHCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
panagenda
 
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul
 
Technology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data AnalyticsTechnology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data Analytics
InData Labs
 
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
Alan Dix
 
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
BookNet Canada
 
Semantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AISemantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AI
artmondano
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
Quantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur MorganQuantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfComplete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Software Company
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveDesigning Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptxDevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
Justin Reock
 
Mobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi ArabiaMobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi Arabia
Steve Jonas
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Impelsys Inc.
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptxSpecial Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
shyamraj55
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-UmgebungenHCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
HCL Nomad Web – Best Practices und Verwaltung von Multiuser-Umgebungen
panagenda
 
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul
 
Technology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data AnalyticsTechnology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data Analytics
InData Labs
 
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...
Alan Dix
 
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
BookNet Canada
 
Semantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AISemantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AI
artmondano
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
Quantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur MorganQuantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfComplete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Software Company
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveDesigning Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptxDevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
Justin Reock
 
Mobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi ArabiaMobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi Arabia
Steve Jonas
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Impelsys Inc.
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
Ad

Upsolver+Snowflake_at_____________Centene.pdf

  • 1. © 2023 Snowflake Inc. All Rights Reserved
  • 2. © 2023 Snowflake Inc. All Rights Reserved Santona Tuli | Director of Data Alexander Adam | Manager, Cloud Platform Engineering February 23, 2023 Powering Centene’s real-time enterprise data analytics with Upsolver
  • 3. © 2023 Snowflake Inc. All Rights Reserved Manager, Cloud Platform Engineering Santona Tuli Director of Data Upsolver Alexander Adam Centene Corporation
  • 4. © 2023 Snowflake Inc. All Rights Reserved AGENDA • Intro to Upsolver High quality, observable data ingestion at scale • Intro to Centene Largest Medicaid managed care organization • Centene data architecture A perfect match for Upsolver and Snowflake • Why Centene chose Upsolver Benefits of managing data quality at ingestion
  • 5. © 2023 Snowflake Inc. All Rights Reserved About Upsolver
  • 6. © 2023 Snowflake Inc. All Rights Reserved Data quality at ingestion A cloud data ingestion platform, purpose-built to overcome the quality challenges posed by complex data (streams, files, CDC). Certifications Partnerships
  • 7. © 2023 Snowflake Inc. All Rights Reserved High quality, observable data at work Over 10,000 production pipelines Processing petabytes per day Across multiple industries in enterprise, midsize, and startups
  • 8. © 2023 Snowflake Inc. All Rights Reserved The compounding cost of quality issues Ingestion Transformation Consumption $ DISASTER DIFFICULT FIX SIMPLE FIX Analytics engineer rebuilds multiple pipelines and downstream dependencies Data engineer fixes a single ingestion job Stale, incorrect, or corrupt data used in numerous analytics and ML use cases $ $ Incident occurs Data lifecycle Quality Issue Impact
  • 9. © 2023 Snowflake Inc. All Rights Reserved Source data may arrive out-of-order and include duplicates Transform models need to be really complex to handle different cases Sources & targets use different data types and naming conventions Dependent models break, columns disappear or are repeated, data are cast incorrectly Source systems’ outputs change without warning Unexpected or empty values in important fields ruin analytics Root cause Downstream impact Issues at ingestion wreck havoc later
  • 10. © 2023 Snowflake Inc. All Rights Reserved ExactSync ● exactly-once delivery ● strong data ordering Transformations can focus on data, rather than error-handling Auto schema evolution Resolves type conversion and column naming violations No schema-related breakage or manual evolution work Real-time data observability Continuous statistics per field for monitoring and retrospection Quick detection dramatically reduces data issues and cost to fix Upsolver solution Benefits How can we prevent it?
  • 11. © 2023 Snowflake Inc. All Rights Reserved Significant increase in data volume Significant number of NULLs in important fields Haven’t been updated recently Newly added columns Not what I expected Outage!! - unexpected drop in events Real-time data observability
  • 12. © 2023 Snowflake Inc. All Rights Reserved 1 2 3 Getting started with
  • 13. © 2023 Snowflake Inc. All Rights Reserved Quality Observability Simpler models, fresher data Model Sources Ingest Stage Transform Deliver Reference architecture
  • 14. © 2023 Snowflake Inc. All Rights Reserved Lakehouse Data warehouse Operational databases dbt CLI SDK SQL GUI ExactSync Auto Schema Evolution Real-time Data Observability Streams Operational Databases Object Stores Upsolver in the stack
  • 15. © 2023 Snowflake Inc. All Rights Reserved Upsolver in any stack
  • 16. © 2023 Snowflake Inc. All Rights Reserved TESTIMONIALS Upsolver plays a crucial part in our core data infrastructure, and the team has proven to be a reliable partner that’s been committed to our success from day one. Amit Attias | CTO, Bigabid With Upsolver, I could handle massive amounts of streaming data and see real results, in a fraction of the time I thought it would take. Guy Levy-Yurista | ex CSO, Sisense Upsolver is solving a problem at scale that I don’t want to think about. The fact that it just works and people don’t complain is a win. Yuji Xie | Analytics Lead, Gem Upsolver has saved thousands of engineering hours and significantly reduced total cost of ownership, enabling us to invest in our hypergrowth rather than data pipelines. Seva Feldman | VP of R&D, Ironsource (Unity)
  • 17. © 2023 Snowflake Inc. All Rights Reserved Upsolver is like the “easy button” for Snowflake. We ingest data from our Kafka streams, process it as necessary for different use cases, and deliver it, all while observing how our schema and data are changing in real time. Alexander Adam Cloud Platform Engineering Manager, Centene
  • 18. © 2023 Snowflake Inc. All Rights Reserved Meet our client Centene Corporation
  • 19. © 2023 Snowflake Inc. All Rights Reserved Centene Corporation in a snapshot • Purpose: Transforming the health of the community, one person at a time. • Mission: Better health outcomes at lower costs. Centene provides access to high-quality healthcare, innovative programs and a wide range of health solutions that help families and individuals get well, stay well, and be well. • By the numbers: ○ 74,300 employees ○ Covering all 50 States ○ 27.1 Million Members ○ Fortune #26
  • 20. © 2023 Snowflake Inc. All Rights Reserved How Centene uses data We use data to help provide care and solutions for our members. Our data lake receives data from many different source systems related to: Claims, Member, Provider, Authorization, Corporate Operations, Health Plan, etc. Different business domain teams leverage data to improve an array of solutions, such as: • Processing authorizations on the latest information about a member or their care. • Validating prescriptions by combining pharmacy system data and member data. We have a strict cloud security model so we can protect our member data.
  • 21. © 2023 Snowflake Inc. All Rights Reserved Data platform team Our group’s goal is to ingest data from source systems and ship relevant data to domain-driven data stores, depending on different criteria and SLAs. • Information in the source systems get updated throughout the day. • Data get streamed through Kafka, processed by Upsolver, and pushed into Snowflake. • Data from different sources are transformed and combined into larger, domain-informed data sets for downstream operations by business partners. Transform Extract Load Operate
  • 22. © 2023 Snowflake Inc. All Rights Reserved Platform enablement Moving from on-prem to cloud, simply adding headcount to the platform team didn’t scale: • Our domain-driven use cases are highly specialized. • With our previous setup, it was taking too long to onboard and train data engineers. We needed: • A platform that allows different teams to self-serve their relevant data. • An enablement team for setting up processes and establishing usage patterns, providing templates, documentation, and how-to guides on data flow patterns. • A solution for data platform engineers, domain data engineers, and data analysts. We found a simple solution in Upsolver → “the easy button”
  • 23. © 2023 Snowflake Inc. All Rights Reserved Data ingestion for Snowflake at Centene
  • 24. © 2023 Snowflake Inc. All Rights Reserved Why Centene chose Upsolver Simplification of workflows • Quick way to ingest and flatten JSON data, with metrics on schema drift. • Fan out source feed to multiple Snowflake databases with different transformation requirements. • Eliminated the development complexities of hand-coding Glue/PySpark jobs. • Avoid pitfalls such as rouge Glue jobs that may unpredictably drive up costs. • Require less access provisioning to AWS account.
  • 25. © 2023 Snowflake Inc. All Rights Reserved Why Centene chose Upsolver Putting quality, governance, and security first • Data never leaves Centene’s cloud infrastructure. • We can build monitoring based on the metadata found within the system tables in Upsolver. • Clusters auto-scaling significantly reduces costs. • Enables cost attribution to teams based on usage.
  • 26. © 2023 Snowflake Inc. All Rights Reserved How Upsolver stood out In one word, simplicity. • Ease of setting up ingestion from Kafka to Snowflake. • Creates a responsive staging table. • Live metrics and metadata on fields as data flow. Added more than 100 data pipelines Onboarded 45 active users … in two months
  • 27. © 2023 Snowflake Inc. All Rights Reserved Business impact in a short time • With Upsolver, cloud data pipeline creation went from quarters to a single sprint. • A platform for subject matter expert data engineers to self-serve their data through the lake to EDWs that abstracts the complexities of creating an ETL pipeline. • A lot less overhead on infrastructure management compared to other tools, while still providing flexibility and customization of infrastructure. Time Cost Ops
  • 28. © 2023 Snowflake Inc. All Rights Reserved Q&A
  • 29. © 2023 Snowflake Inc. All Rights Reserved THANK YOU!