How to effectively migrate
data from legacy systems
www.cloverdx.com
Migration from legacy system
Legacy system
o Has not been modernized in a long time, runs on an outdated platform
o In-house or heavily customized
o Holds business-critical data – CRM, ERP, billing, …
o Required for correct functioning of the organization
New system
o Modern, more capable system
o Often cloud-based
o Provides business benefits like better reporting, data quality, visibility, …
Migrate: legacy system → new system
Telco Migration, The Need
Data flow: In-house system → CLF → nCRM
Validate, cleanse, map at each step
100k customers, 6m customers
Planning and scope changes
Gaps in knowledge (in source and in target)
Data quality
Execution and reporting
Challenges
Planning & Scope
Optimists planning a migration
Analysis and discovery
Extraction
Validation
Cleansing
Loading
Reconciliation
Done
In reality, many activities happen in parallel
Documentation of what needs to be done and of deadlines.
Can take a long time to complete
Work with business and review as early as possible to prevent issues
Figuring out the scope
Business specification
In-depth look at each system involved to determine implementation plan.
Requires cooperation from each party
Knowledge gaps can prolong the process quite a bit.
Figuring out the scope
Technical analysis
Hardest part, especially for complex migrations with many business rules.
What is the cost of implementing a business rule?
Big bang migration or trickle migration?
Figuring out the scope
Effort estimates and planning
Big bang: Analysis → Implementation & testing → Run
Trickle: Analysis → Implementation & testing → Run → Run → Run → Run
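The difference between the two approaches can be sketched in a few lines of Python. This is only an illustration: the `batches` and `migrate` helpers, the batch size and the in-memory "target" lists are assumptions made for the example, not part of the original slides.

```python
from typing import Dict, Iterator, List

Record = Dict[str, int]


def batches(records: List[Record], size: int) -> Iterator[List[Record]]:
    """Yield successive slices of at most `size` records."""
    for start in range(0, len(records), size):
        yield records[start:start + size]


def migrate(batch: List[Record], target: List[Record]) -> None:
    """Hypothetical load step: here it only appends to an in-memory list."""
    target.extend(batch)


source = [{"id": i} for i in range(10)]

# Big bang: a single run moves everything at once.
big_bang_target: List[Record] = []
migrate(source, big_bang_target)

# Trickle: several smaller runs, each of which can be checked on its own.
trickle_target: List[Record] = []
runs = 0
for batch in batches(source, size=4):
    migrate(batch, trickle_target)
    runs += 1
```

Both targets end up with the same records; the trickle variant simply gets there in several runs, which leaves room to verify and correct between them.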
Not all data has the same complexity
More complex data brings more problems
Migrate complex records first, maybe manually
Which data to process first?
Chart: number of records and problems plotted against data complexity (simple to complex); the complex end is marked “Do this first”
Oh no! The scope changed!
Factor “known unknowns” into your estimates: the scope may change after
technical analysis, and technology choices carry risks.
What if the business changes its mind? This cannot be avoided. Full and
close cooperation with the business helps when this happens.
Telco Migration, The Plan
Timeline: Start, 12 m, 24 m
Phases: In-house to CLF, CLF to nCRM
Data flow: In-house system → CLF → nCRM
Validate, cleanse, map at each step
100k customers, 6m customers
Data Quality
Single biggest issue in most legacy migration projects
Data frequently comes directly from users
o Typed into an application (or even an Excel document)
o People come and go
o Processes and ideas about data change
Garbage in, garbage out
o Migration is your best chance to fix your data; the business already
expects complications and issues with the data
Data Quality
Make this part of your technical analysis
Measure on every step of the process
o On input, but also after transformations
o Can help you catch logic errors
Provide frequent reports to business
o KPIs can help you make decisions: if the data is too bad, redesign the solution
Measuring data quality
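A minimal sketch of measuring quality both on input and after a transformation is shown below. The checks, the field names (`name`, `email`) and the `normalize` step are hypothetical examples; a real project would derive its own checks from the technical analysis.

```python
from typing import Callable, Dict, List

Record = Dict[str, str]
Check = Callable[[Record], bool]

# Hypothetical checks; each returns True when a record passes.
CHECKS: Dict[str, Check] = {
    "has_name": lambda r: bool(r.get("name", "").strip()),
    "has_email": lambda r: "@" in r.get("email", ""),
    "email_normalized": lambda r: r.get("email", "") == r.get("email", "").strip().lower(),
}


def quality_report(records: List[Record], checks: Dict[str, Check] = CHECKS) -> Dict[str, float]:
    """Return, for each named check, the share of records that pass it."""
    total = len(records)
    if total == 0:
        # No records means nothing failed.
        return {name: 1.0 for name in checks}
    return {
        name: sum(1 for r in records if check(r)) / total
        for name, check in checks.items()
    }


def normalize(record: Record) -> Record:
    """Hypothetical transformation step: trim whitespace, lowercase the email."""
    return {"name": record["name"].strip(), "email": record["email"].strip().lower()}


raw = [
    {"name": "Ann", "email": "ann@example.com"},
    {"name": " ", "email": "bob@example.com"},
    {"name": "Cy", "email": "cy-at-example.com"},
    {"name": "Di", "email": "DI@EXAMPLE.COM "},
]

input_report = quality_report(raw)
output_report = quality_report([normalize(r) for r in raw])
```

Comparing the two reports shows what the transformation fixed (`email_normalized`) and what it could not (`has_name`, `has_email`). A pass rate that drops after a transformation is a hint that the transformation logic itself is wrong.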
Fix as early as possible, ideally in the source system
o Needs cooperation with the source data owner
Fail on error or keep on going?
o Both approaches are valid for different use cases
Fixing data quality issues
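The two error-handling strategies can be sketched as a single switch. The `ValidationError` class, the `customer_id` rule and the `fail_fast` flag are assumptions made for this example only.

```python
from typing import Dict, List, Tuple

Record = Dict[str, str]


class ValidationError(Exception):
    """Raised when a record breaks a (hypothetical) business rule."""


def validate(record: Record) -> Record:
    """Example rule: every record needs a non-empty customer_id."""
    if not record.get("customer_id"):
        raise ValidationError(f"missing customer_id: {record!r}")
    return record


def process(records: List[Record], fail_fast: bool) -> Tuple[List[Record], List[str]]:
    """Validate records; stop at the first error or collect rejects and continue."""
    accepted: List[Record] = []
    rejected: List[str] = []
    for record in records:
        try:
            accepted.append(validate(record))
        except ValidationError as exc:
            if fail_fast:
                raise  # fail on error: abort the whole run
            rejected.append(str(exc))  # keep going: log the reject for reporting
    return accepted, rejected


rows = [
    {"customer_id": "C1", "name": "Ann"},
    {"customer_id": "", "name": "Bob"},
    {"customer_id": "C3", "name": "Cy"},
]

accepted, rejected = process(rows, fail_fast=False)
```

Failing fast suits runs where a partial load would be worse than no load; collecting rejects suits runs where most records should go through and the exceptions are reviewed afterwards.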
Data Mapping
How to map entities in the old system to the ones in the new one?
o Same data domain, different ideas about how to represent everything
Knowledge gaps
o Old system might not be known very well anymore
o New system is not yet known very well
The mapping problem
How to make it accessible to business users?
How to keep track of different mapping versions?
Use data modeling tools or a data catalog
Mapping specification
Split mappings into multiple “pieces” to make them easier to build and
verify
Document your code
Validate your data and report to business
Mapping development
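One way to read "split mappings into pieces" is to give each group of target fields its own small, documented function and compose them. The sketch below assumes hypothetical legacy fields (`first_name`, `last_name`, `email`, `status`) and target fields; none of them come from the original slides.

```python
from typing import Any, Callable, Dict, List

Source = Dict[str, str]
Target = Dict[str, Any]


def map_name(src: Source) -> Target:
    """Piece 1: build the full name from the two legacy name fields."""
    return {"full_name": f"{src['first_name'].strip()} {src['last_name'].strip()}"}


def map_contact(src: Source) -> Target:
    """Piece 2: normalize the contact email."""
    return {"email": src["email"].strip().lower()}


def map_status(src: Source) -> Target:
    """Piece 3: translate the legacy status code 'A' into a boolean flag."""
    return {"active": src["status"] == "A"}


# Each piece can be built, documented and tested independently.
PIECES: List[Callable[[Source], Target]] = [map_name, map_contact, map_status]


def map_customer(src: Source) -> Target:
    """Compose all pieces into one target record."""
    target: Target = {}
    for piece in PIECES:
        target.update(piece(src))
    return target


legacy = {"first_name": " Ada", "last_name": "Lovelace ", "email": " Ada@Example.com", "status": "A"}
mapped = map_customer(legacy)
```

Because every piece is a plain function with a docstring, it can be reviewed with the business one rule at a time and verified in isolation before the full mapping is run.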
Test data can be difficult to obtain
o Manually built test data is often insufficient – does not cover corner cases
Use data as close to production data as possible
o Anonymize to prevent data leaks and comply with regulations
Test with business
Test as early (and as often) as possible
Testing the Mapping
Production data sample → Anonymize → Use data in QA
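A rough sketch of the anonymization step, using keyed hashing from the Python standard library. Strictly speaking this is pseudonymization, and whether it is sufficient for a given regulation is an assumption that has to be checked separately. The `SECRET_KEY` value, the field names and the 12-character token length are hypothetical.

```python
import hashlib
import hmac
from typing import Dict, Set

# Hypothetical key; in practice it would be kept outside the QA environment.
SECRET_KEY = b"replace-me"


def pseudonymize(value: str) -> str:
    """Replace a value with a stable, keyed token (same input gives same token)."""
    digest = hmac.new(SECRET_KEY, value.encode("utf-8"), hashlib.sha256).hexdigest()
    return digest[:12]


def anonymize(record: Dict[str, str], sensitive_fields: Set[str]) -> Dict[str, str]:
    """Mask only the sensitive fields; keep the rest so the data stays realistic."""
    return {
        key: pseudonymize(value) if key in sensitive_fields else value
        for key, value in record.items()
    }


sample = {"name": "Ann Example", "email": "ann@example.com", "plan": "prepaid"}
masked = anonymize(sample, {"name", "email"})
```

Stable tokens keep joins between tables working in QA, because the same production value always maps to the same masked value.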
Everything is tested, smooth sailing from here…
Communication is key: give your business users frequent updates.
Detailed reports help ensure that production data is OK.
Be ready to support the solution – hypercare.
Execution
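One kind of detailed report is a reconciliation of record keys between source and target after a run. The sketch below is only an illustration; the `reconcile` function and its report keys are assumptions, not something defined in the slides.

```python
from typing import Dict, Iterable, List, Union

Report = Dict[str, Union[int, List[int]]]


def reconcile(source_ids: Iterable[int], target_ids: Iterable[int]) -> Report:
    """Compare record keys in source and target and list the differences."""
    source, target = set(source_ids), set(target_ids)
    return {
        "source_count": len(source),
        "target_count": len(target),
        "missing_in_target": sorted(source - target),
        "unexpected_in_target": sorted(target - source),
    }


report = reconcile([1, 2, 3, 4], [1, 2, 4, 5])
```

Matching counts alone can hide problems, as in this example where both sides hold four records but one is missing and one is unexpected, so the report lists the differing keys as well.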
Telco Migration, The Result
Timeline: Start, 24 m, 36 m
Phases: In-house to CLF, CLF to nCRM
Data flow: In-house system → CLF → nCRM
Validate, cleanse, map at each step
100k customers, 6m customers
Project restart
Additional target systems
Bespoke systems
o Vendor no longer available or not cooperating
Complex scenarios
o Many-to-one or many-to-many scenarios
o Lots of customizations over long periods of time
Messy data
o Needing cleaning up, repurposing, reconstructing or enrichment
Tight deadlines
o When a traditional approach can’t deliver on time
Data Management Platform
There is no “I can’t” with CloverDX.
Combining capabilities of coding with the
readability of a visual design, CloverDX lets
developers solve hard problems in code while
empowering less technical colleagues.
While CloverDX is built so that
everything can sit on a single
platform, it is flexible enough to fill gaps
in an existing technology stack.
Thanks to its open architecture,
almost every part of CloverDX can
be customized, versioned, parametrized.
If 90% of the work is easy and 10% is
the exceptions, errors and outliers, then
CloverDX is built to help you deal with
those 10 percent.
Solve 100% of a problem, not just the easy 90%
With CloverDX, you can ultimately think about running everything on autopilot.
Automate Everything
Automate the entirety of
your data workloads,
including the exceptions
and intricacies of real-life
processes.
Schedule jobs for
automatic delivery to data
warehouses, data lakes or
operational databases in
cloud or on premise.
Orchestrate external systems
and tools with CloverDX sitting
at the center of your
architecture, using events, API
calls or message queues.