Your Organization-S Guide To Data Maturity 2
Your Organization-S Guide To Data Maturity 2
Visit DataCamp
Table of
Contents The state of data-driven digital transformation 1
The state of data- Now, almost a decade later, achieving digital The north star for any aspiring data-driven
driven digital transformation, becoming data-driven, and scaling
data science capabilities is on every organization’s
organization is to become data fluent, where everyone
has the access, and skills they need to work with data
transformation mind. According to a 2021 NewVantage Partners and make data-informed decisions. This ultimately
survey, 99% of firms are reporting active investment in means the democratization of data and skills across a
data science and machine learning. However, fewer range of roles and the maturation of a data culture.
than 30% of organizations are experiencing
transformational business outcomes as a result of In this guide, we outline a framework for evaluating,
these investments, and only 24% of them claim they and scaling data maturity throughout the
have created a data-driven organization. Similarly, organization, define the various data maturity stages
research from BCG shows that more than 80% of an organization goes through, and draw a path of
companies are accelerating their digital initiatives organizations can take to achieve data
transformation plans, but only 30% of them are fluency.
achieving their stated objectives.
Infrastructure — entails creating a data infrastructure that ensures data is collected, discoverable,
reliable, understood, compliant, and actionable throughout the organization.
2
People — entails both forging a data culture where everyone understands the value of data and the
The five data importance of continuous learning, as well ensuring that everyone has the skills to work effectively with data.
maturity levers Tools — which refers to the tools data practitioners across all skill levels use, as well as the development of
frameworks that enable easier work with data throughout the organization.
Organization — which refers to how data talent is organized, and adopting organizational models that
promote scalable data science throughout the organization.
Processes — the workflows that data experts and teams adopt to make their work more predictable and
collaborative, and to ensure alignment with business objectives.
Infrastructure and people are the base foundation supporting the other levers.
Without data culture, skills, and access to data, organizations won’t be able to leverage
data tools, scale data processes, and organize data talent.
The path to data fluency is comprised of four data maturity It’s important to note that this does not mean that everyone needs
stages. It’s important to think of data maturity as a to code, as there are various data personas found in every data-
spectrum, and that some organizations may find themselves driven organization who possess a completely different relationship
relatively more mature in one element of the IPTOP with data to do their best work. For example, data leaders and
framework and less mature in another. With that in mind, we consumers are often non-technical roles that need to be data
can break down the stages of data maturity through the literate in order to be conversational with data experts and identify
number of people in an organization that rely on data to do where data can be used to answer a business question. Data
their best work, and who have the skills and access to make engineers, on the other hand, are highly technical individuals who
data-driven decisions. are responsible for scaling and maintaining data infrastructure
throughout the organization.
Data Fluent
For a deep-dive into all eight Everyone knows how to
data personas found in every access the data they
data-driven organization, Data Progressive need to do their job and
download our white paper on the make data-driven
Every team has at least
L&D Guide to Data Fluency. decisions.
one data practitioner who
Data Scaling can analyze, report, and
Very few people in an present their data
organization have the regardless of their role.
Data Reactive skills and access they
No one accesses or uses need to analyze, report,
data in their daily work. and confidently present
An organization rarely data insights.
reports or presents data.
4
In data reactive organizations, no one accesses or uses data in their daily work,
What data reactive looks like and a company rarely reports on or presents data. There is no data culture or
strategy to support maturing data competencies.
Data
People Organization Fluent
Data
Progressive
No one has the skills to work with data on a daily basis There is no data team and no data experts throughout Data
Scaling
— and the organization does not have nor does it the organization. A lack of data strategy hinders the Data
Reactive
prioritize developing a data culture. allocation of resources for data talent.
At the data reactive stage, organizations suffer from low On the people side, data reactive organizations are
maturity in infrastructure and people, ultimately limiting characterized by cultures that value experience and gut
their maturity ceiling in terms of tools, organization, and instinct over data-driven decision making, there is no
processes. On the infrastructure side, a data reactive dedicated data talent working on scaling data maturity,
organization collects data on an ad hoc basis and stores it the majority of the organization does not value the
in siloes. The data often sits in departmentally owned flat opportunities of employing data analytics in their daily
files, spreadsheets, or legacy tools. There are no data work, and learning and development for data skills is not a
quality standards to support data linkage between priority on the organizational roadmap.
departments, and there is minimal data strategy that
incorporates scaling data infrastructure.
Moreover, scaling data skills and culture means that there is organizational alignment
and excitement around the importance of data initiatives (McKinsey). This excitement
will be the lifeblood that sustains future initiatives when scaling data maturity and will
be the catalyst of growth for organizations on their path to data fluency.
data experts that are working on data initiatives. management is still taking place.
At the data scaling stage, organizations have made the in the way of joining datasets across various departments. As for the remaining components of data maturity (tools,
first steps by creating a data strategy that aligns data More importantly, some departments have gotten a head organization, and processes), organizations at this stage
science with business objectives, is centered around data start over others in terms of accessing and utilizing data of maturity do not have a modern data tooling stack.
culture and skills, and incorporates scalable data to push key strategic pilot projects. Instead, most data work is done in legacy tools such as
infrastructure. Moreover, they have some data talent in spreadsheets, and only a few key data experts leverage
place to drive some strategic projects. Having made these On the people side, data scaling organizations can be advanced data tooling. More importantly, organizations
first steps, data scaling organizations will need to start characterized as having minimal data culture with very have still not developed an organizational model for data
strengthening maturity of tools, organization, and few people who believe in the importance of data, talent, and senior leadership is not yet spearheading or
processes. evidenced by not having the skills or mindset to make sponsoring change management. Finally, only a few ad
data-driven decisions. This can be manifested as a data hoc data processes exist on siloed teams, with no
Infrastructure for data scaling organizations is not yet culture that is still undefined, and whose exact organization-wide norms set in place to work with data or
mature, as only a few key experts understand how data is components are not yet articulated, and where self- collaborate with data experts.
accessed, and there is still no widespread trust in data. motivated learning is king. Moreover, as data initiatives
This means that there is no centralized data storage that start gaining traction alongside data experts, change
allows easy access for data experts and enthusiasts alike, agents and data champions will start to crop up in
datasets are still siloed, and data experts face data quality forward-thinking departments (DataIQ). Your Organization's Guide to Data Maturity 7
issues that lengthen the data cleaning process and stand
4. WHAT DATA SCALING LOOKS LIKE
a thing to do [...] Data Competency is a way of standards. Centralizing data storage is most and the use of data, tie data-driven decision-making
commonly done by leveraging centralized data to business outcomes, and drive continuous
thinking, and not a thing to think about”
storage solutions like data lakes and warehouses from education.
Kirk Borne, Chief Data Scientist at Booz Allen Hamilton cloud providers such as Amazon Web Services,
Microsoft Azure, Google Cloud, Snowflake, and more. • Create an upskilling strategy that’s made for the
long-term — A key component of creating and
As such, organizations need to articulate and define a • Start scaling data quality — One of the main reinforcing a data culture is making sure everyone has
data culture by defining important metrics for business obstacles towards organization-wide trust in data is the skills they need to work with data effectively to do
initiatives, defining and incentivizing new behaviors (e.g., data quality (Monte Carlo). Moreover, data quality their role. At this stage, organizations should set the
reporting on project updates with data points), developing issues hamper productivity as data experts spend stage for an organization-wide data upskilling
a wider learning and development strategy to shore up excessive time on data quality issues rather than strategy that is inclusive and takes into account all
data skills, and rewarding data champions and change advancing an organization’s data maturity types of roles. This begins with identifying relevant
agents. Strengthening infrastructure is the second (McKinsey). As such, organizations looking to mature data personas in the organization, assessing current
foundation for data to permeate throughout the their infrastructure should make data quality and skill sets and identifying skill gaps, and most
organization, which will allow organizations to scale data governance a key aspect of scaling infrastructure on importantly, aligning the upskilling objectives with
access. This includes centralizing data using a centralized the path to data progressive. This includes developing business strategy (DataCamp).
data storage solution, defining data access structure for data quality standards for various departments,
high-impact teams, and establishing data governance and creating documentation around accessing data,
quality mechanisms and policies. investing in data quality monitoring mechanisms, and Learn how AXA XL is using DataCamp
iteratively increasing the scope of data governance to upskill its people
initiatives. Learn more
5. WHAT DATA SCALING LOOKS LIKE
Centralized Model
• Reward the change agents — A data scaling between the centralized data team model and the H OW
MUC ..?
H…? WE.
ULD
organization is typically characterized by the decentralized model vary. The centralized model FINANCE SHO PRODUCT
presence of data champions and change agents who allows the data team to function as a center of
are playing an active role in evolving the status quo. excellence, with the ability to push the data agenda IS...
?
D ATA S C I E N C E
H OW
CAN
W H AT …?
Organizations should reward these change agents by throughout the organization. Moreover, it allows
MARKETING ENGINEERING
increasing the visibility of their work, promoting and collaboration and knowledge sharing between data
empowering them to be more cross-functional, and experts who are working on completely different
Benefits
evangelizing the successes they’ve led across the problems. However, a drawback is that it could limit
Data science functions as a center of excellence
organization. This further catalyzes culture change collaboration and alignment with subject matter
Promotes collaboration and knowledge sharing
and increases the adoption of data-driven buy-in experts, and risks isolating the data team as a support
Data Science Manager has domain knowledge
throughout the organization (Tableau). function. The decentralized model, on the other hand, Incentivizes consistent technology stack and better tooling
allows data experts to be directly embedded in
Drawbacks
Tools business units. This allows better cross-functional
Limits coordination between data science and other stakeholders
alignment between business stakeholders and data
• Modernize your data stack — As data becomes Risk of misalignment between data science and Business Units
scientists. However, this also means that it’s more
centralized and providing scalable access becomes Risks isolating data science as a Support Function
difficult to move data experts between teams, and
top of mind, giving access to data tooling that is
data experts are managed by business leaders.
inclusive to relevant data personas for your
organization will be paramount. This could include
Processes Decentralized Model
open-source programming languages like Python and
R, business intelligence tools such as Tableau or • Create processes for collaboration between data
Microsoft Power BI, easy access to a SQL database talent and business units — Depending on the model FINANCE PRODUCT
Data
People Organization Fluent
Data
Progressive
Data is seen as a strategic asset and there is alignment The data team is set in place, but their impact is siloed Data
Scaling
around its importance. However, organization-wide data and data talent is seen as a support function. Data
Reactive
literacy is lacking and data training is limited to pockets
of the organization.
At the data progressive stage, organizations have made strides to changes have been done to it, nor to find a clear explanation of As for tools, organization, and processes, data progressive
elevate data maturity. This translates into an environment where dataset metadata and other key components of data discovery. organizations have a modern tooling stack but have yet to create
there is organization-wide buy-in over the importance and Moreover, scalable organization-wide data quality standards are scalable frameworks that further democratize working with data.
strategic nature of leveraging data, and where data is largely still lacking, which hurts overall trust in data. Finally, This means that working with relatively advanced tools has a high
accessible. There is an established organizational model for data operationalizing machine learning and data science is still barrier to entry for non-data experts. The data team has an
talent, and modern tooling is accessible across the board for nascent, and many tools aimed at productionizing, monitoring, organizational model (centralized or decentralized), but data
various data personas. However, the final set of challenges here and refining models in production have yet to be adopted. talent is limited to being a support function for other business
are all about democratization—empowering everyone to make units. This limits the data team from advancing the data agenda
data-driven decisions, freeing up data talent from becoming a On the people side, strides in data culture have led to by pursuing high-impact projects that scale throughout the entire
support function, and focusing instead on highly strategic projects organization-wide buy-in on the strategic importance of data. organization. Data progressive organizations also have high
that capture value and raise the organization's data maturity However, data skills training is still departmental. Insufficient maturity processes for streamlining data talent workflows, but
(O’Reilly). organization-wide data literacy slows down innovation and limits ways of working around operationalizing machine learning and
productivity (Accenture). Most importantly, it hinders the data science are still nascent, and processes for scalable data
On the infrastructure side, data is accessible and data adoption of data science as a methodology for solving problems, work are not inclusive to the remainder of the organization.
infrastructure is maturing. However, there is no easy, scalable rather than a skill to acquire. This ultimately results in an
way to easily discover who owns a particular dataset and what organization where there is no common data language, an over-
reliance on data talent to provide data insights even if it’s basic
reports, and where people do not feel empowered to make data Your Organization's Guide to Data Maturity 10
informed decisions.
6. WHAT DATA PROGRESSIVE LOOKS LIKE
The path to data fluency requires a powerful search functionality, a deep overview of • Strengthen data quality and governance — In order
dataset ownership, lineage, metadata, and to create trust and alignment around data,
multipronged approach
programmatic functionality to scale data quality organizations looking to democratize data should look
The arrival at data fluency is arguably the most difficult checks and descriptions (Edmunds). at data governance as a key pillar for facilitating that.
step to make for organizations. At this point, organizations As such, organizations should invest in scaling their
are likely to have scaled their infrastructure to process data quality strategy and processes. This means
terabytes of data daily, hired, organized, and derived investing in data governance platforms, treating data
value from data talent, and enabled access to modern as a product to which it abides by software
tooling. engineering best practices like data service level
agreements (Monte Carlo) and creating data quality
The difficulty is scaling data science to become an certifications for datasets (Airbnb).
organization-wide methodology for solving business
challenges, and where data and data skills are • Invest in operationalization capabilities and tooling
democratized. To reach this point, organizations should — As machine learning models become more
adopt a multipronged approach that tackles all the commonplace—whether in business processes or in
People
• Assess, measure, and track data skill competencies Tools
over time — A key component of scaling data literacy
• Develop frameworks to democratize working with
• Roll out organization-wide data training fit for all is tracking skill development and using assessments to
data — Providing access to tooling is only one part of
data personas — We’ve demonstrated that people are steer learning paths (Deloitte). Assessments provide
the equation in democratizing data. The next frontier
the cornerstone of data maturity across stages. individuals with insights into their data skills maturity,
is lowering the barrier to working with data with
Organizations looking to drive data fluency at this and will guide personalized learning paths for them
frameworks. For example, the data team at Airbnb
stage face the challenge of making data a default based on their strengths, weaknesses, and necessary
developed an R package conveniently named Rbnb,
consideration in most decisions and actions taken. data skills for their particular role. At an aggregate
which provides a streamlined way to move data
One of the main blockers organizations face is level, assessments will provide organizations an
between different places in their data infrastructure,
insufficient data literacy (Deloitte), which limits the overview of their data maturity skill matrix, which acts
branded visualization themes, R Markdown templates
ability of non-experts to become independent when as a balance sheet of the company’s data skills.
for different types of reports, and custom functions to
seeking and creating data insights. When rolling out
optimize different parts of data workflows. At
organization-wide training, the best practice is to
create personalized learning paths for different data DataCamp Signal™ helps DataCamp, we have R and Python packages that
organizations to pinpoint their abstract querying data from our data warehouse with
roles. For example, Airbnb created Data University, a
a few lines of code. Frameworks enable everyone to
program aimed at upskilling the entire organization strengths and skill gaps at scale.
adopt more advanced data tooling, and standardize
on data skills, with content curated for specific data
the data workflow for data experts throughout the
teams in their Data U Intensive program. Cohorts were
• Reward a culture of continuous learning — Data organization.
able to achieve a 30% sustained increase in SQL
usage. Bloomberg uses DataCamp as part of a science is not merely a skill to learn, but a
blended learning program, which has sustained a methodology for solving business problems. As a field,
561% increase in activities on their proprietary Jupyter data science is continuously evolving, with new
Notebook environment (DataCamp). In a data fluent techniques, frameworks, tools, and best practices
organization, learning paths run the gamut from being developed (Datafloq). Organizations should
general data literacy, to data science, machine reward a culture of continuous learning if they want to
learning, and data engineering, and more. This build and sustain data skills across data roles
ultimately enables the adoption of a common data (McKinsey). This not only includes creating a safe
language and reinforces data science as a habit and space for active learning and development, but
methodology for approaching business problems. incentivizing novel, data-driven approaches for solving
problems and promoting learning by doing.
Organization
The Hybrid Model
• Consider moving to a hybrid organizational model — As data competencies mature and more
data experts are developed within the organization, the next step is to organize data talent under
a hybrid model. The hybrid model combines the centralized and decentralized data science team. FINANCE PRODUCT
In a hybrid model, a central data team functions as a center of excellence that drives data
democratization efforts like creating frameworks and standardizing the data tooling stack
DATA S C I E N C E
throughout the organization, all while embedding data scientists into business functions to create
better alignment between data science and business units. However, it’s important to note there
is a spectrum of hybrid models sitting between the two ends, and that organizations should find a
MARKETING ENGINEERING
sweet spot that suits their size, objectives, and data strategy (Altexsoft).
Benefits
Processes
Data science can function as a center of excellence.
• Create and reward processes that put collaboration and democratization at the center — A key Data science can drive common tech stack, tooling, frameworks, and standardization
Data science can collaborate and align on organizational goals.
mark of data fluent organizations is sharing tools, knowledge, and insights. These organizations
Better alignment between Data science and business units
have created processes that simplify access to data insights, making it easy to collaborate with
data experts and work with data. For example, LinkedIn developed DataHub, a data discovery Drawbacks
tool that also allows the organization to share data insights, charts, and dashboards. Airbnb Risk of mismatch of expectation leadership of Data science and business unit.
developed Knowledge Repo, a platform that makes it easy to publish reports and to surface data Everyone has at least two teams.
insights. Netflix created processes and templates for accessing and working with Notebooks
throughout depending on the type of problem being tackled. These processes enable easier
access to data knowledge, and are centered around simplifying work with data.
Your Organization's
Our Guide
Guide to
to Data
Data Maturity 13
Maturity 2
A blueprint for data fluency
Variety of ad-how legacy tools to work Any data work is done on ad-hoc basis
with data that are rarely leveraged and there are no processes for working
with data
What is common between these individuals is a mindset and buy-in around the
importance of leveraging data for making decisions, and for solving problems. This is
why building data skills and culture is paramount for any data maturity initiative to
scale regardless of where the organization sits on the data maturity spectrum.
Your Organization's
Our Guide
Guide to
to Data
Data Maturity 15
Maturity 2
Learn Track skills with skill matrix
How DataCamp supports DataCamp’s growing course library houses more than 350 Track the data skills your team has today and map a
techniques in their target domain, and domain Our mobile app is the perfect way to practice and learn
on the go.
experts can learn the fundamentals of data
literacy and data science.
Apply
Assess Once skills have been assessed, cultivated through A robust training experience
Effective learning starts with understanding skill gaps and courses, and sharpened through practice, learners are We work with the largest brands in the world, including
strengths. With DataCamp Signal™, learners can ready to apply their skills in a project-based environment. PayPal, Uber, HSBC, and Google, to help them transform
understand specific skill gaps they have across various With DataCamp projects, learners can solve a variety of their data skills. Our experienced team provides you with
topics and tools. From data literacy assessments like real-world R and Python data science projects. Learners resources and guidance on everything from adoption
understanding and interpreting data to programming and can opt for guided projects, where they can follow step- best practices to SSO and LMS integrations—giving you
machine learning assessments in R, Python, and SQL, our by-step tasks and receive helpful feedback as they apply the tools you need to upskill your team with confidence.
10-minute adaptive evaluations provide learners with their newfound skills. They can also opt for unguided Join over 7 million learners around the world. Close your
personalized skill gaps and learning paths to address their projects, which are open-ended, offering a variety of talent gap. Visit datacamp.com.
skill gaps. possible solutions and a live-code-along video to follow
how an expert data scientist would approach a solution.