Datasheet - Machine Learning
Datasheet - Machine Learning
MACHINE LEARNING
Master Data Management and Machine Learning are both data-intensive technologies and can
enhance and enable each other. MDM can improve the quality of data used to perform machine
learning, reducing data preparation efforts while improving the accuracy of the model through
better data. Conversely, Machine Learning can also be used to automate MDM, reducing the
burden on administrators and data stewards.
Machine learning approaches can improve outcomes in a wide variety of situations such as these:
The common solution to this problem is increasingly sophisticated ‘data prep’, which is where
data scientists spend most of their time. While some data prep is necessary, much of it is invested
in redundant efforts to mask data quality issues in the most important data, the master data that
represents the customers or products associated with a big data set.
Matching – This is one of the most critical capabilities of an MDM solution. Profisee MDM uses
sophisticated machine learning techniques to enable intelligent matching of data between and
across applications.
Matching and grouping duplicate data is a clustering problem, with some unique challenges in
the context of MDM:
For the technically-minded: Profisee’s ML matching algorithm begins by Featurizing the input
identifying attributes into a sorted vector of n-grams per attribute. Profisee then builds an ML
model based on these Features which is automatically maintained in memory to support high
performance in both initial and ongoing matching scenarios. This model is continuously re-trained
as stewardship occurs, allowing the engine to continue learning from data stewardship actions.
For the less-technically-minded: Profisee MDM has ML-assisted matching which enable high
performance and accuracy!
Data Stewardship – In addition, machine And of course, the faster and more effective the
learning can be used to actively assist data data stewardship, the more data and domains
stewards in resolving data issues by ‘learning’ can be mastered, and the better the overall
from previous manual corrections and data available to drive business intelligence,
suggesting future corrections – thus saving operations and of course ML-based
time and effort from human experts. predictive analytics.
Click Here to
Learn More
© 2021 Profisee Group Inc.
Document_152_01_02