BI and Big Data Management
()
About this ebook
Companies increasingly recognize that the analysis of business information (business intelligence) can generate decisive competitive advantages. In addition, the compliance guidelines BCBS 239, Basel II and III, SOX, and Solvency II have led to legal requirements for a minimum level of quality in reporting and planning data and processes. The establishment of enterprise-wide data management thus continues to be one of the major challenges for IT and management in the years to come.
Data quality is an integral success factor in the establishment of an optimal information infrastructure. A 2002 study from "The Data Warehousing Institutes" (TDWI) calculates that poor data quality in the US cost about $622 billion. Gartner market research stated in 2006: Poor data quality costs a typical organization 20% of revenue….
The worldwide financial and economic crisis after 2007 can retrospectively also be regarded as a data quality crisis. Despite far-reaching compliance requirements, many financial service companies have not been able to aggregate and prepare their risk data in a way to adequately control their risks, and they are still struggling in 2017.
In the era of Big Data, data is viewed as the new oil and the available data volume worldwide multiplies every year. The requirements for transparency and data stream quality continue to increase, because these are considered essential for partially or completely new applications in decision support and other areas.
But what use are larger data piles when quality and origin remain uncertain and when the costs for development and operation in data maintenance, integration, and analysis are proportional to the data volume?
"Data quality is not everything, but without quality of data, it is all nothing."
Metadata and metadata management are important aids for ensuring adequate data quality.
The goal of this book is to take the current concepts and trends and tune the minds of project managers, IT managers, IT architects, analysts, developers, and business leaders back to the topics of data quality management and integrated metadata management.
Ulrich Hambuch
Ulrich Hambuch has been an independent business consultant concentrating on Business Intelligence, Data Warehouse, Big Data, and management since 2011. He has more than 15 years’ experience in IT and has been working on significant BI projects in industry-leading enterprises for over 10 years. In addition to being a consultant and an active partner for SAP and HortonWorks, he is the author of the 2008 book "Erfolgsfaktor Metadatenmanagement - Relevanz des Metadatenmanagements für die Datenqualität bei Business Intelligence" and has been working on vendor-independent certifications for Business Intelligence Consultants and Big Data Engineers with CeLS (an initiative by GECO & IDG).
Related to BI and Big Data Management
Related ebooks
Architecting Big Data & Analytics Solutions - Integrated with IoT & Cloud Rating: 5 out of 5 stars5/5Big Data Analytics: Disruptive Technologies for Changing the Game Rating: 4 out of 5 stars4/5Microsoft SQL Server 2014 Business Intelligence Development Beginner’s Guide Rating: 0 out of 5 stars0 ratingsUnderstanding the Predictive Analytics Lifecycle Rating: 5 out of 5 stars5/5Big Data Revolution: What farmers, doctors and insurance agents teach us about discovering big data patterns Rating: 3 out of 5 stars3/5The Data Warehouse Lifecycle Toolkit Rating: 0 out of 5 stars0 ratingsSelf-Service Analytics with Power BI: Learn how to build an end-to-end analytics solution in Power BI (English Edition) Rating: 0 out of 5 stars0 ratingsBusiness Models in Emerging Technologies: Data Science, AI, and Blockchain Rating: 0 out of 5 stars0 ratingsThe Big Data-Driven Business: How to Use Big Data to Win Customers, Beat Competitors, and Boost Profits Rating: 0 out of 5 stars0 ratingsFundamentals of Analytics Engineering: An introduction to building end-to-end analytics solutions Rating: 0 out of 5 stars0 ratingsSpreadsheets To Cubes (Advanced Data Analytics for Small Medium Business): Data Science Rating: 0 out of 5 stars0 ratingsBigData Analytics: Solution Or Resolution? Rating: 3 out of 5 stars3/5Data Quality: Empowering Businesses with Analytics and AI Rating: 0 out of 5 stars0 ratingsExpert Cube Development with SSAS Multidimensional Models Rating: 0 out of 5 stars0 ratingsBig Data: Understanding How Data Powers Big Business Rating: 2 out of 5 stars2/5Enterprise Architecture Tools Third Edition Rating: 0 out of 5 stars0 ratingsComprehensive SQL Techniques: Mastering Data Analysis and Reporting Rating: 0 out of 5 stars0 ratingsMDM for Customer Data: Optimizing Customer Centric Management of Your Business Rating: 0 out of 5 stars0 ratingsData warehouse Complete Self-Assessment Guide Rating: 4 out of 5 stars4/5Big Data, Big Analytics: Emerging Business Intelligence and Analytic Trends for Today's Businesses Rating: 0 out of 5 stars0 ratingsOracle Information Integration, Migration, and Consolidation Rating: 0 out of 5 stars0 ratingsBig Data Visualization Rating: 0 out of 5 stars0 ratingsBig Data: Opportunities and challenges Rating: 0 out of 5 stars0 ratingsData Lake Architecture Complete Self-Assessment Guide Rating: 0 out of 5 stars0 ratingsLearn Data Warehousing in 24 Hours Rating: 0 out of 5 stars0 ratingsWebERP Standard Requirements Rating: 0 out of 5 stars0 ratingsThe Definitive Guide to Data Integration: Unlock the power of data integration to efficiently manage, transform, and analyze data Rating: 0 out of 5 stars0 ratingsNeo4j High Performance Rating: 0 out of 5 stars0 ratings
Enterprise Applications For You
Notion for Beginners: Notion for Work, Play, and Productivity Rating: 4 out of 5 stars4/5Excel : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Excel Programming: 1 Rating: 5 out of 5 stars5/5Excel 101: A Beginner's & Intermediate's Guide for Mastering the Quintessence of Microsoft Excel (2010-2019 & 365) in no time! Rating: 0 out of 5 stars0 ratingsCreating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates Rating: 4 out of 5 stars4/5Microsoft Excel 365 Bible Rating: 0 out of 5 stars0 ratingsAgile Project Management: Scrum for Beginners Rating: 4 out of 5 stars4/5QuickBooks 2023 All-in-One For Dummies Rating: 0 out of 5 stars0 ratingsExcel Tables: A Complete Guide for Creating, Using and Automating Lists and Tables Rating: 5 out of 5 stars5/5Microsoft Teams For Dummies Rating: 0 out of 5 stars0 ratingsPersonal Knowledge Graphs: Connected thinking to boost productivity, creativity and discovery Rating: 5 out of 5 stars5/550 Useful Excel Functions: Excel Essentials, #3 Rating: 5 out of 5 stars5/5Excel Dashboards and Reports Rating: 5 out of 5 stars5/5Excel Formulas and Functions 2020: Excel Academy, #1 Rating: 4 out of 5 stars4/5The Ridiculously Simple Guide to Google Docs: A Practical Guide to Cloud-Based Word Processing Rating: 0 out of 5 stars0 ratingsQuickBooks Online For Dummies Rating: 0 out of 5 stars0 ratingsQuickBooks Online For Dummies, 2025 Edition Rating: 5 out of 5 stars5/5Excel Power Pivot and Power Query For Dummies Rating: 3 out of 5 stars3/5Microsoft Excel Formulas: Master Microsoft Excel 2016 Formulas in 30 days Rating: 4 out of 5 stars4/5Salesforce.com For Dummies Rating: 3 out of 5 stars3/5Trend Following: Learn to Make a Fortune in Both Bull and Bear Markets Rating: 5 out of 5 stars5/5Enterprise AI For Dummies Rating: 3 out of 5 stars3/5Access 2019 For Dummies Rating: 0 out of 5 stars0 ratings
Reviews for BI and Big Data Management
0 ratings0 reviews
Book preview
BI and Big Data Management - Ulrich Hambuch
BI and Big Data Management
Ulrich Hambuch
––––––––
Translated by Philipp Strazny
BI and Big Data Management
Written By Ulrich Hambuch
Copyright © 2017 Ulrich Hambuch
All rights reserved
Distributed by Babelcube, Inc.
www.babelcube.com
Translated by Philipp Strazny
Cover Design © 2017 Ulrich Hambuch
Babelcube Books
and Babelcube
are trademarks of Babelcube Inc.
Imprint
© / Copyright: 2017 Ulrich Hambuch
E-Mail: [email protected]
Web: http://www.infogenesis.de
First edition
English translation: Philipp Strazny
Cover, Illustrations: Ulrich Hambuch,
Cover image: http://www.pixabay.com
Images: http://www.pixabay.com and http://freeimages.com
Fig. 23 with permission from: ORAYLIS GmbH
This work and all its parts are subject to copyright. Any use outside of the confines of copyright is inadmissible without agreement of publisher and author. This applies in particular to electronic or other reproduction, translation, distribution, and publication.
Bibliographical data of the Deutsche Nationalbibliothek (German National Library):
Die Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic information are available online under http://dnb.d-nb.de.
If you do not change direction, you may end up where you are heading.
Lao Tzu
MeYouWe
The magical App for improved cooperation and human development.
––––––––
Introduction
Companies increasingly recognize that the analysis of business information (business intelligence) can generate decisive competitive advantages.
Thus, there is more emphasis on looking for strategies and techniques to make valuable business process data visible, available, and interpretable.
In addition, the compliance guidelines BCBS 239, Basel II and III, SOX, and Solvency II have led to legal requirements for a minimum level of quality in reporting and planning data and processes. The establishment of enterprise-wide data management thus continues to be one of the major challenges for IT and management in the years to come.
Data quality is an integral success factor in the establishment of an optimal information infrastructure. A 2002 study from The Data Warehousing Institutes
(TDWI) calculates that poor data quality in the US cost about $622 billion.
A basic requirement for adequate data quality is standardized and integrated data and information management, and companies taking steps to reach this goal recognize the crucial role of metadata.
Metadata describe data. They abstract from specific applications and thus establish data neutrality. This allows data to be integrated and used in other contexts.
Many projects in the context of decision support information systems, in particular business intelligence systems (BI) or big data initiatives, fail due to insufficient data quality. Data quality deficiencies have ramifications ranging from the need for post hoc data correction over reduced acceptance of the BI system to suboptimal decisions and insufficient support of operative business processes.
Gartner market research stated in 2006: Poor data quality costs a typical organization 20% of revenue.... A 2011 study by the Würzburg researcher BARC determined that poor data quality has various negative effects. It causes workers to be less satisfied when they have to spend a lot of time on unnecessary data cleansing. 61% of the surveyed also report increasing costs from poor data quality. 47% noted a decrease in customer satisfaction.
The worldwide financial and economic crisis after 2007 can retrospectively also be regarded as a data quality crisis. Despite far-reaching compliance requirements, many financial service companies have not been able to aggregate and prepare their risk data in a way to adequately control their risks, and they are still struggling in 2017. Besides factors such as homogeneous conceptual understanding, a modernized process and system architecture, and data governance, adequate and comprehensive data quality management as well as a maximally integrated metadata management also play a crucial role in efficient and reliable data management.
In the era of Big Data, data is viewed as the new oil and the available data volume worldwide multiplies every year. The requirements for transparency and data stream quality continue to increase, because these are considered essential for partially or completely new applications in decision support and other areas.
Figure 1: Volume forecast for digital data generated per year from 2005 to 2020 worldwide (in exabyte). Source: Digital Universe study.
Metadata and metadata management are important aids for ensuring adequate data quality. Metadata fall into roughly two categories:
Table 1: Metadata categories
Data abstraction, i.e. the generation and use of appropriate metadata, could be a suitable tool for getting a handle on growing data volumes. However, enterprises and government institutions hesitate to invest in relevant projects and infrastructures for successful data management, because they generally focus primarily on data collection and