Executing Data Quality Projects (eBook)
352 Seiten
Elsevier Science (Verlag)
978-0-08-055839-4 (ISBN)
* Includes numerous templates, detailed examples, and practical advice for executing every step of the 'Ten Steps” approach.
* Allows for quick reference with an easy-to-use format highlighting key concepts and definitions, important checkpoints, communication activities, and best practices.
* A companion Web site includes links to numerous data quality resources, including many of the planning and information-gathering templates featured in the text, quick summaries of key ideas from the Ten Step methodology, and other tools and information available online.
Danette McGilvray is president and principle of Granite Falls Consulting, Inc., a firm specializing in information and data quality management to support key business processes around customer satisfaction, decision support, supply chain management, and operational excellence.
Information is currency. Recent studies show that data quality problems are costing businesses billions of dollars each year, with poor data linked to waste and inefficiency, damaged credibility among customers and suppliers, and an organizational inability to make sound decisions. In this important and timely new book, Danette McGilvray presents her "e;Ten Steps approach to information quality, a proven method for both understanding and creating information quality in the enterprise. Her trademarked approach-in which she has trained Fortune 500 clients and hundreds of workshop attendees-applies to all types of data and to all types of organizations.* Includes numerous templates, detailed examples, and practical advice for executing every step of the "e;Ten Steps approach.* Allows for quick reference with an easy-to-use format highlighting key concepts and definitions, important checkpoints, communication activities, and best practices.* A companion Web site includes links to numerous data quality resources, including many of the planning and information-gathering templates featured in the text, quick summaries of key ideas from the Ten Step methodology, and other tools and information available online.
Front Cover 1
Executing Data Quality Projects 4
Copyright Page 5
Contents 8
Acknowledgments 12
Introduction 14
The Reason for This Book 14
Intended Audiences 15
Structure of This Book 16
How to Use This Book 18
Chapter 1. Overview 21
The Impact of Information and Data Quality 23
About the Methodology: Concepts and Steps 25
Approaches to Data Quality in Projects 28
Engaging Management 31
Chapter 2. Key Concepts 33
Introduction 35
The Framework for Information Quality 35
The Information Life Cycle 42
Data Quality Dimensions 49
Business Impact Techniques 54
Data Categories 58
Data Specifications 64
Data Governance and Data Stewardship 71
The Information and Data Quality Improvement Cycle 73
The Ten Steps Process 76
Best Practices and Guidelines 78
Chapter 3. The Ten Steps Process 81
Introduction 83
Step 1 Define Business Need and Approach 85
Introduction 85
Step 1.1 Prioritize the Business Issue 88
Step 1.2 Plan the Project 91
Step 2 Analyze Information Environment 95
Introduction 95
Step 2.1 Understand Relevant Requirements 101
Step 2.2 Understand Relevant Data and Specifi cations 103
Step 2.3 Understand Relevant Technology 109
Step 2.4 Understand Relevant Processes 112
Step 2.5 Understand Relevant People/Organizations 117
Step 2.6 Define the Information Life Cycle 121
Step 2.7 Design Data Capture and Assessment Plan 124
Step 3 Assess Data Quality 127
Step 3.1 Data Specifi cations 133
Step 3.2 Data Integrity Fundamentals 137
Step 3.3 Duplication 147
Step 3.4 Accuracy 153
Step 3.5 Consistency and Synchronization 159
Step 3.6 Timeliness and Availability 162
Step 3.7 Ease of Use and Maintainability 166
Step 3.8 Data Coverage 168
Step 3.9 Presentation Quality 170
Step 3.10 Perception, Relevance, and Trust 174
Step 3.11 Data Decay 178
Step 3.12 Transactability 180
Step 4 Assess Business Impact 182
Step 4.1 Anecdotes 186
Step 4.2 Usage 192
Step 4.3 Five “Whys” for Business Impact 194
Step 4.4 Benefit versus Cost Matrix 196
Step 4.5 Ranking and Prioritization 200
Step 4.6 Process Impact 205
Step 4.7 Cost of Low-Quality Data 208
Step 4.8 Cost–Benefit Analysis 214
Step 5 Identify Root Causes 217
Step 5.1 Five “Whys” for Root Cause 220
Step 5.2 Track and Trace 222
Step 5.3 Cause-and-Effect/Fishbone Diagram 223
Step 6 Develop Improvement Plans 227
Step 7 Prevent Future Data Errors 232
Step 8 Correct Current Data Errors 237
Step 9 Implement Controls 241
Step 10 Communicate Actions and Results 246
The Ten Steps Process Summary 252
Chapter 4. Structuring Your Project 257
Projects and The Ten Steps 259
Data Quality Project Roles 271
Project Timing 272
Chapter 5. Other Techniques and Tools 275
Introduction 277
Information Life Cycle Approaches 277
Capture Data 282
Analyze and Document Results 282
Metrics 288
Data Quality Tools 290
The Ten Steps and Six Sigma 296
Chapter 6. A Few Final Words 297
Appendix: Quick References 301
The Framework for Information Quality 303
The POSMAD Interaction Matrix in Detail 305
POSMAD Phases and Activities 307
Data Quality Dimensions 308
Business Impact Techniques 309
Overview of The Ten Steps Process 310
Definitions of Data Categories 312
Glossary 314
Bibliography 322
List of Figures, Tables, and Templates 326
Index 330
About the Author 353
Chapter 1 Overview
If the state of quality of your company’s products and services
was the same level of quality as the data in your databases,
would your company survive or go out of business?
–Larry English
A corollary: If the state of quality of your company’s data was
the same level of quality as your company’s products and
services, how much more profitable would your company be?
– Mehmet Orun
In This Chapter
The Impact of Information and Data Quality
About the Methodology: Concepts and Steps
The Impact of Information and Data Quality
Information quality problems and their impact are all around us: A customer does not receive an order because of incorrect shipping information; products are sold below cost because of wrong discount rates; a manufacturing line is stopped because parts were not ordered—the result of inaccurate inventory information; a well-known U.S. senator is stopped at an airport (twice) because his name is on a government “Do not fly” list; many communities cannot run an election with results that people trust; financial reform has created new legislation such as Sarbanes–Oxley.1
Information is not simply data, strings of numbers, lists of addresses, or test results stored in a computer. Information is the product of business processes and is continuously used and reused by them. However, it takes human beings to bring information to its real-world context and give it meaning. Every day human beings use information to make decisions, complete transactions, and carry out all the other activities that make a business run. Applications come and applications go, but the information in those applications lives on.
That’s where information quality comes into play. Effective business decisions and actions can only be made when based on high-quality information—the key here being effective. Yes, business decisions are based all the time on poor-quality data, but effective business decisions cannot be made with flawed, incomplete, or misleading data. People need information they can trust to be correct and current if they are to do the work that furthers business goals and objectives.
A firm’s basis for competition … has changed from tangible products to intangible information. A firm’s information represents the firm’s collective knowledge used to produce and deliver products and services to consumers. Quality information is increasingly recognized as the most valuable asset of the firm. Firms are grappling with how to capitalize on information and knowledge. Companies are striving, more often silently, to remedy business impacts rooted in poor quality information and knowledge.
– Kuan-Tsae Huang, Yang W. Lee,
and Richard Y. Wang2
Tom Redman says it well:
The costs of poor quality are enormous. Some costs, such as added expense and lost customers, are relatively easy to spot, if the organization looks. We suggest (based on a small number of careful, but proprietary studies), as a working figure, that these costs are roughly 10 percent of revenue for a typical organization…. This figure does not include other costs, such as bad decisions and low morale, that are harder to measure but even more important.3
What is the cost to a company of the sales rep, publicly announced to have won the top sales award for the year along with the trip to Hawaii, only to have it rescinded a few days later because the sales data were wrong? Does the resulting embarrassment and low morale influence that sales rep’s productivity and therefore sales, or even his decision to stay with the company? What is the cost to the embassy whose name was splashed across the front pages of a major U.S. city’s newspaper when its visa applications containing sensitive personal and business information, such as Social Security numbers and strategic business plans, were found thrown in an open dumpster instead of being properly disposed of? Does the resulting lack of trust in the management of that information influence another company’s decision to do business in that country?
What Is Information Quality?
Information quality is the degree to which information and data4 can be a trusted source for any and/or all required uses. Simply put, it is having the right information, at the right time and place, for the right people to use to run the business, serve customers, and achieve company goals. Quality information is also fit for its purpose—the level of quality supports all of its uses.
Definition
Information quality is the degree to which information and data can be a trusted source for any and/or all required uses. It is having the right set of correct information, at the right time, in the right place, for the right people to use to make decisions, to run the business, to serve customers, and to achieve company goals.
Where Do Information Quality Problems Come From?
Information quality problems may be caused by human, process, or system issues. They are not restricted to older or particular types of systems. Although everyone is aware that data cause problems from time to time, it may be difficult to perceive the extent to which these problems affect the business. Some normal business activities are indicative of data quality problems5:
- Correction activities
- Rework
- Reprocessing orders
- Handling returns
- Dealing with customer complaints
Many of these activities do not appear to be associated with information quality, when in fact they are. Since processes and functions are distributed across an organization and many people, the cost and scope of data quality problems are often not visible.
Business processes create, update, and delete data in addition to applying information in many ways. Information technology (IT) teams are responsible for the quality of the systems that store and move the data, but they cannot be held completely responsible for the content. Both IT and the business must share in insisting on clearly articulated requirements, strict testing of systems, and the development of quality processes for data management.
The Information Quality Challenge
I believe that two major trends have created an environment where information quality is getting more of the attention it deserves. One is the increasing number of legal and regulatory data quality requirements. The need for and benefits from information quality have always been there and ready for any organization who invests in it. But human nature being what it is, the threat of bad publicity and high fines and the risk of a CEO going to jail have created the motivation to actually do something about data quality.
The second reason is based on the need for business to see information brought together in new ways. Examples include the need to see what top customers are doing across the enterprise through CRM (Customer Relationship Management), to have data available for decision support through business intelligence and data warehousing, to streamline business processes and information through ERP (Enterprise Resource Planning), and to deal with the high rate of mergers and acquisitions, which require the integration of data from different companies.
All these initiatives require data integration—bringing together data from two or more different sources and combining them in such a way that new and better uses can be made of the resulting information. Data that previously fulfilled the needs of one particular functional area in the business are now being combined with data from other functional areas—often with very poor results. We have different business uses for the same information; different platforms, systems, databases, and applications; different types of data (customer, vendor, manufacturing, finance, etc.); different data structures, definitions, and standards; and data, processes, and technology customized to fit the business, geography, or application. These are the challenges of the current environment.
What we need is the ability to share information with our customers and with each other across the company. We need the ability to find what we need, when we need it, and to be able to trust it when we get it. What is required for that to happen? We must consciously manage information as a resource (a source of help) and as an asset (a source drawn on by a company for making profit). We must have information that is real (an accurate reflection of the real world), recent (up to date), and relevant (that our business and customers need and care about).
This book is here to help.
About the Methodology: Concepts and Steps
“Doctor, my left arm hurts!” The doctor puts your arm in a sling, gives you an aspirin, and tells you to go home. But what if you were having a heart attack? You would expect the doctor to diagnose your condition and take emergency measures to save your life. After you were stabilized you would expect the doctor to run tests, get to the root cause of the heart attack, and recommend measures to correct any damage done (if possible) and prevent another attack from occurring. The doctor would...
Erscheint lt. Verlag | 1.9.2008 |
---|---|
Sprache | englisch |
Themenwelt | Sachbuch/Ratgeber |
Mathematik / Informatik ► Informatik ► Datenbanken | |
Informatik ► Office Programme ► Outlook | |
Mathematik / Informatik ► Informatik ► Software Entwicklung | |
Sozialwissenschaften ► Kommunikation / Medien ► Buchhandel / Bibliothekswesen | |
Technik | |
ISBN-10 | 0-08-055839-9 / 0080558399 |
ISBN-13 | 978-0-08-055839-4 / 9780080558394 |
Informationen gemäß Produktsicherheitsverordnung (GPSR) | |
Haben Sie eine Frage zum Produkt? |
Kopierschutz: Adobe-DRM
Adobe-DRM ist ein Kopierschutz, der das eBook vor Mißbrauch schützen soll. Dabei wird das eBook bereits beim Download auf Ihre persönliche Adobe-ID autorisiert. Lesen können Sie das eBook dann nur auf den Geräten, welche ebenfalls auf Ihre Adobe-ID registriert sind.
Details zum Adobe-DRM
Dateiformat: EPUB (Electronic Publication)
EPUB ist ein offener Standard für eBooks und eignet sich besonders zur Darstellung von Belletristik und Sachbüchern. Der Fließtext wird dynamisch an die Display- und Schriftgröße angepasst. Auch für mobile Lesegeräte ist EPUB daher gut geeignet.
Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen eine
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen eine
Geräteliste und zusätzliche Hinweise
Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.
aus dem Bereich