Data Cleaning: A Practical Perspective (Synthesis Lectures on Data Management, 36)

(2)
Data Cleaning: A Practical Perspective (Synthesis Lectures on Data Management, 36) image
ISBN-10:

1608456773

ISBN-13:

9781608456772

Edition: 1
Released: Sep 01, 2013
Format: Paperback, 86 pages
Related ISBN: 9781450371520

Description:

Data warehouses consolidate various activities of a business and often form the backbone for generating reports that support important business decisions. Errors in data tend to creep in for a variety of reasons. Some of these reasons include errors during input data collection and errors while merging data collected independently across different databases. These errors in data warehouses often result in erroneous upstream reports, and could impact business decisions negatively. Therefore, one of the critical challenges while maintaining large data warehouses is that of ensuring the quality of data in the data warehouse remains high. The process of maintaining high data quality is commonly referred to as data cleaning. In this book, we first discuss the goals of data cleaning. Often, the goals of data cleaning are not well defined and could mean different solutions in different scenarios. Toward clarifying these goals, we abstract out a common set of data cleaning tasks that often need to be addressed. This abstraction allows us to develop solutions for these common data cleaning tasks. We then discuss a few popular approaches for developing such solutions. In particular, we focus on an operator-centric approach for developing a data cleaning platform. The operator-centric approach involves the development of customizable operators that could be used as building blocks for developing common solutions. This is similar to the approach of relational algebra for query processing. The basic set of operators can be put together to build complex queries. Finally, we discuss the development of custom scripts which leverage the basic data cleaning operators along with relational operators to implement effective solutions for data cleaning tasks. Table of Contents: Preface / Acknowledgments / Introduction / Technological Approaches / Similarity Functions / Operator: Similarity Join / Operator: Clustering / Operator: Parsing / Task: Record Matching / Task: Deduplication / Data Cleaning Scripts / Conclusion / Bibliography / Authors' Biographies

Best prices to buy, sell, or rent ISBN 9781608456772




Frequently Asked Questions about Data Cleaning: A Practical Perspective (Synthesis Lectures on Data Management, 36)

You can buy the Data Cleaning: A Practical Perspective (Synthesis Lectures on Data Management, 36) book at one of 20+ online bookstores with BookScouter, the website that helps find the best deal across the web. Currently, the best offer comes from and is $ for the .

The price for the book starts from $22.17 on Amazon and is available from 2 sellers at the moment.

At BookScouter, the prices for the book start at $19.29. Feel free to explore the offers for the book in used or new condition from various booksellers, aggregated on our website.

If you’re interested in selling back the Data Cleaning: A Practical Perspective (Synthesis Lectures on Data Management, 36) book, you can always look up BookScouter for the best deal. BookScouter checks 30+ buyback vendors with a single search and gives you actual information on buyback pricing instantly.

As for the Data Cleaning: A Practical Perspective (Synthesis Lectures on Data Management, 36) book, the best buyback offer comes from and is $ for the book in good condition.

The Data Cleaning: A Practical Perspective (Synthesis Lectures on Data Management, 36) book is in very low demand now as the rank for the book is 7,189,068 at the moment. A rank of 1,000,000 means the last copy sold approximately a month ago.

The highest price to sell back the Data Cleaning: A Practical Perspective (Synthesis Lectures on Data Management, 36) book within the last three months was on November 05 and it was $0.89.