Data cleaning First Edition Association For Computing Machinery 1st edition by Xu Chu, Ihab F Ilyas – Ebook PDF Instant Download/DeliveryISBN: 1450371523, 9781450371520
Full download Data cleaning First Edition Association For Computing Machinery 1st edition after payment.
Product details:
ISBN-10 : 1450371523
ISBN-13 : 9781450371520
Author: Xu Chu, Ihab F Ilyas
Poor data across businesses and the U.S. government are reported to cost trillions of dollars a year. Multiple surveys show that dirty data is the most common barrier faced by data scientists. Not surprisingly, developing effective and efficient data cleaning solutions is challenging and is rife with deep theoretical and engineering problems.
This book is about data cleaning, which is used to refer to all kinds of tasks and activities to detect and repair errors in the data. Rather than focus on a particular data cleaning task, we give an overview of the end-to-end data cleaning process, describing various error detection and repair methods, and attempt to anchor these proposals with multiple taxonomies and views. Specifically, we cover four of the most common and important data cleaning tasks, namely, outlier detection, data transformation, error repair (including imputing missing values), and data deduplication. Furthermore, due to the increasing popularity and applicability of machine learning techniques, we include a chapter that specifically explores how machine learning techniques are used for data cleaning, and how data cleaning is used to improve machine learning models.
Data cleaning First Edition Association For Computing Machinery 1st Table of contents:
1. Introduction
2. Outlier Detection
3. Data Deduplication
4. Data Transformation
5. Data Quality Rule Definition and Discovery
6. Rule-Based Data Cleaning
7. Machine Learning and Probabilistic Data Cleaning
8. Conclusion and Future Thoughts
People also search for Data cleaning First Edition Association For Computing Machinery 1st:
data cleaning certification
data cleaning course
data cleaning activities
data cleaning training
data cleaning guidelines
Tags: Data cleaning, Association, Computing Machinery, Xu Chu, Ihab Ilyas