GENAIWIKI

Data Preparation

Data Cleaning

The process of identifying and correcting errors or inconsistencies in data to improve its quality.

Expanded definition

Data cleaning is a critical step in data preparation that involves correcting or removing inaccurate, duplicate, and irrelevant records in a dataset. The goal is data that is accurate, complete, and consistent enough for analysis. Common techniques include normalization (bringing values into a consistent format or scale), handling missing values (for example, by imputing or dropping them), and correcting data entry errors.
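
The techniques named above can be illustrated with a minimal sketch in plain Python. Everything in it is an assumption made for illustration: the sample records, the field names, the COUNTRY_ALIASES mapping, and the choice to fill missing ages with the median are hypothetical, and a real pipeline would choose its rules based on the dataset at hand.

```python
from statistics import median

# Hypothetical raw records; field names and values are made up for illustration.
RAW_RECORDS = [
    {"name": " Alice ", "country": "usa", "age": "34"},
    {"name": "alice", "country": "USA", "age": "34"},     # duplicate once normalized
    {"name": "Bob", "country": "U.S.A.", "age": ""},      # missing value
    {"name": "Carol", "country": "Canada", "age": "29"},
    {"name": "Dave", "country": "canada", "age": "abc"},  # data entry error
]

# Assumed mapping from spelling variants to one canonical label.
COUNTRY_ALIASES = {"usa": "USA", "u.s.a.": "USA", "canada": "Canada"}


def parse_age(value):
    """Return the age as an int, or None if it is missing or malformed."""
    try:
        return int(value)
    except ValueError:
        return None


def clean_records(records):
    """Normalize fields, drop duplicates, and fill missing ages with the median."""
    cleaned = []
    seen = set()
    for record in records:
        # Normalization: trim whitespace and unify capitalization and spelling.
        name = record["name"].strip().title()
        raw_country = record["country"].strip()
        country = COUNTRY_ALIASES.get(raw_country.lower(), raw_country)

        # Deduplication: keep only the first record for each (name, country) pair.
        key = (name, country)
        if key in seen:
            continue
        seen.add(key)

        cleaned.append({"name": name, "country": country, "age": parse_age(record["age"])})

    # Missing values: impute with the median of the ages that parsed correctly.
    known_ages = [r["age"] for r in cleaned if r["age"] is not None]
    fill_value = median(known_ages) if known_ages else None
    for r in cleaned:
        if r["age"] is None:
            r["age"] = fill_value
    return cleaned


if __name__ == "__main__":
    for row in clean_records(RAW_RECORDS):
        print(row)
```

In this sketch the malformed age "abc" is treated the same way as a missing value. Whether to impute, drop, or flag such records is a judgment call that depends on how the data will be used.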
