In a data warehouse, dirty data is any database record that contains errors. Common causes include duplicate records, incomplete or outdated data, and improper parsing of record fields when data is combined from disparate systems.
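As a rough illustration of how three of these causes (duplicates, incomplete records, and outdated records) might be detected, here is a minimal Python sketch. The field names (`customer_id`, `name`, `email`, `updated`), the `flag_dirty` helper, and the staleness cutoff are all hypothetical and chosen only for this example; a real warehouse would apply its own schema and matching rules.

```python
from datetime import date

# Hypothetical schema: every record is expected to carry these fields.
REQUIRED_FIELDS = ("customer_id", "name", "email", "updated")


def flag_dirty(records, stale_before):
    """Return (index, problems) pairs for records that look dirty.

    A record is flagged when it is missing a required field, repeats the
    normalized name/email of an earlier record, or was last updated before
    the `stale_before` date.
    """
    seen = set()
    flagged = []
    for index, rec in enumerate(records):
        problems = []

        # Incomplete: a required field is absent, None, or an empty string.
        missing = [f for f in REQUIRED_FIELDS if rec.get(f) in (None, "")]
        if missing:
            problems.append("incomplete: " + ", ".join(missing))

        # Duplicate: same name and email after trimming and lowercasing.
        key = (
            (rec.get("name") or "").strip().lower(),
            (rec.get("email") or "").strip().lower(),
        )
        if key in seen:
            problems.append("duplicate")
        else:
            seen.add(key)

        # Outdated: last update is older than the chosen cutoff.
        updated = rec.get("updated")
        if updated is not None and updated < stale_before:
            problems.append("outdated")

        if problems:
            flagged.append((index, problems))
    return flagged


# Made-up sample data for illustration only.
records = [
    {"customer_id": 1, "name": "Ada Lovelace", "email": "ada@example.com",
     "updated": date(2005, 6, 1)},
    {"customer_id": 2, "name": " ada lovelace", "email": "ADA@example.com",
     "updated": date(2005, 7, 1)},
    {"customer_id": 3, "name": "Alan Turing", "email": "",
     "updated": date(2005, 8, 1)},
    {"customer_id": 4, "name": "Grace Hopper", "email": "grace@example.com",
     "updated": date(1999, 1, 1)},
]

flagged = flag_dirty(records, stale_before=date(2003, 1, 1))
for index, problems in flagged:
    print(index, problems)
```

The duplicate check here uses exact matching on normalized text, which is the simplest possible rule. It would miss near-duplicates such as misspelled names, which generally call for fuzzy matching.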
The Data Warehousing Institute (TDWI) estimates that dirty data costs U.S. businesses more than $600 billion each year.
Also see data quality.
Last updated on: Nov 04, 2005