"Data Profiling"
Data Profiling is the process of examining data in an existing database and collecting statistics and information about that data. The information collected may be used to:
- determine if existing data can be repurposed
- give metrics on data quality
- assess the risk involved in integrating data for new applications
- apply six sigma methodologies to enterprise data by tracking data quality
- assess whether metadata accurately describes the actual values in the source database
- understand data challenges in any data intensive project, thereby avoiding costly project surprises.