- Unequal distribution of data across nodes or partitions
- Missing or incomplete data values
- Inconsistent data across different datasets
- Incorrect or unreliable data