- To track the flow and transformation of data
- To optimize database performance
- To define the schema of a dataset
- To validate the quality of data