- Providing a web interface for interacting with the cluster
- Storing HDFS data and running tasks
- Running tasks but not hosting data
- Managing the cluster and monitoring its health