- A tool for visualizing data pipelines in EMR
- A service for managing and scaling EMR clusters
- A distributed file system for Hadoop clusters
- Allows access to S3 as if it were HDFS