This course covers the following concepts on each day:
Day 1
- Overview of Big Data
- Ingestion
- Big Data streaming and Amazon Kinesis
- Using Kinesis to stream and analyze Apache server logs
- Storage Solutions
- Querying Big Data using Amazon Athena
- Using Amazon Athena to analyze log data
- Introduction to Apache Hadoop and Amazon EMR
Day 2
- Using Amazon Elastic MapReduce
- Storing and Querying Data on DynamoDB
- Hadoop Programming Frameworks
- Processing Server Logs with Hive on Amazon EMR
- Streamlining Your Amazon EMR Experience with Hue
- Running Pig Scripts in Hue on Amazon EMR
- Spark on Amazon EMR
- Processing New York Taxi dataset using Spark on Amazon EMR
Day 3
- Using AWS Glue to automate ETL workloads
- Amazon Redshift and Big Data
- Visualizing and Orchestrating Big Data
- Visualizing
- Managing Amazon EMR Costs
- Securing Big Data solutions
- Big Data Design Patterns