Big Data and Machine Learning
This topic area will teach you about storing and processing large amounts of data in the cloud. You will learn how to use Hadoop for batch data processing and how to build machine learning models to make predictions. You will bring this all together to build a streaming analytics application based on Amazon’s serverless computing platform.
To achieve Beginner Level mastery of Big Data you must complete the following:
- Video: Hadoop Intro - 15 min
- QwikLab: Analyze Big Data with Hadoop - 45 min (no longer available)
- AWS Tutorial: Analyze Big Data with Hadoop - 60 min
To achieve Intermediate Level mastery of Big Data you must complete the following:
Data Storage:
- QwikLab: Intro to S3 - 45 min
- QwikLab: Intro to Amazon Redshift - 45 min
Big Data Analytics:
- Video: Short AWS Machine Learning Overview - 2 min
- AWS Tutorial: Analyze Big Data with Hadoop - 60 min (note: this is also listed under the Beginner Level)
Machine Learning Models:
- QwikLab: Intro to Amazon Machine Learning - 45 min
- AWS Tutorial: Build a Machine Learning Model - 30 min
- Video Tutorial: Overview of AWS SageMaker - 32 min
- AWS Tutorial: AWS SageMaker - 60 min
Bring it all together: