AWS Big Data Helpful Resources
Updated on: 20 December 2019
Whitepapers and Resources to Review for Exam
- Elasticsearch: Proxy-based access control
- Elasticsearch: Cognito-based access control
- Redshift: Factors affecting Query Search
- Redshift: Designing Tables
- Redshift: Defining Primary Key and Foreign Key constraints
- Redshift: Amazon Best Practices for loading data
- Redshift: Tuning Techniques
- Redshift: Spectrum and S3
- DMS: Microsoft MS/SQL Server as a source for the Database Migration Service
- DMS: Supported Targets for DMS
- EMR: Best Practices Whitepaper
- EMR: EMRFS Consistent View
- EMR: Encryption Options
- Kinesis: Controlling Access with Amazon Kinesis Data Firehose
- DynamoDB: Best Practices for DynamoDB
- Athena: Partitioning Data
- S3: Analytics Storage Classes
- Machine Learning: Types of ML Models
- Exam Readiness: AWS Certified Big Data - Specialty (Digital) -- AWS Exam Readiness for the Big Data Certification
- AWS Big Data Exam Guide -- AWS Exam Guide for Big Data Specialty Exam
- AWS Big Data Sample Questions -- AWS Sample Questions for the Big Data Exam
- AWS What's New Blog -- A resource which shows you a listing of all the new feature announcements on the AWS platform.
- AWS This Is My Architecture -- A site which contains video chalk talks discussing differnent architectures that all run on AWS.
- AWS Blog Listing -- AWS publishes many great blog articles, this is a running list
- AWS Architecture Center -- AWS Architecture site which contains many architectural resources
- AWS Solutions -- AWS Solutions are pre-built Solutions for common applications, check out the Datalakes, IoT, and Analytics categories.
- AWS Whitepapers -- AWS Whitepapers on a variety of topics, look for the Big Data and Analytics Section
- AWS Certification Info -- Information on the AWS Big Data Specialty Exam
Tutorials and Great Labs
Below are some links to some great starter labs as well as tutorials to help you build solutions on AWS.
Sample Projects, Tools, and Code
The below resources are links to tools, code samples and projects.
- AWS Redshift Utils
- AWS Redshift Lambda Loader
- AWS Redshift Advanced Monitor Serverless
- AWS Kinesis Client
- AWS Kinesis Producer Library
- Hadoop Commands Cheat Sheet
AWS and Third-Party Tools
- AWS CLI -- Command Line Interface
- AWS CLI v2 -- A version 2 update, makes SSO easier.
- AWS CDK -- Cloud Development Kit (CloudFormation via TypeScript)
- AWS ECS CLI -- AWS ECS CLI v2
- AWS SAM -- Serverless Application Model (CloudFormation for Serverless Apps)
- AWS SAM CLI Toolkit -- CLI tool for using SAM
- Serverless Framework -- Third Party Serverless CLI tool