AWS Certified Data Engineer Associate 2026 – Hands On!
-
Introduction
-
Data Engineering FundamentalsIntro: Data Engineering Fundamentals0sTypes of Data (Structured, Unstructured, Semi-Structured)0sProperties of Data (Volume / Velocity / Variety)0sData Warehouses vs. Data Lakes (and Lakehouses)0sWhat is a “Data Mesh”?0sManaging and Orchestrating ETL Pipelines0sCommon Data Sources and Data Formats0sQuick Review of Data Modeling, Data Lineage, and Schema Evolution0sDatabase Performance Optimization0sData Sampling Techniques0sData Skew Mechanisms0sData Validation and Profiling0sSQL Review: Aggregation, Grouping, Sorting, Pivoting0sSQL JOIN types0sSQL Regular Expressions (a quick intro)0sA note about SQL coding exercises.Practicing aggregation queries in SQLGit review: architecture and commands0sPracticing grouping queries in SQLPracticing join queries in SQLData Engineering Fundamentals recap
-
StorageIntro: Storage0sSet up an AWS Billing Alarm0sAmazon S30sAmazon S3 – Hands On0sAmazon S3 Security – Bucket Policy0sAmazon S3 Security – Bucket Policy – Hands On0sAmazon S3 – Versioning0sAmazon S3 – Versioning – Hands On0sAmazon S3 – Replication0sAmazon S3 – Replication – Notes0sAmazon S3 – Replication – Hands On0sAmazon S3 – Storage Classes0sAmazon S3 – Storage Classes – Hands On0sAmazon S3 – Lifecycle Rules0sAmazon S3 – Lifecycle Rules – Hands On0sAmazon S3 – Event Notifications0sAmazon S3 – Event Notifications – Hands On0sAmazon S3 – Performance0sAmazon S3 – Encryption0sAbout DSSE-KMSAmazon S3 – Encryption – Hands On0sAmazon S3 – Default Encryption0sAmazon S3 – Access Points0sAmazon S3 – Object Lambda0sAmazon EBS0sAmazon EBS – Hands On0sAmazon EBS Elastic Volumes0sAmazon EFS0sAmazon EFS – Hands On0sAmazon EFS vs. Amazon EBS0sAWS Backup0sAWS Backup – Hands On0sStorage practice exam questions
-
DatabaseIntro: Database0sAmazon DynamoDB0sAmazon DynamoDB – Hands On0sAmazon DynamoDB in Big Data0sAmazon DynamoDB – Throughput (RCU & WCU)0sAmazon DynamoDB – Throughput (RCU & WCU) – Hands On0sAmazon DynamoDB – Basic APIs0sAmazon DynamoDB – Basic APIs – Hands On0sAmazon DynamoDB – Indexes (LSI & GSI)0sAmazon DynamoDB – Indexes (LSI & GSI) – Hands On0sAmazon DynamoDB – PartiQL0sAmazon DynamoDB Accelerator (DAX)0sAmazon DynamoDB Accelerator (DAX) – Hands On0sAmazon DynamoDB – Streams0sAmazon DynamoDB – Streams – Hands On0sAmazon DynamoDB – Time To Live (TTL)0sAmazon DynamoDB – Patterns with S30sAmazon DynamoDB – Security0sAmazon RDS0sShared and exclusive locks in RDS0sAmazon RDS Best Practices0sAmazon DocumentDB0sAmazon MemoryDB for Redis0sAmazon Keyspaces (for Apache Cassandra)0sAmazon Neptune0sAmazon Timestream0sAmazon Redshift Intro & Architecture0sRedshift Spectrum and Performance Tuning0sRedshift Durability and Scaling0sRedshift Distribution Styles0sRedshift Data Flows and the COPY command0sRedshift Integration / WLM / Vacuum0sRedshift Resizing0sRA3 Nodes, Cross-Region Data Sharing, Redshift ML0sRedshift Security0sRedshift Serverless0sRedshift Materialized Views0sRedshift Data Sharing / Data Shares0sRedshift Lambda UDF0sRedshift Federated Queries0sRedshift System Tables and System Views0sRedshift Data API0sRedshift – Hands On0sDatabase practice exam questions
-
Migration and TransferIntro: Migration and Transfer0sApplication Discovery Service & Application Migration Service0sAWS Database Migration Service (AWS DMS)0sAWS Database Migration Service (AWS DMS) – Hands On0sAWS DataSync0sAWS Snow Family0sAWS Snow Family – Hands On0sAWS Transfer Family0sMigration and Transfer practice exam questions
-
ComputeIntro: Compute0sEC2 in Big Data0sEC2 Graviton-based instances0sAWS Lambda0sLambda Integration – Part 10sLambda Integration – Part 20sAWS Lambda – File Systems Mounting0sAWS SAMAWS SAM – CLI Installation0sAWS SAM – Deploy Project0sAWS SAM – with API Gateway0sAWS SAM – with DynamoDB0sAWS Batch0sCompute practice exam questions
-
Containers
-
AnalyticsIntro: Analytics0sAWS Glue0sGlue, Hive, and ETL0sModifying the Glue Data Catalog from ETL Scripts0sRunning ETL Jobs with Bookmarks0sGlue Costs and Anti-Patterns0sAWS Glue Studio0sAWS Glue Data Quality0sAWS Glue DataBrew0sAWS Glue DataBrew Demo0sHandling PII in DataBrew Transformations0sAWS Glue Workflows0sAWS Lake Formation0sAWS Lake Formation Data Filters0sAmazon Athena0sAthena and Glue, Costs, and Security0sAthena Performance0sAthena ACID Transactions0sAthena Fine-Grained Access to AWS Glue Catalog0sApache Spark0sAthena, Glue, and S3 Data Lakes – Hands On0sAthena and CREATE TABLE AS SELECT (CTAS)0sSpark Integration with Kinesis and Redshift0sSpark Integration with Athena0sAmazon EMR0sEMR, AWS integration, and Storage0sEMR Promises; Intro to Hadoop0sEMR Serverless; EMR on EKS0sAmazon Kinesis Data Streams0sAmazon Kinesis Data Streams – Producers0sAmazon Kinesis Data Streams – Consumers0sAmazon Kinesis Data Streams – Hands On0sAmazon Kinesis Data Streams – Enhanced Fan Out0sAmazon Kinesis Data Streams – Scaling0sAmazon Kinesis Data Streams – Handling Duplicates0sAmazon Kinesis Data Streams – Security0sAmazon Data Firehose0sKinesis Data Stream Troubleshooting and Performance Tuning0sKinesis Data Analytics / Amazon Managed Service for Apache Flink (MSAF)0sUpdate: Kinesis Data Analytics EOLKinesis Analytics Costs; RANDOM_CUT_FOREST0sAmazon MSK0sAmazon MSK – Connect0sAmazon MSK – Serverless0sAmazon Kinesis vs. Amazon MSK0sAmazon OpenSearch Service0sAmazon OpenSearch Service, Pt. 20sOpenSearch Index Management and Designing for Stability0sAmazon OpenSearch Service Performance0sAmazon OpenSearch Serverless0sAmazon QuickSight0sQuickSight Pricing and Dashboards; ML Insights0sAnalytics practice exam questions
-
Application IntegrationIntro: Application Integration0sAmazon SQS0sAmazon Kinesis Data Streams vs. Amazon SQS0sAmazon SQS – Dead Letter Queues0sAmazon SQS – Dead Letter Queues – Hands On0sAmazon SNS0sAmazon SNS – with SQS Fan Out0sAmazon SNS – Hands On0sAWS Step Functions0sAWS Step Functions: State Machines and States0sAmazon AppFlow0sAmazon EventBridge0sAmazon EventBridge – Hands On0sAmazon Managed Workflows for Apache Airflow (Amazon MWAA)0sFull Data Engineering Pipelines0sApplication Integration practice questions
-
Security, Identity, and ComplianceIntro: Security, Identity, and Compliance0sPrinciple of Least Privilege0sData Masking and Anonymization0sKey Salting0sPreventing Backups or Replication to Disallowed AWS Regions0sIAM Introduction: Users, Groups, Policies0sIAM Users & Groups Hands On0sIAM Policies0sIAM Policies – Hands On0sIAM MFA0sIAM MFA – Hands On0sIAM Roles0sIAM Roles – Hands On0sEncryption 1010sAWS KMS0sAWS KMS – Hands On0sAmazon Macie0sAWS Secrets Manager0sAWS Secrets Manager – Hands On0sAWS WAF0sAWS Shield0sAWS Services Security Deep Dive – Part 10sAWS Services Security Deep Dive – Part 20sAWS Services Security Deep Dive – Part 30sSecurity, Identity, and Compliance practice questions
-
Networking and Content DeliveryIntro: Networking and Content Delivery0sVPC, Subnets, Internet Gateway, NAT Gateway0sNACL, Security Groups, VPC Flow Logs0sVPC Peering, Endpoints, VPN, Direct Connect0sVPC Cheat Sheet & Closing Comments0sAWS PrivateLink0sWhat is DNS?0sAmazon Route 530sAmazon Route 53 – Registering Domains – Hands On0sAmazon Route 53 – Creating DNS Records – Hands On0sAmazon CloudFront0sAmazon CloudFront – S3 as Origin – Hands On0sAmazon CloudFront – ALB/EC2 as an Origin0sAmazon CloudFront – Cache Invalidation0sNetworking and Content Delivery practice questions
-
Management and GovernanceIntro: Management and Governance0sAmazon CloudWatch – Metrics0sAmazon CloudWatch – Logs0sAmazon CloudWatch – Logs – Hands On0sAmazon CloudWatch – Logs Unified Agent0sAmazon CloudWatch – Alarms0sAmazon CloudWatch – Alarms – Hands On0sAmazon CloudTrail0sAmazon CloudTrail – Hands On0sAWS CloudTrail Lake0sAWS Config0sAWS Config – Hands On0sCloudWatch vs. CloudTrail vs. Config0sAWS CloudFormation0sAWS CloudFormation – Hands On0sSSM Parameter Store0sSSM Parameter Store – Hands On0sSSM Parameter Store – Lambda Integration0sAWS Well-Architected Framework & Tool0sAmazon Managed Grafana0sManagement and Governance practice questions
-
Machine Learning
-
Developer ToolsIntro: Developer Tools0sAWS Access Keys, CLI & SDK0sAWS CLI Setup on Windows0sAWS CLI Setup on Mac OS X0sAWS CLI Setup on Linux0sAWS CLI Hands On0sAWS CDK0sAWS CDK – Hands On0sAWS CodeDeploy0sIMPORTANT: CodeCommit Discontinuation0sAWS CodeCommit0sAWS CodeBuild0sAWS CodePipeline0sDeveloper Tools practice questions
-
Everything Else
-
Wrapping UpIntro: Wrapping Up0sReviewing the Exam Guide (and other AWS resources)0sGeneral AWS Certification Exam Tips0sExam Walkthrough and Signup0sSave 50% on your AWS Exam Cost!0sGet an Extra 30 Minutes on your AWS Exam – Non Native English Speakers Only0sAWS Certification Paths0sThank you!0sBONUS Lecture
NEW: Now includes a full-length practice exam!
The AWS Certified Data Engineer Associate Exam (DEA-C01 or DE1-C01) is one of the most challenging associate-level certification exams you can take from Amazon Web Services, and even among the most challenging overall. Passing it tells employers in no uncertain terms that your knowledge of data pipelines is wide and deep. But, even experienced technologists need to prepare heavily for this exam. This course sets you up for success, by covering all of the data ingestion, transformation, and orchestration technologies on the exam and how they fit together.
Best-selling Udemy instructors Frank Kane and Stéphane Maarek have teamed up to deliver the most comprehensive and hands-on prep course we’ve seen. Together, they’ve taught over 2 million people around the world. This course combines Stéphane’s depth on AWS with Frank’s experience in wrangling massive data sets, gleaned during his 9-year career at Amazon itself.
The world of data engineering on AWS includes a dizzying array of technologies and services. Just a sampling of the topics we cover in-depth are:
Streaming and transforming data with Amazon Kinesis and Amazon Managed Streaming for Apache Kafka (MSK)
Queuing messages with Simple Queue Service (SQS)
Orchestrating data pipelines with Amazon AppFlow, Amazon EventBridge, AWS Step Functions, and Amazon Managed Workflows for Apache Airflow (MWAA)
Transitioning from small to big data with the AWS Database Migration Service (DMS), AWS DataSync, Snow Family, Transfer Family, and more
Storing massive data lakes with the Simple Storage Service (S3) and managing data lifecycles
Optimizing transactional queries with DynamoDB, DocumentDB, Keyspaces, and MemoryDB
Tying your big data systems together with AWS Lambda
Making unstructured data query-able with AWS Glue, Glue DataBrew, and Lake Formation
Processing data at unlimited scale with Elastic MapReduce, including Apache Spark
Applying advanced machine learning algorithms at scale with Amazon SageMaker
Searching and analyzing petabyte-scale data with Amazon Opensearch (formerly Elasticsearch) Service
Querying S3 data lakes with Amazon Athena
Hosting massive-scale data warehouses with Redshift and Redshift Spectrum
Integrating smaller data with your big data, using the Relational Database Service (RDS)
Keeping your data secure with encryption, KMS, Macie, Secrets Manager, IAM, and more
Managing and governing your systems with CloudFormation, CloudTrail, CloudWatch, AWS Config, and more
Throughout the course, you’ll have lots of opportunities to reinforce your learning with hands-on demos, a full-length practice exam, and additional practice questions. We’ll also arm you with some valuable test-taking tips and strategies along the way.
Although this is an associate-level exam, it is one of the more challenging ones. AWS recommends having a few years of both data engineering experience and AWS experience before tackling it. This exam is not intended for AWS beginners.
You want to go into the AWS Certified Data Engineer Associate Exam with confidence, and that’s what this course delivers. Hit the enroll button, and we’re excited to see you in the course… and ultimately to see you get your certification!
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
Instructor
My name is Stéphane Maarek, I am passionate about Cloud Computing, and I will be your instructor in this course. I teach about AWS certifications, focusing on helping my students improve their professional proficiencies in AWS.
I have already taught 1,500,000+ students and gotten 500,000+ reviews throughout my career in designing and delivering these certifications and courses!
With AWS becoming the centerpiece of today’s modern IT architectures, I have decided it is time for students to learn how to be an AWS Data Analytics Professional. So, let’s kick start the course! You are in good hands!
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
Instructor
Hey, I’m Frank Kane, and I’m also co-instructing this course. I spent nine years working for Amazon from the inside as a senior engineer and senior manager, and I’m best known for my top-selling courses in “big data”, data analytics, machine learning, AI, Apache Spark, system design, and Elasticsearch.
I’ve been teaching on Xlbake since 2015, where I’ve reached over 800,000 students all around the world!
I’ve worked hard to keep this course up to date with the latest developments in AWS data engineering, and to make sure you’re prepared for the latest version of this exam. Let’s dive in and get you ready!
– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –
This course also comes with:
Lifetime access to all future updates
A responsive instructor in the Q&A Section
Xlbake Certificate of Completion Ready for Download
Join us in this course if you want to pass the AWS Certified Data Engineer – Associate Exam DEA-C01 and master the AWS platform!
What's included
- 22 hours on-demand video
- 3 coding exercises
- 1 practice test
- 4 articles
- Access on mobile and TV
- Certificate of completion