AWS Certified Cloud Practitioner (CLF-C02) #3 Domain 1-2 Cloud Design — The Six Well-Architected Pillars

Sunday, May 3, 2026

9 min read

In #2 we locked in the cloud’s value proposition, the CAF, and the global infrastructure. For the second half of Domain 1, we organize the six pillars of the Well-Architected Framework.

Well-Architected isn’t just a checklist — it’s a set of design principles AWS distilled from years of real-world practice. The CLF-C02 exam tosses you a scenario and asks either “which pillar does this scenario emphasize?” or “what’s the best practice for this pillar?”.

The six pillars at a glance #

#	Pillar	One-line definition	Core vocabulary
1	Operational Excellence	The ability to run, monitor, and improve a system	Automation, IaC, CloudFormation, CloudWatch
2	Security	The ability to protect data, systems, and assets	IAM, encryption, least privilege, KMS
3	Reliability	The ability to perform intended functions correctly and recover from failure	Multi-AZ, backups, Auto Scaling, failover
4	Performance Efficiency	The ability to use compute resources efficiently	Right instance type, caching, scaling
5	Cost Optimization	The ability to sustain cost efficiency over time	Reserved Instances, Spot, Trusted Advisor
6	Sustainability	Minimize the environmental impact of workloads	Resource utilization, efficient region selection

You don’t need to memorize the order. But the axis of tradeoffs each pillar tries to resolve must be in your head. Cost optimization and performance efficiency often clash, and pushing reliability up usually pushes cost up. Exam questions test whether you can first identify which pillar’s perspective you should be answering from.

1) Operational Excellence #

Operational Excellence is the ability to run a system reliably and keep improving it. The emphasis is on repeatable processes, not one-time setups.

Design principles #

Perform operations as code (IaC) — Use code like CloudFormation/Terraform instead of manual configuration
Make frequent, small, reversible changes — Many small changes rather than one big one
Refine operations procedures frequently — Validate procedures with practices like game days
Anticipate failure and learn from it — A postmortem culture

Matching services #

Service	Role
AWS CloudFormation	IaC, managing infrastructure as YAML/JSON code
AWS Systems Manager	Operations automation, Patch Manager and Run Command
AWS CloudTrail	Audit logs of API calls
Amazon CloudWatch	Metrics, logs, alarms
AWS Config	Tracks resource configuration changes

Exam question patterns #

“We’re building infrastructure manually and our environments are inconsistent — which pillar should be emphasized?” → Operational Excellence (supporting keywords: IaC, CloudFormation).

2) Security #

Security is the most frequently asked pillar on the exam. It’s the ability to keep data, systems, and assets safe.

Design principles #

Strong identity foundation — Least privilege and MFA via IAM
Apply security at all layers — VPC, subnet, instance, app — each layer
Encrypt data in transit and at rest — TLS / KMS
Prepare for events — Automated response procedures
Keep people away from data — Use automated tools instead of letting humans touch data directly
Audit trails — CloudTrail, Config

Matching services #

Service	Role
AWS IAM	Users, roles, policies
AWS KMS	Encryption key management
AWS WAF	Web application firewall
AWS Shield	DDoS protection
Amazon GuardDuty	Threat detection
AWS Secrets Manager	Secret credential management

Exam question patterns #

“How do you strengthen security when storing sensitive data in S3?” → Encryption at rest with KMS.

“I want a safer access method to instances inside a VPC instead of SSH keys” → AWS Systems Manager Session Manager (a valid answer candidate for either Security or Operational Excellence).

This domain itself gets fuller coverage in #4 and #5.

3) Reliability #

Reliability is the ability of a system to perform its intended function correctly and automatically recover from failures.

Design principles #

Automatically test recovery procedures — Regular automatic tests, not manual ones
Scale horizontally to increase availability — Many small servers rather than one big one
Stop guessing capacity — Let Auto Scaling adjust automatically
Automate changes — Code instead of human error

Matching services #

Service	Role
Auto Scaling	Automatically scales with load
Elastic Load Balancing	Distributes load across multiple instances
Amazon Route 53	DNS failover
AWS Backup	Unified backup management
Multi-AZ deployment	Higher availability for RDS, EFS, and more
AWS CloudFormation	Recreates infrastructure for disaster recovery

Exam question patterns #

“Service must keep running even if an AZ goes down” → Multi-AZ deployment (Reliability).

“Traffic is unpredictable and we sometimes run out of capacity” → Auto Scaling (candidate for both Reliability and Performance Efficiency).

Reliability vs Performance Efficiency #

These two get confused often. A simple distinction:

Reliability — “Does it still work when something fails?”
Performance Efficiency — “Are resources used efficiently?”

The same Auto Scaling is classified as Reliability when the angle is “tolerating failure” and as Performance Efficiency when the angle is “responding to traffic growth cost-efficiently”.

4) Performance Efficiency #

The ability to use resources efficiently to meet system requirements, and to maintain that efficiency as technology evolves.

Design principles #

Democratize advanced technologies — Use AWS services rather than building them yourself (for example, use Lambda instead of building your own container orchestration)
Go global in minutes — Deploy to regions close to users
Use serverless architectures — Hand the operational burden of infrastructure off to AWS
Experiment more often — Periodically reevaluate the right instance types

Matching services #

Service	Role
Amazon CloudFront	Edge caching to cut response time
ElastiCache	Caching (Redis/Memcached)
AWS Lambda	Serverless compute
EC2 instance type selection	Pick the family (M, C, R, T) that matches the workload

Exam question patterns #

“How do we deliver fast response times to users worldwide?” → CloudFront.

“Database response times are slow — how do we make frequently queried data faster?” → ElastiCache.

5) Cost Optimization #

The ability to run a system without unnecessary cost.

Design principles #

Cloud Financial Management — Monitor cost continuously
Adopt a consumption model — Only what you need
Measure overall efficiency — Business outcomes per dollar
Stop spending money on undifferentiated heavy lifting — Hand things like data center operations to AWS
Analyze and attribute expenditure — Tag for per-team or per-project billing

Matching services #

Service	Role
AWS Cost Explorer	Visualizes cost trends
AWS Budgets	Alerts when budgets are exceeded
AWS Trusted Advisor	Cost optimization checks
AWS Pricing Calculator	Upfront cost estimation
Cost and Usage Report (CUR)	Detailed billing data
Savings Plans / Reserved Instances	1- to 3-year commitment discounts
Spot Instances	Up to 90% off

Exam question patterns #

“I want to reduce costs in the dev environment — which pricing model fits best?” → Spot Instances (dev tolerates interruption).

“We’ll run EC2 steadily for 3 years — which model is cheapest?” → Reserved Instances or Savings Plans (3-year + All Upfront is the cheapest).

The pricing models in detail are covered in #8.

6) Sustainability #

The sixth pillar, added in December 2021. It focuses on the ability to minimize a workload’s environmental impact (energy use, carbon emissions).

Design principles #

Understand your impact — Quantify the environmental impact of your workload
Establish sustainability goals — Include environmental metrics in business KPIs
Maximize utilization — Reduce resource waste (one appropriately sized instance rather than many small ones)
Anticipate and adopt new, more efficient hardware — Migrate to newer EC2 families (Graviton and the like)
Use managed services — Leverage AWS’s efficient infrastructure
Reduce the downstream impact — Cut data transfer volume

Matching services #

Service	Role
AWS Customer Carbon Footprint Tool	Visualizes the workload’s carbon emissions
AWS Graviton instances	ARM-based, excellent performance per watt
Spot Instances	Use idle capacity
S3 Intelligent-Tiering	Automatic tier transitions for resource efficiency

Exam question patterns #

“How do we reduce carbon emissions from cloud usage?” → Migrate to Graviton instances or identify and clean up idle resources (Sustainability).

“How do we get the same performance from existing EC2 with fewer resources?” → Right-sizing (candidate for both Sustainability and Cost Optimization).

Tradeoffs among the six pillars #

The pillars often conflict with each other. The exam checks whether you’re aware of those tradeoffs.

Tradeoff	Example
Reliability ↔ Cost	Multi-AZ is pricier but more stable
Performance ↔ Cost	Bigger instances are faster but more expensive
Security ↔ Operational Excellence	Strong controls can slow deployment
Sustainability ↔ Performance	Smaller instances are better for the environment but can mean slower responses

When two answer choices both look right, narrow it down with the pillar the question emphasizes.

Well-Architected Tool #

The AWS Well-Architected Tool is a free service for evaluating your workload against the six pillars. You register a workload in the console, answer a questionnaire, and it returns improvement recommendations. On the exam this usually appears as a simple “what is the Well-Architected Tool for?” type question.

Lens — Well-Architected variants #

Well-Architected is for general workloads, but there are also variants (Lenses) for specific workload types.

Serverless Lens — Lambda-based workloads
SaaS Lens — Multi-tenant SaaS
Machine Learning Lens — ML workloads
Financial Services Lens — Financial services
IoT Lens — IoT workloads

Direct questions on Lenses are rare on the exam, but they sometimes appear as answer choices for “how do you apply Well-Architected to a specialized workload?”

Common traps #

1) Memorizing five pillars #

The most common mistake. There are six. Sustainability was added in December 2021. Older books and blogs still list five, so watch out.

2) Treating Reliability and Availability as the same thing #

Reliability is the ability of a system to do its intended job correctly + recovery from failure. Availability is a subset of that — the percentage of time the system is usable. On the exam, Availability is usually expressed with a number like 99.9%.

3) Confusing Performance and Reliability #

Remember the distinction from earlier:

Reliability = “still works when something fails?”
Performance Efficiency = “uses resources efficiently?”

4) The “Security is always first” trap #

Just because an option says “prioritize Security” doesn’t make it the right answer. You have to align with the scenario the question emphasizes. Picking Security as the answer on a cost question is wrong.

5) Ignoring Sustainability #

Many test-takers skip it because the weight looks low, but the exam almost always has one or two questions that ask directly about Sustainability. At minimum, memorize the six pillars and the Carbon Footprint Tool.

Wrap-up #

What this post locked in:

The Well-Architected Framework has six pillars — Operational Excellence / Security / Reliability / Performance Efficiency / Cost Optimization / Sustainability
Each pillar should be memorized as design principles + matching services + exam question patterns
Tradeoffs — strengthening one pillar can weaken another. The exam asks which pillar you prioritize
The Well-Architected Tool is a console service for evaluating your own workloads against the six pillars
Lenses are variants for specialized workloads (Serverless, SaaS, ML, and so on)
Traps — memorizing five pillars / confusing Reliability with Availability / confusing Performance with Reliability / the “Security first” answer trap / ignoring Sustainability

Next — Domain 2 Security #

Domain 1 is done. Up next is the biggest domain at 30% — Security and Compliance.

#4 Domain 2-1 Security — Shared Responsibility Model and IAM Basics covers where AWS responsibility ends and customer responsibility begins (which shifts by service model), the four IAM essentials (users, groups, roles, policies), MFA, and the root user operating guidance that often shows up as an exam trap.