Versioning + lifecycle rules (protect against deletes and ransomware scenarios).
CRR/SRR for replication needs (cross-region or same-region).
Object Lock (WORM) for immutability requirements.
Backups (fast picks)
EBS snapshots are incremental; copy snapshots cross-region for DR.
AWS Backup helps centralize backup policies across common services.
For databases, prefer managed backup features (RDS automated backups/snapshots).
5) Databases and caching — RDS/Aurora vs DynamoDB
RDS: Multi-AZ vs read replicas (classic SAA)
Feature
Multi-AZ
Read replica
Primary purpose
HA/failover
Read scaling
Writes
One primary
Still one primary
Failover
Automatic
Manual promotion (generally)
Rule: Multi-AZ is about availability; read replicas are about scale.
Aurora (why it’s often a “best answer”)
Higher throughput than standard RDS engines (common exam framing).
Multiple read replicas for read scaling (and faster reads in the same region).
Aurora Global Database for low-latency global reads and faster cross-region DR.
When to choose what (fast)
Need
Best-fit
Relational + joins + transactions
RDS/Aurora
Massive key-value scale
DynamoDB
Sub-millisecond cache
ElastiCache
DynamoDB read cache
DAX
ElastiCache: Redis vs Memcached (SAA-level)
Service
Best for
Notes
Redis
Rich features + durability options
Replication, multi-AZ patterns, data structures
Memcached
Simple cache
Very simple, no persistence
DynamoDB: what wins questions
Prefer Query over Scan.
Choose partition keys to avoid hot partitions.
Use GSIs for new access patterns.
Use On‑Demand for spiky traffic; Provisioned + Auto Scaling for steady predictable workloads.
6) Resilience and DR — RTO/RPO patterns
HA patterns (default architecture)
Multi-AZ for app tiers (ALB + ASG across AZs).
Databases: Multi-AZ where required (RDS/Aurora).
Use queues/caching to absorb spikes and failures.
DR strategies (know the table)
Strategy
Typical RTO
Typical RPO
Cost
Notes
Backup/Restore
High
Hours
Low
Cheapest; slowest recovery
Pilot Light
Medium
Minutes–hours
Med
Minimal core in DR
Warm Standby
Low
Minutes
Med+
Scaled-down prod running
Multi-site active-active
Very low
Seconds
High
Complex; highest cost
Multi-Region data options (high yield)
Data layer
Multi-Region option
S3
CRR
DynamoDB
Global tables
Aurora
Aurora Global Database
DR routing: Route 53 policy selection
Routing policy
Best for
Failover
Active-passive DR
Weighted
Canary / migrations
Latency
Lowest latency routing per user
Geolocation
Compliance/content by country
Exam cue: If you need faster failover with static anycast IPs, consider Global Accelerator. If you need caching + origin failover for HTTP(S), consider CloudFront.
Active-passive vs active-active (quick framing)
Pattern
Pros
Cons
Active-passive
Cheaper; simpler operations
Higher RTO; failover/failback steps
Active-active
Lowest downtime; global performance
Most complex; highest cost
DR sketch (active-passive)
flowchart LR
Users --> R53[Route 53]
R53 -->|Primary| A[Region A]
R53 -->|Failover| B[Region B]
A --> AppA[App + DB]
B --> AppB[Warm standby]
7) Observability and operations — what each tool answers
Service
Think “this answers…”
CloudWatch
“How is it performing?” (metrics/logs/alarms)
CloudTrail
“Who did what?” (API audit trail)
Config
“What changed?” (config history + compliance)
X-Ray
“Where is latency?” (distributed traces)
Exam cue: if the requirement is auditing and investigations, CloudTrail is usually the anchor.
11) Create primary A/AAAA alias to ALB in Region A with Health Check.
22) Create secondary A/AAAA alias to ALB in Region B with Health Check.
33) Set routing policy to Failover: Primary / Secondary.
44) Verify health checks and simulate failover.
Final tip
If multiple answers work, pick the one that best matches the explicit constraint (for example: lowest cost or least operational effort) while still meeting availability and security requirements.