Principal Resilience Architect | Former AWS Global Reliability Lead | Cloud Architect
I help organizations build resilient, scalable cloud platforms and recover confidently when things go wrong.
I currently serve as a Principal Resilience Architect at Arpio, where I work with customers to design and automate disaster recovery solutions for modern cloud workloads running on AWS.
Previously, I spent more than five years at AWS (and 14 total at Amazon) helping customers improve reliability, resilience, and operational excellence. As Global Reliability Lead for AWS Well-Architected, I worked directly with customers and scaled that knowledge globally through workshops, presentations, whitepapers, and technical guidance.
Before AWS, I worked at Amazon.com as a Principal Engineer supporting large-scale systems including Amazon Fresh and Prime Video. Earlier in my career, I worked at Microsoft helping engineering teams modernize software and transition to the cloud.
What I Focus On
- Cloud resilience and disaster recovery
- AWS architecture and modernization
- Kubernetes and Amazon EKS
- Platform engineering and DevOps
- Reliability engineering and operational excellence
- Multi-region and cross-account recovery strategies
- Infrastructure as Code and automation
Speaking & Writing
I regularly speak about cloud resilience, disaster recovery, DevOps, and platform engineering.
Topics I frequently cover include:
- Disaster Recovery for DevOps Teams
- Beyond Backups: Building a Cloud Disaster Recovery Strategy That Works
- Cloud-native resilience patterns on AWS
- Operational excellence and reliability culture
- Automating recovery and reducing operational toil
I am the author of the AWS Architecture Blog disaster recovery series and author of the AWS whitepaper Disaster Recovery of Workloads on AWS: Recovery in the Cloud.
For nore of my work (blogs, podcasts, presentations), see my LinkTree here.
Selected Experience
Arpio
Principal Resilience Architect
Helping organizations automate and operationalize disaster recovery for AWS workloads, including Kubernetes, Amazon EKS, databases, infrastructure, networking, and security dependencies.
Amazon Web Services (AWS)
Global Reliability Lead – AWS Well-Architected
Worked one-on-one with customers to improve resilience, reliability, and operational excellence practices. Created and delivered technical workshops, architecture guidance, and resilience-focused content at global scale.
Amazon.com
Principal Engineer
Led and supported large-scale engineering initiatives across Amazon Fresh and International Technologies. Previously helped create Prime Video.
Microsoft
Technical Evangelist / Developer Advocate
Helped engineering organizations modernize applications and adopt cloud-native architectures.
Philosophy
Reliable systems are not created through backups alone.
Modern resilience requires:
- automation
- repeatability
- testing
- visibility
- operational simplicity
- recovery of complete workloads, not just data
I believe resilience should be built into the engineering platform itself and made accessible to development teams through self-service tooling and standardized patterns.
Connect
- LinkedIn: https://www.linkedin.com/in/setheliot/
- GitHub: https://github.com/setheliot
- AWS Disaster Recovery Whitepaper: http://bit.ly/DR_AWS
- AWS Disaster Recovery Blog Series: http://bit.ly/aws-dr-blog
For speaking engagements, consulting inquiries, podcasts, or collaboration opportunities, please reach out via LinkedIn or make an appointment with me here.