Platform / DevOps / SRE Engineer

Hi, I'm Tony

I'm a platform, DevOps, and SRE engineer. I deploy and operate AI systems reliably in production — the infrastructure they run on, with the same reliability discipline you'd put behind any critical service.

That's a deliberate distinction. I'm not building AI features or training models; I'm the person who keeps agentic systems and LLM workloads running 24/7 — observable, cost-controlled, and dependable. Most of my two decades has been on AWS, with a homelab where I run the same patterns end to end: containerized services, full Grafana/Prometheus/Loki observability, and supply-chain security from build to signed artifact.

This site is where I keep the projects that show that work. Have a look around.

Work Experience

See all work

Sep2024 - Present

Zapcom Group

Principal Engineer
Client: Mass General Brigham · Remote
- Building an Internal Developer Portal (IDP) for Mass General Brigham — self-service provisioning of VMs and storage for clinical researchers, golden path architecture extensible to additional resource types.
- Boosted release velocity 60%+ by designing and standardizing GitLab CI pipelines and Terraform-based infrastructure automation with reusable modules and golden path templates.
- Architected migration from monolithic repository structure to independently deployable services, reducing deployment friction by 40%.
- Built AI-assisted AWS Security Group optimization tool using VPC flow logs to derive least-permissive rule sets, eliminating 90% of manual security operations overhead.
- Established cloud infrastructure standards and deployment golden paths adopted across product teams, improving consistency, onboarding speed, and operational confidence.
- Created Azure DevOps onboarding curriculum for engineering teams expanding into Azure.
Jun2022 - Aug2024

Marriott International

Cloud & DevOps Engineer (Contract)
Remote
- Contributed to CI/CD pipeline development and maintenance on AWS, supporting Marriott’s hospitality technology platform.
- Built and maintained automated file transfer workflows using Apache NiFi, improving reliability and throughput for data movement across the platform.
- Participated in architecture and planning discussions around Dynatrace APM adoption and observability strategy.

Recent projects

See all projects

Let's Connect

If you want to get in touch with me about something or just to say hi, reach out on social media or send me an email.