Open to work · Logan, Utah

Reliable systems, quietly built.

I'm Forrest — a platform engineer keeping life-critical telemetry online with Kubernetes, Kafka, and a healthy obsession with what happens at 3 a.m.

DB migrated

173 GB

Cross-account AWS migration, end-to-end.

Data lost

0 B

DNS cutover, infra reconfig, zero downtime tolerated.

Years shipping

QA → DevOps → Platform at Campbell Scientific.

Kids at home

5/5

The reason the rest of this matters.

Kubernetes Kafka Knative PostgreSQL CloudNativePG Redis Keycloak AWS · EKS Cilium ArgoCD Terraform Helm TimescaleDB OAuth / OIDC

01 — About

Engineering, with the lights on.

A short version of who I am and how I think about the work.

I build and operate distributed systems where reliability actually matters.

I'm a Platform Engineer working on a multi-cluster, event-driven system processing life-critical telemetry. I spend most of my time in Kubernetes-based environments designing backend systems, debugging production issues, and making complex infrastructure predictable.

I lean heavily into AI-augmented engineering — agent workflows and coding harnesses that help me move faster through large codebases and ship well-tested systems.

Outside of all that, I run a small concessions business, coach my son's kindergarten soccer team, and try to be a good husband and dad. The work matters. The rest matters more.

02 — Work

Where I've been on call.

Most recent first. Three roles under one roof, plus a long-running side business.

Campbell Scientific

4 yrs 7 mos · current

Logan, UT · Hybrid

Platform Engineer

Nov 2024 — Present

Own backend infrastructure for a multi-cluster, event-driven telemetry platform on AWS EKS, supporting life-critical data systems.
Design and operate distributed services across Kubernetes, Kafka, Knative, and Keycloak — responsible for availability, latency, and reliability.
Debug complex production issues across networking, databases, and event pipelines, including Cilium clustermesh, PostgreSQL (CloudNativePG), and Knative eventing.
Led a cross-account AWS migration end-to-end: ~173 GB database recovery, DNS cutover, and infrastructure reconfiguration with zero data loss.
Built a disaster recovery strategy covering RPO/RTO targets, backup systems, and failure modes for stateful services.
Resolved GitOps bootstrap dependencies by introducing Terraform-based provisioning for reproducible infrastructure.
Leverage AI-augmented engineering workflows to navigate large codebases, accelerate development, and ship reliable systems.

Kubernetes AWS Terraform Kafka Knative

DevOps Engineer

Nov 2022 — Oct 2024 · 2 yrs

Supported production platform across Kubernetes-based infrastructure, debugging issues across networking, PostgreSQL, Redis, and microservices.
Built and maintained CI/CD pipelines and Helm-based deployment workflows for multi-region environments.
Improved observability and incident response processes, working closely with engineering and product teams to resolve production issues.

PostgreSQL Helm CI/CD Observability

QA Engineer

Nov 2021 — Oct 2022 · 1 yr

Designed and executed test plans for backend systems, validating PostgreSQL, Redis, and microservice integrations.
Collaborated with engineering teams to identify defects and ensure production readiness of distributed systems.

Software Testing Backend Systems

Olson Family Enterprises

9+ yrs · self-employed

Logan, UT

Owner

Feb 2017 — Present

A small family concessions business — a long-running side project that keeps me sharp on the non-software side of running things: logistics, hiring, customers, cash. Three brands under one roof:

ofe.llc ↗ sweetclouds.cc ↗ poppinjackskettlecorn.com ↗

EscrowTech International

1 yr 6 mos · part-time

Lehi, UT

Technician

Jan 2020 — Jun 2021

Helped manage IT day-to-day, handled account deposits, and developed an internal web application.

03 — Selected projects

A few things I've shipped or own.

Real systems in production. Numbers are approximate; the lessons are not.

P-01

2025

Migration · Stateful systems

Cross-account AWS migration, zero data loss.

Moved a production telemetry platform between AWS accounts end-to-end: ~173 GB database recovery, DNS cutover, and a full infrastructure reconfiguration. Planned every failure mode, ran the cutover, watched the dashboards stay green.

173 GB recovered

0 B data lost

1 window, no rollback

AWS PostgreSQL CloudNativePG DNS Runbooks

P-02

2025

Reliability · Disaster recovery

A DR strategy that actually gets tested.

Designed and implemented the disaster recovery plan for stateful services on the platform — RPO/RTO targets, backup pipelines, restore drills, and an honest catalogue of failure modes for each service. Built to be exercised, not filed in a wiki.

< 15 min RTO target

≤ 5 min RPO target

monthly restore drills

Postgres Velero S3 Runbooks SLOs
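
For illustration only, here is a minimal sketch of how one restore drill could be scored against the RTO and RPO targets listed above. It is not the platform's actual tooling: the `DrillResult` shape, the `evaluate` function, and the sample timestamps are all hypothetical, and only the two target values come from this page.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

# Targets taken from the figures on this page; everything else is hypothetical.
RTO_TARGET = timedelta(minutes=15)  # service must be restored in under 15 min
RPO_TARGET = timedelta(minutes=5)   # at most 5 min of data may be lost


@dataclass
class DrillResult:
    """Timestamps recorded during one restore drill (assumed shape)."""
    failure_at: datetime      # when the simulated failure happened
    last_backup_at: datetime  # newest recoverable point before the failure
    restored_at: datetime     # when the service was healthy again


def evaluate(drill: DrillResult) -> dict:
    """Compare the measured recovery time and data-loss window to the targets."""
    rto = drill.restored_at - drill.failure_at
    rpo = drill.failure_at - drill.last_backup_at
    return {
        "rto": rto,
        "rpo": rpo,
        "rto_ok": rto < RTO_TARGET,   # strict bound: "< 15 min"
        "rpo_ok": rpo <= RPO_TARGET,  # inclusive bound: "≤ 5 min"
    }


if __name__ == "__main__":
    # Made-up drill: failure at 03:00, last backup 4 min earlier, restored 12 min later.
    failure = datetime(2025, 1, 1, 3, 0)
    drill = DrillResult(
        failure_at=failure,
        last_backup_at=failure - timedelta(minutes=4),
        restored_at=failure + timedelta(minutes=12),
    )
    print(evaluate(drill))
```

Recording the three timestamps per drill keeps the check independent of the backup tool, so the same scoring would apply whichever service is being restored.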

P-03

2024

GitOps · IaC

Breaking the GitOps bootstrap loop.

ArgoCD wants a cluster; the cluster wants ArgoCD. Introduced Terraform-based provisioning for the underlying infrastructure so new environments can be brought up reproducibly without hand-tending — turning a chicken-and-egg dance into a single command.

hours → min env spin-up

repeatable across regions

declarative end-to-end

Terraform ArgoCD Helm EKS

P-04

ongoing

AI · Engineering workflow

AI-augmented engineering, for real codebases.

A personal practice more than a single project: agent workflows and coding harnesses tuned for navigating and modifying large infrastructure codebases. Not demos — daily-driver tooling for actual work. Always iterating.

agents harnesses evals codebase nav

04 — Off the clock