alan chester.

∀ ε > 0, ∃ δ > 0

researcher. scholar. engineer.

notes from the bench.

About

I work at the intersection of cloud infrastructure, agentic AI systems, reinforcement learning, and multi-agent trust. Fifteen years building production systems. Now turning that engineering lens onto the research bench.

The current empirical work: in long-horizon reinforcement learning, do symbolic observations beat pixel observations once encoder capacity is properly controlled? Pokémon Red is the environment, RecurrentPPO via SB3-Contrib the method.

More about me →

Research

the current questions

On reinforcement learning.

How does observation representation affect sample efficiency in long-horizon RL?

Pixel vs. symbolic vs. hybrid observations under capacity-matched encoders.

On team architecture.

How does the unit of work change when teams include autonomous agents?

What structures support variable-elasticity teams.

On multi-agent security.

How does trust topology affect security in multi-agent LLM systems?

Centralized orchestrator vs. peer-authenticated trust.

Writing

All writing →

ESSAY May 2026 ≈ 2 min

Why AI Is Shockingly Expensive to Run

The economics of inference, explained without euphemism.

Projects

All projects →

pokemon-red-ai ACTIVE

PythonRLGymnasium

Reinforcement learning research on observation representations in Pokémon Red. Pixel, symbolic, and hybrid conditions under capacity-matched encoders. RecurrentPPO via SB3-Contrib.

github

claude-teams-operator ACTIVE

KubernetesGoAI

Kubernetes operator that runs Claude Code agent teams as distributed pods.

github

ea-agent ACTIVE

PythonAIObsidian

AI personal executive assistant built around Obsidian.

github