The Eval Harness Caught a Real Bug on Day One In my previous post, I built the agent and the safety framework around it which included contracts, guardrails, metrics. The agent ran. It returned structured JSON every time. It failed safely…
I'm a staff-level engineer building intelligent test systems at the intersection of QA and AI. Over 15+ years I've helped organizations such as Elastic, GitHub, and Mozilla ship with confidence by designing automation frameworks, quality toolchains, and CI/CD pipelines that scale with distributed systems.
Outside of QA, my experience spans into data engineering with a focus on building data pipelines and data quality, creating large-scale data warehouses and data integrations with tools like Airflow, dbt, and Snowflake.
I have a passion for open source and have contributed to projects across testing, data engineering, and AI. This includes tools like Playwright, Airflow, and various LLM-based tools. Today I focus on applying LLMs to quality engineering: intelligent test generation, defect prediction, and AI-assisted code review as well as AI Agent quality and resilience. When I'm not writing tests or building pipelines, you'll find me in the gym or adding to my cuff link collection.
Scalable test frameworks for distributed systems — from unit to end-to-end, built to survive organizational growth.
Applying LLMs to test generation, intelligent test selection, and defect prediction to make quality proactive.
Pipeline optimization, shift-left testing strategies, and tooling that keeps teams shipping fast with confidence.
Data pipelines, telemetry validation, and analytics infrastructure that make system behavior legible.
Elastic
Mozilla
Astronomer
NorthOne
Paylocity
Reef.ai
Pentaho
Wolfram Research
UpdaterThe Eval Harness Caught a Real Bug on Day One In my previous post, I built the agent and the safety framework around it which included contracts, guardrails, metrics. The agent ran. It returned structured JSON every time. It failed safely…
I Built an AI Agent. Then I Built a Framework to Make Sure It Actually Works. There’s no shortage of tutorials showing you how to wire up an LLM and call it an AI agent. Most of them stop at “look, it returned something.” I wanted to take…
This is everyone’s story, right? 1.) Graduated valedictorian from an A Ranked School and being captain of the Programming Club at 17 years of age. 2.) Graduated Summa Cum Laude from MIT with a Bachelors in Computer Science, minor in…
I have been wanting to build an app for a while now. As a test automation engineer, I get to work with a lot of really cool tech and web apps but as a former web developer, I find myself needing to scratch the itch to build a web…
With the whole host of new Angular goodness making its way to the mainstream, I asked myself whether it was worth it to learn the entire new Angular2 workflow and set of tools for my small site. For this project, the answer is no but I did…
“ I am not a tester. ”, “ QA will figure it out. ”, “ Works for me. Should be good in production. ”: These words are uttered in development organizations across the globe. I can’t lie. I even uttered it once during a production push as a…