
Email A/B testing

Growth Flywheel


The hypothesis

Can an AI agent design, deploy, and iterate on email variants autonomously — without human oversight?


The concept

Traditional A/B testing is slow and manual: pick two subject lines, split the list, wait three days, pick a winner. This experiment hands that whole loop to an AI agent, which designs, deploys, and iterates on email variants on its own, testing subject lines, body copy, send times, and CTAs simultaneously across hundreds of micro-segments.


How it works

  1. Campaign brief
  2. Generate variants — 12–20 variants: subject, body, CTA, send time
  3. Micro-segment allocation — 50–200 person test groups
  4. Deploy and wait — 4–8 hour observation window
  5. Score and prune — kill losers, promote winners
  6. Roll out the best variant to the full list
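
The steps above can be sketched as a small loop. This is a minimal illustration under stated assumptions, not the agent's actual implementation: the `Variant` fields, the halving rule in `score_and_prune`, the click-rate score, and the `observe` callback (a stand-in for sending the emails and waiting out the observation window) are all hypothetical names chosen for the example.

```python
import random
from dataclasses import dataclass


@dataclass
class Variant:
    """One email variant; the fields mirror the dimensions listed in step 2."""
    name: str
    subject: str
    body: str
    cta: str
    send_hour: int  # hour of day, 0-23


def allocate_segments(recipients, variants, group_size, rng):
    """Give each variant its own disjoint random test group of `group_size` recipients."""
    needed = group_size * len(variants)
    if needed > len(recipients):
        raise ValueError("not enough recipients for disjoint test groups")
    sample = rng.sample(recipients, needed)
    return {
        v.name: sample[i * group_size:(i + 1) * group_size]
        for i, v in enumerate(variants)
    }


def score_and_prune(variants, clicks, sends, keep_fraction=0.5):
    """Rank variants by click rate and keep the top fraction (at least one)."""
    ranked = sorted(variants, key=lambda v: clicks[v.name] / sends[v.name], reverse=True)
    keep = max(1, int(len(ranked) * keep_fraction))
    return ranked[:keep]


def run_cycles(recipients, variants, observe, group_size=50, seed=0):
    """Repeat allocate -> observe -> prune until a single variant remains."""
    rng = random.Random(seed)
    survivors = list(variants)
    while len(survivors) > 1:
        groups = allocate_segments(recipients, survivors, group_size, rng)
        sends = {name: len(group) for name, group in groups.items()}
        # `observe` is assumed to return the click count for one test group.
        clicks = {name: observe(name, group) for name, group in groups.items()}
        survivors = score_and_prune(survivors, clicks, sends)
    return survivors[0]
```

Starting from 12 variants, halving leaves 6, then 3, then 1, so the winner emerges after three cycles. Groups at the small end of the 50–200 range give noisy click rates, so a real scorer would likely need a confidence check before pruning.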

The agent completes multiple test cycles in the time it takes a human to set up one A/B test.
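
For a rough sense of scale, the figures quoted on this page can be combined in a few lines of arithmetic. This is a back-of-the-envelope sketch, assuming that cycles run back to back with no generation or setup overhead and that each variant gets exactly one test group; neither assumption is stated in the experiment itself.

```python
# Figures taken from the text above.
WINDOW_HOURS = (4, 8)            # observation window per cycle
VARIANTS = (12, 20)              # variants generated per campaign
GROUP_SIZE = (50, 200)           # recipients per micro-segment
TRADITIONAL_WAIT_HOURS = 3 * 24  # the "wait three days" of a manual test


def cycles_in(hours, window_hours):
    """Whole observation windows that fit into `hours`, back to back."""
    return hours // window_hours


def recipients_per_cycle(n_variants, group_size):
    """Recipients consumed when every variant gets its own test group (assumption)."""
    return n_variants * group_size


fewest_cycles = cycles_in(TRADITIONAL_WAIT_HOURS, max(WINDOW_HOURS))
most_cycles = cycles_in(TRADITIONAL_WAIT_HOURS, min(WINDOW_HOURS))
smallest_cycle = recipients_per_cycle(min(VARIANTS), min(GROUP_SIZE))
largest_cycle = recipients_per_cycle(max(VARIANTS), max(GROUP_SIZE))

print(f"cycles per three-day wait: {fewest_cycles}-{most_cycles}")
print(f"recipients in the first cycle: {smallest_cycle}-{largest_cycle}")
```

Under those assumptions, 9 to 18 observation windows fit into the three-day wait of a single manual test, and the first cycle touches between 600 and 4,000 recipients before any variant is pruned.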


What it explores


What we found


Learnings


Where it goes next

This experiment directly fed into Flywheel’s email campaign agent. The micro-segmentation and autonomous test loop architecture is now a core feature of the product.
