← Back to Sandbox

Junior agent apprentice

Agent Learning Flywheel

The Hypothesis

Can an AI agent start as a junior team member, learn from a human expert through questions, and grow into full autonomy?

The Concept

Inspired by the smol-ai/developer approach — what if instead of giving an agent full capabilities from day one, you started it as a junior staff member? A junior email marketer, for example — one that knows the basics but works alongside a human expert, asks questions when it's unsure, and gradually builds competence through real work. The agent learns not from training data, but from mentorship.

The Flow.
Assign junior role
agent starts with limited scope (e.g. draft subject lines)
Agent works and flags uncertainty
completes tasks, asks the human when unsure
Human reviews and answers
expert corrects, explains, sets guardrails
Agent absorbs context
builds a working memory of preferences, patterns, decisions
Expand scope gradually
agent takes on more as confidence and accuracy grow

A mentorship model — the agent earns autonomy through demonstrated competence, not configuration.

Junior agent apprentice

The hypothesis

Can an AI agent start as a junior team member, learn from a human expert through questions, and grow into full autonomy?


The concept

Inspired by the smol-ai/developer approach — what if instead of giving an agent full capabilities from day one, you started it as a junior staff member? A junior email marketer, for example — one that knows the basics but works alongside a human expert, asks questions when it’s unsure, and gradually builds competence through real work. The agent learns not from training data, but from mentorship.


How it works

  1. Assign junior role — agent starts with limited scope (e.g. draft subject lines)
  2. Agent works and flags uncertainty — completes tasks, asks the human when unsure
  3. Human reviews and answers — expert corrects, explains, sets guardrails
  4. Agent absorbs context — builds a working memory of preferences, patterns, decisions
  5. Expand scope gradually — agent takes on more as confidence and accuracy grow

A mentorship model — the agent earns autonomy through demonstrated competence, not configuration.


What it explores


What we found


Learnings


Where it goes next

We’re exploring whether the apprenticeship model can be compressed — using a “fast mentorship” phase where the agent asks targeted questions before its first task, rather than learning through multiple rounds of work. The goal: apprenticeship-level accuracy with closer to instant-prompt speed.

Want early access?
Some of these become products.

Innovation and frustration start in the sandbox. Tell us about your what-ifs and let's test something.

Start a conversation