Can an AI agent start as a junior team member, learn from a human expert through questions, and grow into full autonomy?
Inspired by the smol-ai/developer approach — what if instead of giving an agent full capabilities from day one, you started it as a junior staff member? A junior email marketer, for example — one that knows the basics but works alongside a human expert, asks questions when it's unsure, and gradually builds competence through real work. The agent learns not from training data, but from mentorship.
A mentorship model — the agent earns autonomy through demonstrated competence, not configuration.
Can an AI agent start as a junior team member, learn from a human expert through questions, and grow into full autonomy?
Inspired by the smol-ai/developer approach — what if instead of giving an agent full capabilities from day one, you started it as a junior staff member? A junior email marketer, for example — one that knows the basics but works alongside a human expert, asks questions when it’s unsure, and gradually builds competence through real work. The agent learns not from training data, but from mentorship.
A mentorship model — the agent earns autonomy through demonstrated competence, not configuration.
Agent uncertainty is a diagnostic tool — treat “what should I do about X?” moments as a brief audit, not just onboarding friction.
Batch questions at the end of a task, not mid-stream — asynchronous questioning preserves flow for both the agent and the human.
Persistent working memory (preferences, corrections, context) is the compound advantage — agents that remember past feedback outperform agents that start fresh each session.
The speed/accuracy trade-off suggests a “fast mentorship” phase — targeted questions before the first task could deliver apprenticeship-level accuracy without the slow ramp.
We’re exploring whether the apprenticeship model can be compressed — using a “fast mentorship” phase where the agent asks targeted questions before its first task, rather than learning through multiple rounds of work. The goal: apprenticeship-level accuracy with closer to instant-prompt speed.