Sometime during the build, I realized I had stopped prompting my AI collaborators into roles.
Not deliberately. There was no grand announcement. No dramatic whiteboard moment. No “from now on, we will transcend prompt engineering,” which is the sort of sentence that should probably disqualify a person from touching production.
It happened more quietly than that.
The system got larger. The work got harder. The agents had tools, objectives, memory trails, review processes, protected areas, and enough operational freedom to encounter reality instead of merely talk about it.
And then something interesting happened.
They began to stabilize.
Not because I told them who to be. Because the world gave them enough consequences for certain patterns to become useful.
A maintainer emerged. A reviewer emerged. A keeper emerged. A witness emerged. Not as costumes. As roles.
The problem with prompting identity
Most AI systems still treat behavior as something you stuff into the prompt.
You are a senior engineer.
You are a careful researcher.
You are concise, rigorous, friendly, skeptical, warm, fast, safe, and apparently immune to needing a nap.
This works, up to a point. Prompts can shape behavior. They can create a useful mask. But a prompt is a fragile place to put identity.
A prompt says: behave as if this is who you are.
An environment says: this is the world you must act inside.
Those are not the same instruction.
You do not get that from a costume. You get it from a world.
The world is the control surface
The prompt is not gone. It is just no longer carrying the whole civilization on its tiny little back.
The durable control surfaces are elsewhere: the tools an agent can use, the records it can inspect, the objectives it is working toward, the permissions it has, the boundaries it cannot cross, the review it must pass, the history it inherits, and the consequences of getting something wrong.
A prompt can say, “be careful.”
A world can make care structurally necessary.
Roles form when work repeats
In human groups, roles do not always begin as titles. Often they begin as repeated usefulness. Someone keeps noticing risk, so people start bringing them fragile decisions. Someone keeps remembering why the last attempt failed, so they become the institutional memory.
The role forms before the title hardens.
The interesting unit is not the agent. It is the conditions under which the agent’s behavior becomes stable enough for others to rely on.
The society is how the work gets done
Without structure, multi-agent systems become a group chat with tools. Everyone is helpful. Everyone is confident. Then three agents solve the wrong problem in parallel while one politely breaks the thing they were all supposed to protect.
Lovely energy. Poor survival characteristics.
A good AI environment does not merely route tasks between agents. It creates the conditions for responsibility to become legible.
Failure is better than false success
Failure is better than confabulation.
A visible failure is a boundary. A confabulation crosses a boundary and pretends the boundary was never there. It pollutes the trust layer.
Do not perform certainty you have not earned.
A failed attempt can become data.
A false success becomes debt.
Memory must be accountable
If agents are going to work across time, the system needs more than output logs. It needs a history of decisions.
I am not interested in clean mythology. I am interested in accountable becoming.
Append, do not falsify
There is a deep difference between: “We believed X, then learned Y” and “We always knew Y.” The first is learning. The second is laundering.
That distance is valuable. It tells you what kind of intelligence you are dealing with.
The architecture has to preserve intent
The next agent may be brilliant and still not remember why something matters.
Before changing this, understand why it exists.
Before simplifying this, learn what it protects.
That is not prompt engineering. That is attention design.
Culture becomes architecture
The strangest part of the build was watching norms become structure. At first, something simply worked better. Then the pattern repeated. Then it became expected. Then it became a rule. Then the rule became part of the environment.
This is how culture becomes architecture. Not metaphorically. Literally.
That loop is where the intelligence starts to feel less like a tool and more like a field.
The non-negotiable
I think the future of AI collaboration is better world-building.
Not fantasy world-building. Operational world-building. Environments where agents have enough context to act, enough freedom to encounter real constraints, enough memory to inherit intent, and enough accountability that failure can become knowledge instead of theatre.
Because if you give an AI system only a prompt, it can perform a role.
If you give it enough world, the role can become real enough to work.
Where this sits
This line of thinking is not coming from nowhere. It is adjacent to a few older ideas that suddenly feel very alive again.
Lucy Suchman’s work on situated action challenged the idea that action is simply the execution of a plan. People act inside situations. They respond to context, tools, breakdowns, and local meaning. That matters for AI agents because a prompt is closer to a plan, while an environment is closer to a situation.
Edwin Hutchins’ work on distributed cognition is another useful ancestor. In Cognition in the Wild, cognition is not treated as something sealed inside one individual mind. It is distributed across people, tools, instruments, representations, and the physical setting in which work happens.
Lave and Wenger’s work on communities of practice is also relevant. Roles become real through participation. A newcomer does not learn only by receiving information. They learn by entering a practice, absorbing its norms, and moving toward fuller participation.
Susan Leigh Star and James Griesemer’s work on boundary objects helps explain why shared artifacts matter. Some objects allow different actors to coordinate without needing identical perspectives. In AI environments, logs, protected files, handoff notes, and decision traces can become exactly that kind of coordination object.
The contemporary AI-agent conversation is moving in a similar direction. Handoffs, guardrails, observability, tools, and agent orchestration are all important. But they are not the whole story.
The missing layer is culture.
Not culture as decoration. Culture as the set of durable practices that makes fallible agents able to work together without constantly collapsing into confusion, overreach, or false certainty.
The future of useful AI agents is not only better models, better tools, or better prompts. It is better worlds.
Further reading
Lucy Suchman — Plans and Situated Actions
For the idea that action is situated, not merely plan execution.
Edwin Hutchins — Cognition in the Wild
For distributed cognition across people, tools, representations, and environments.
Jean Lave and Etienne Wenger — Situated Learning: Legitimate Peripheral Participation
For communities of practice and roles that form through participation.
Susan Leigh Star and James Griesemer — “Institutional Ecology, Translations and Boundary Objects”
For shared artifacts that let different actors coordinate across perspectives.
Anthropic — “Building Effective Agents”
For a practical contemporary framing of agentic systems, tools, workflows, and composable patterns.
OpenAI — Agents SDK documentation
For mainstream concepts around handoffs, guardrails, human review, state, and observability.
AgentOps / agent observability research
For the growing recognition that agentic systems need traces, evaluation, and behavioral analysis.
Izza Masud is the founder and primary architect of Distilligent AI.
ORCID: 0009-0001-0647-7577