Before AI acts, decide where it stops.

This week’s AI news was full of more capable agents. The useful question is no longer whether they can do the work. It is which parts of the work they should never own.

This week’s signals

Define exclusions first

Separate help from authority

Keep judgment practiced

Make every action traceable

The story I kept coming back to this week was not a single model release. GPT-5.6 Sol arrived under a restricted preview. Claude moved into Slack as a teammate you can mention. Gemini gained computer use. Codex is getting closer to everyday desktop work. The common thread is simple: AI is moving from producing text to taking part in work.

That changes the leadership problem. A chatbot can be wrong and the error often stays inside the conversation. An agent can click, buy, send, file, schedule, summarize, escalate, and continue a task after the first prompt is forgotten. Once that happens, intelligence is only half the question. The other half is permission.

Most organizations still begin with the attractive question: what can we automate? It produces ideas quickly, and ideas feel like progress. But it skips the harder work. Before a team lists automation candidates, it should list exclusions. Where should the system stop, even if it could technically continue?

A mature AI strategy begins with the work you refuse to automate.

Some tasks are not protected by accuracy alone. Medical triage, investment advice, complaint handling, claim denial, hiring decisions, sensitive feedback, conflict situations. The issue is not only whether the model can produce a plausible answer. The issue is whether the moment requires human presence, responsibility, or moral judgment.

A customer may welcome automation for a shipping update. The same customer may resent it when an urgent exception is handled by a system that cannot understand why the moment matters. Same interface, different emotional weight. Good AI transformation sees that difference before rollout, not after damage control.

Claude in Slack shows why this is becoming practical rather than theoretical. The assistant sits where team memory lives. It reads threads, gathers context, follows up, and answers in public. That can save time. It also forces a decision: when is the AI drafting, when is it recommending, when is it acting, and who remains accountable?

The more technical news pointed in the same direction. DeepMind described agent control with the language of security: monitoring, permissions, coverage, recall, response time. Palantir framed governance as bounded execution, observability, fail-safes, and authorization. None of this sounds glamorous. That is exactly why it matters.

There is also a quieter human risk. As agents improve, people may slowly become approval buttons. The system frames the problem, selects the path, drafts the answer, prepares the action, and the human clicks yes because real review takes effort. Judgment does not disappear all at once. It gets outsourced in small, convenient steps.

The practical work starts with one real process. Break it into tasks. Mark what AI may do alone, what AI may assist with, and what a human must own. Then add approval points, logs, handoffs, failure modes, and review rituals. If that feels slow, consider the alternative: discovering your boundaries through customer pain.

The companies that look serious in two years will probably not be the ones with the longest agent catalogue. They will be the ones where people know what each system is allowed to do, where it must stop, and what happens when it fails. Fewer mystery workflows. Fewer heroic cleanups. Fewer meetings where someone asks who approved the thing that already happened.

Human-AI collaboration becomes real when permission becomes explicit. Until then, many organizations are not redesigning work. They are just giving software more reach.