Discussion about this post

User's avatar
Amarda Shehu's avatar

Mark, the chasm is real, and from an institutional vantage I would only sharpen one part of it. The chasm is growing fastest inside the organizations that most need to interpret what is happening. University leaders, and I say this as one of them, are making governance decisions calibrated to a version of the technology they encountered two model generations ago (if that!). The lag is now long enough to constitute a structural risk in its own right.

Your deterministic versus non-deterministic question is where I may push back a little. The task categories where agents have already crossed into knowledge work, LMS submissions, legal document review, sales outreach, customer support triage, share a condition that is easy to miss in the story about capability. Each had been templated, quietly, over years of interface design and workflow compression, to the point where an agent could/can now complete it because the deterministic-enough signal was already there. The template preceded the agent. What the automation did was just make visible a prior condition.

This has an implication for your prediction about the next year. If the pattern truly holds, the tasks most immediately vulnerable to agentic automation are the ones we had already rendered measurable and repeatable for reasons unrelated to AI, often to make them easier to manage, scale, or grade. What remains, and what now becomes precious, is the layer of knowledge work that resisted templating: situated judgment, institutional memory, the reading of texture that Polanyi called tacit and that most organizations discover they need only once the templated layer is gone. I have written about one version of this.

Oh, and on Mythos, a small note. It is plausible and indeed widely reported that Anthropic uses its own models heavily in internal engineering. Whether that constitutes recursive self-improvement (RSI) in the technical sense is a separate question, and Anthropic's own public position on Mythos Preview, as of earlier this month, is that they are less confident than they used to be that junior-researcher work is safe from automation but that the answer to the direct RSI question is still probably not. The difference between 'yes' and 'less confident than before' might change how one should read the tempo of what comes next.

Thank you for writing this. The chasm framing is one I had hoped to write next, as I see what my students can do with lower-tier models versus the $200/month ones. Interesting times ahead all in all.

Tom Scheinfeldt's avatar

Thanks, Mark. It will be incredibly useful to have this post in my pocket when explaining these changes to normie colleagues.

14 more comments...

No posts

Ready for more?