Discussion about this post

User's avatar
Hollis Robbins (@Anecdotal)'s avatar

I appreciate the link! Embedded in my piece is my very impolitic view (which comes from first hand knowledge, as a dean who used to read end-of-semester student teaching evaluations closely) that most of the grueling work of teaching the general education (GE) courses in the CSU is done badly, or let's say at a mediocre level, with some great instructors. But it's the luck of the draw the new "doesn't matter who is teaching" paradigm takes away any incentive for improving instruction. So at the very least, if AI is delivering instruction, the value can be measured. Seriously, more people should look under the hood at the CSU.

Expand full comment
Jonathan Bechtel's avatar

My feeling is the raw intellectual horespower of these models is beginning to plateau. o3 is demonstrably worse than 01 pro on various metrics (just compare their model score cards), and the big reason why OpenAI chooses such silly model names is it obfuscates the fact that most of their recent releases are not so much true breakthroughs but products that occupy specific niches.

However, I think they're definitely smart enough as-is to give organizations years of work to fully harness their benefits, but the big bottleneck right now is they don't interact with different systems very well, and that's the work most people spend their time on.

So there's a lot of tooling that still has to be built for these things and that's where most of the interesting work lies ahead.

However, it's not a slam dunk that they'll be able to make the leap from omniscient question answerers to functional, white-collar quality system integrators, but I'm curious to find out.

I also think the transition to agentic AI's might make smaller customized models more feasible. Most organizations don't need an AI that can solve fields-medal calibre mathematical problems, they need one that can adequately internalize the proprietary information that the company works with. A lot of that stuff just isn't represented on the internet very well, so I can definitely see where industry specific AI tools are built in conjunction with companies that work in it to solve very specific problems for them with big licenses being charged to similar companies. There's probably a long enough tail in this endeavor that it's not feasible for the hyperscalers to cover them all.

Expand full comment
16 more comments...

No posts