TL;AR — newsletters, already read

There is a persistent gap between what a model can do in a demo and what an enterprise will actually deploy. Procurement cycles are long, security reviews are longer, and the switching costs — once a workflow is embedded — cut both ways. The lesson is that adoption is slow to arrive and slow to leave.

Memory bandwidth has quietly become the constraint that dictates real-world throughput. You can stack more accelerators, but if the model cannot be fed fast enough, the extra compute sits idle. This is why the high-bandwidth memory roadmap is worth tracking as closely as the flagship chip roadmap.

Reading an AI earnings call is an exercise in separating booked revenue from backlog from ambition.
Signed contracts and committed capacity are real; framework agreements and letters of intent are options on the future.
The market routinely conflates the two, and that is where the mispricings live.

The Open-Weight Models Are Closing the Gap