TL;AR

The Open-Weight Models Are Closing the Gap

What the Latest Model Release Actually Changes

Watch the depreciation schedules, not just the capex line. Hyperscalers are quietly extending the useful life they assign to servers, which flatters near-term margins even as the underlying hardware ages faster in AI workloads than the accounting assumes. When those schedules snap back to reality, the earnings hit arrives all at once.

Advanced-packaging line bonding memory stacks onto a compute die.

The headline number everyone fixates on is training compute, but the margin story is increasingly about inference. As model providers push cheaper, faster variants, the cost of serving a query has collapsed by roughly an order of magnitude in eighteen months — and that changes which products are viable to build on top of them.

This post is exclusive to subscribers

Subscribe to The Model Report for $12/month to read this and every exclusive post.