Discussion about this post

User's avatar
JP's avatar

The 'second source becomes a platform' thesis is compelling for training. For inference though, I reckon the real threat to both Nvidia and AMD isn't each other but custom ASICs that skip the general-purpose tax entirely. Taalas is claiming 17,000 tokens/sec by hardwiring a specific model into silicon, which rewrites the cost math completely. Covered the custom silicon angle here: https://reading.sh/what-happens-when-ai-inference-gets-10-times-faster-bf0286a34a45?sk=8dfc863d0c5e9e9d15da1b2d49737b6b

Mikey Clarke's avatar

True facts. There's tremendous promise in actively dodging publicity and hype. Hype brings bubbles and instability. Corner the market's Boring Bits and you build stable, enduring moola.

No posts

Ready for more?