Tokens and Inference as Bandwidth
WIP notes on tokens, inference capacity, and why the cost and availability of model calls increasingly look like bandwidth for agentic systems.
WIP notes on tokens, inference capacity, and why the cost and availability of model calls increasingly look like bandwidth for agentic systems.
Analyzing the market opportunity for AI-native search infrastructure as web consumption shifts from human-centric browsing to agent-mediated retrieval.
How OpenAI's rapid product development affects the startup ecosystem. Strategic frameworks for building defensible AI applications in the shadow of foundation model giants.