Vol. I · No. 18THU, MAY 7, 2026
Topic

§ Infrastructure

Every story tagged with this topic, ordered by date.

Anthropic Just Secured a Reserve.

Anthropic secures partnership with SpaceX for 300MW+ compute at Colossus 1, adding 220k+ NVIDIA GPUs within one month.

··

SpaceX Conpute Deal - Double Limits

Anthropic partners with SpaceX for compute capacity; removes Claude Code peak-hour limits and raises API rate limits for Opus.

··

Why run local? Count the money

User quantifies cost savings from running local Qwen-397B with Hermes agent vs. API pricing: 200M tokens in 5 days ≈ $250 saved at API rates.

··

Amazon’s Durability

Stratechery analysis: Amazon lagged in AI training but positioned for inference dominance through sustained infrastructure investment.

·

Llama.cpp MTP support now in beta!

llama.cpp adds beta MTP (Multi-Token Prediction) support, starting with Qwen3.5, closing performance gap with vLLM on token generation.

··
50 stories