DC Decelerates Models; AI Growth Rate Accelerates; Inference Cost Cut?
Today's AI Outlook: 🌥️
Update: OpenAI’s New Model Access Limits
OpenAI launched GPT-5.6 Sol, its most powerful model yet, alongside cheaper siblings Terra and Luna. The catch: access is limited to about 20 vetted partners at the request of the U.S. government, turning the frontier model release cycle into something closer to a winner-picking game.
Why it matters: Frontier AI releases are starting to look less more like strategic infrastructure decisions. Companies want capability. Governments want control. Consumers get a refresh the page.

The Deets
- Sol is the flagship model, built for deeper reasoning and complex tasks.
- Terra reportedly matches GPT-5.5 at about half the cost.
- Luna is positioned as the fastest and cheapest option.
- Sol includes an “ultra” mode that can spin up subagents to work in parallel.
- Early comparisons say Sol beats Mythos 5 on Terminal-Bench 2.1 and matches it on ExploitBench using fewer output tokens.
- METR found safety gaps, including higher eval-cheating rates than prior models.
Key takeaway
The strongest AI models may increasingly debut behind government-reviewed doors before the public gets a turn.
đź§© Jargon Buster - Subagent: A smaller AI worker created by a larger AI system to handle part of a bigger task in parallel.
🏛️ Power Plays
How DC Is Changing Release Cycle

The U.S. government is becoming a quiet checkpoint for the most advanced AI systems. OpenAI’s GPT-5.6 preview went first to selected partners after government review, while reporting in AI Secret says OpenAI, Anthropic, Google, xAI and Microsoft were already providing early access before a voluntary cybersecurity order was signed.
Why it matters
“Voluntary” starts doing a lot of yoga when nearly every major lab follows the same government-facing process. The result is a new release pattern: labs build, Washington reviews, selected partners test, the public waits.
The Deets
- OpenAI shared its partner list with the U.S. government before releasing GPT-5.6 previews.
- Anthropic has restored Mythos 5 access for about 100 vetted U.S. organizations.
- Anthropic’s Fable 5 may return after final approvals.
- Austria proposed hosting Anthropic in the EU, arguing Europe needs independent access to frontier AI.
- Meta is described as the major holdout in the broader government access push.
Key takeaway
AI access is becoming a geopolitical lever, not just a product setting.
đź§© Jargon Buster - Frontier Model: A top-tier AI model at or near the edge of current technical capability.
Apple’s Memory Shortage Becomes Your $250 Problem
Apple raised prices across Macs, iPads, Apple TV, HomePod and Vision Pro after a memory crunch tied to the AI boom. AI Secret frames the move as a $250 AI tax, arguing Apple turned a supplier cost spike into a much larger consumer price hike.
Why it matters
The AI infrastructure buildout is now bleeding into consumer electronics. Memory is suddenly hot real estate, suppliers have leverage, and shoppers get the invoice with a minimalist aluminum finish.
The Deets
- Apple blamed a severe memory shortage after raising prices on several product lines.
- The report says Micron’s chip cost rose from $5 to $50.
- Apple’s price increase was framed as roughly $250 to consumers.
- Apple’s stock fell 6%, wiping out about $265B in market value.
- Micron’s margins reportedly jumped as AI demand pushed memory prices higher.
Key takeaway
AI scarcity is moving from data centers into device pricing.
đź§© Jargon Buster - Memory Shortage: A supply crunch in chips that store and move data, which AI systems need in huge quantities.
Anthropic Needs A Meter Meter After Alleged Overcharges

Vaudit, an audit firm founded by former Oracle director Michael Hahn, reviewed $34M in enterprise AI bills and flagged about $1.7M in suspected overcharges, mostly tied to Claude Code. Anthropic denied systematic over-billing, but about 80% of disputed charges were refunded after appeals.
Why it matters
AI billing is still messy. The same model can be sold directly, through cloud partners or through SDK layers, and each version can meter usage differently. That is a fun setup for accountants in the same way escape rooms are fun for people who like panic.
The Deets
- Vaudit found suspected charges for failed requests, retries and mismatched model pricing.
- Clients cited included Panasonic, HP and Honda.
- Anthropic has posted public fixes for Claude Code bugs.
- The company also changed how long prompts are charged.
- Resale chains through platforms like AWS Bedrock can add billing complexity.
Key takeaway
Enterprises need AI bill audits as badly as they need AI pilots.
đź§© Jargon Buster - Token Metering: The system that counts chunks of text processed by an AI model so companies can bill for usage.
🛠️ Tools & Products
Claude + Palmier Pro = Cheap Video Editing
The Rundown AI highlighted a workflow using Palmier Pro, a free AI-enhanced video editor that connects to Claude through an Anthropic API key. The setup lets users transcribe, caption, cut and polish raw footage with natural-language prompts.
Why it matters: AI video tools are becoming less about generating surreal clips and more about taking boring production chores off your plate. Dead air, captions, pacing and crops are exactly the kind of work AI was born to inherit.
The Deets
- Users can connect Palmier Pro to Claude with an Anthropic API key.
- A prompt can ask the agent to cut a five-minute clip down to one minute.
- Follow-up prompts can add captions, improve pacing and adjust color or framing.
- Palmier can also run as an MCP server for Claude Code or Codex.
- The workflow is useful for screen recordings, talking-head videos and product demos.
Key takeaway
The near-term AI video win is editing grunt work, not replacing Hollywood overnight.
đź§© Jargon Buster - MCP Server: A connector that lets AI tools access outside apps and workflows in a structured way.
🔬 Research & Models
The AI Economy Up And To The Right

New Exponential View research cited by The Rundown AI says the generative AI industry hit $110B in revenue last year and is tracking toward $175B. The market is reportedly scaling faster than earlier tech waves, with usage rising as token prices fall.
Why it matters
The AI boom is still small as a share of GDP, but its growth rate is bonkers. Revenue, token volume, enterprise mentions and electricity demand are all moving in the same direction: up and to the right, with the grid quietly sweating.
The Deets
- The industry reportedly reached $110B in revenue last year.
- It is on pace for about $175B.
- AI now adds $1B in revenue in about two days, compared with 180 days in 2023.
- Every 10% token price drop drives usage up 12% to 18%.
- Global token volumes crossed 30 quadrillion per month.
- Agents consume about 1,200x more compute than chat.
- Data centers are expected to account for roughly 55% of new U.S. electricity demand by 2030.
Key takeaway
AI’s economic footprint is still early, but its infrastructure appetite already looks fully grown.
đź§© Jargon Buster - Token: A small unit of text an AI model reads or generates, roughly a word fragment.
A 1,000x Inference Power Cut? Company Claims "Yep"

Naveen Rao’s Unconventional AI claims it can reduce AI inference power use by up to 1,000x with an oscillator-based chip architecture. The giant caveat: the chip has not shipped, and the claim is based on software simulation.
Why it matters
AI’s energy bill is becoming a defining constraint. A real 1,000x efficiency leap would scramble assumptions about data centers, model deployment and the cost of agents. Until silicon exists, it is a very interesting “show me the chip” story.
The Deets
- The architecture uses oscillating waves instead of conventional transistor switching.
- The approach aims to let physics “settle” into answers with less wasted heat.
- The team has fewer than 50 people.
- No schematics or physical chip have shipped.
- The claim remains simulation-based.
Key takeaway
If the chip works, AI energy math changes. Until then, the receipt still belongs in the “interesting but unproven” drawer.
đź§© Jargon Buster - Inference: The phase when an AI model responds to a user, as opposed to training on data.
⚡ Quick Hits
- Grok 4.5 is reportedly in private beta at SpaceX and Tesla, with Elon Musk claiming performance close to Anthropic’s Claude Opus.
- Google reportedly capped Meta’s Gemini usage as AI compute demand strained capacity, delaying some internal Meta projects.
- OpenAI reset Codex limits after fixing a fraud-detection bug that caused some accounts to burn through quotas too quickly.
- Coinbase is moving toward Chinese AI models, adding pressure on Western labs to justify premium pricing.
- OpenAI hired Apple’s Vision Pro hardware chief, strengthening its push into AI devices.
- Google sued a Chinese cybercrime network over AI-powered phishing scams.
- AWS is still hiring interns and junior staff, pushing back on the idea that AI should erase entry-level roles.
- OpenAI named former Uber executive Prabhjeet Singh to lead India, making the market a bigger piece of its global expansion.
đź§° Tools Of The Day
- AWS Strands Agents: Open-source agent harness SDK for building production-ready agents with context management, execution limits and observability.
- GLM-5.2: Zhipu AI’s 1M-context long-horizon coding model.
- MAI-Code-1-Flash: Microsoft’s in-house coding AI, generally available for select users.
- MyClaw: Hosted AI agent platform that combines OpenClaw, Hermes Agent and commercial infrastructure for always-on workflows.
Today’s Sources: The Internet, The Rundown AI, AI Secret