Share

Model Orchestras; Gaming Games; Google Goes Hollywood

Model Orchestras; Gaming Games; Google Goes Hollywood

Today's AI Outlook: ⛅️

Sakana’s Fishy Frontier Bet

Japan’s Sakana AI launched Fugu, a model-orchestration system that routes one request across multiple AI models, then combines the results behind a single API. The timing is the point: after U.S. export restrictions pulled access to Anthropic’s top models, Sakana is pitching Fugu as a way to get near-frontier performance without depending on one model, one lab or one geopolitical mood swing.

Why it matters

AI labs are trying to reach the frontier without always building one giant model from scratch. Fugu shows how model orchestration could become a practical shortcut, though early user feedback suggests the experience may trail the benchmark flexing.

The Deets

  • Fugu chooses helper models, assigns tasks, checks outputs and merges answers through one API.
  • It comes in two versions: a faster Fugu for everyday coding and chat, plus Fugu Ultra for heavier work like patent research and security testing.
  • Sakana says Fugu performs near or above Anthropic’s Fable 5 and Mythos preview on several coding, reasoning and science benchmarks.
  • Early reception has been mixed, with users questioning both performance and the cost of running a hidden model stack.

Key takeaway

Fugu is a clever bet on AI teamwork, but the market still wants proof that the scoreboard translates into real-world speed, quality and cost.

🧩 Jargon Buster - Model orchestration: A system that uses multiple AI models for one task, assigning each model a role and combining the results into one answer.


Cheap AI Fills World Of Warcraft With Bot Players

A Reddit user filled a private World of Warcraft server with 1,800 AI players, each connected to DeepSeek’s API, creating a chatty digital crowd that made the old game feel populated again. The kicker is cost: the setup reportedly runs for about 43 euros a month, roughly the price of dinner, or one very regrettable airport sandwich.

Why it matters

This is a preview of what happens when digital intelligence becomes cheap enough to spend casually. Today, the AI players mostly talk. As prices fall and agent capabilities improve, worlds like this could shift from decorative chatter to characters that quest, trade, organize and cause glorious MMO nonsense.

The Deets

  • The AI characters banter about raids, talent trees and in-game memories.
  • The post reportedly hit 1,100 upvotes and topped the board.
  • The current limitation is action, since the bots can talk but do not meaningfully play.
  • The broader signal is that believable digital crowds are getting cheaper to create.

Key takeaway

Cheap AI turns simulated people from a studio-budget feature into a hobbyist experiment, and that changes games, virtual worlds and eventually customer service avatars that refuse to log off.

🧩 Jargon Buster - API: A software connection that lets one app send requests to another service, such as asking an AI model to generate text.


⚡ Power Plays

Compute Landlords Are Having A Moment

SpaceX reportedly signed a deal with Reflection AI to rent $6.3B of Nvidia computing power from its Colossus data centers. The same infrastructure built to train Grok models is now turning into a revenue engine for other AI labs trying to secure scarce compute.

Why it matters

The AI race is increasingly about who controls the machines, power and cooling. SpaceX may not need Grok to be the top model if Colossus becomes prime real estate in the GPU gold rush.

The Deets

  • Reflection AI launched in October to build open frontier systems for government and enterprise.
  • Reflection has not yet released a public model.
  • The Rundown AI reported other Colossus customers include Anthropic, Google and Cursor.
  • AI Secret reported Reflection will pay SpaceX $150M a month for GB300 compute.

Key takeaway

SpaceX is turning AI infrastructure into a business line, proving the landlord can win even when the tenants are the ones training the models.

🧩 Jargon Buster - GB300 compute: Nvidia’s high-end AI computing hardware used for training and running advanced AI models.


Tesla Trademarks The AI Data-Center Lego Brick

Tesla filed a U.S. trademark for Megapod on June 18, describing a modular AI data-center hardware system covering servers, AI compute, networking, power distribution, cooling and management software. There are no specs, prototype, price or ship date yet, but the idea is clear: prefab compute infrastructure that can be hauled in and plugged into the AI boom.

Why it matters

Power and cooling are now major bottlenecks. Tesla already knows batteries, manufacturing and energy infrastructure, and xAI has reportedly bought roughly $1B in Megapack grid batteries. Megapod suggests Tesla wants the boring but lucrative layer beneath the GPUs.

The Deets

  • The filing is an intent-to-use trademark, meaning Tesla has not launched the product.
  • The system would package data-center hardware into a modular setup.
  • Tesla recently ended its own training-chip effort, according to AI Secret.
  • The play targets deployment infrastructure rather than competing directly with Nvidia chips.

Key takeaway

Tesla appears to be eyeing the part of AI nobody puts on hype slides: the power, cooling and deployment stack that decides whether GPUs sit idle or print money.

🧩 Jargon Buster - Intent-to-use trademark: A filing that reserves a name for a planned product or service before it is sold publicly.


AI’s Energy Appetite Keeps Ordering Seconds

Microsoft and Chevron are planning a 2.67GW gas plant in West Texas to power Microsoft’s AI and cloud data centers, while Nvidia said its Rubin servers are the first with 100% liquid cooling, using hot-tub-temperature coolant to cut cooling energy and reduce water use by “up to 100%.”

Why it matters

AI infrastructure is becoming a climate and grid story as much as a software story. Companies are searching for more power while also trying to make cooling less wasteful, which is a bit like ordering a triple cheeseburger with a Diet Coke and hoping the spreadsheet applauds.

The Deets

  • Microsoft and Chevron’s gas-plant plan raises questions about Microsoft’s carbon goals.
  • Nvidia’s Rubin servers use liquid cooling across the system.
  • Nvidia claims the setup can reduce cooling energy and sharply reduce water use.
  • The wider AI infrastructure footprint remains unresolved.

Key takeaway

AI’s next bottleneck may be measured in gigawatts, water use and permitting timelines.

🧩 Jargon Buster - Liquid cooling: A method of cooling servers with liquid coolant instead of relying mostly on air, often improving efficiency in dense data centers.


🛠️ Tools & Products

Google, A24 Bring AI To The Director’s Chair

Google put $75M behind indie studio A24, pairing it with DeepMind researchers and infrastructure to build AI tools for filmmaking. The pitch is filmmaker-shaped workflows, not instant movie generation. A24’s tech arm, led by former Adobe executive Scott Belsky, is working on AI storyboarding tools that he said will not resemble standard prompt-based generation.

Why it matters

Hollywood’s relationship with AI is a bit whiplashed. Studios are wary of lawsuits, labor backlash and fan revolt, but they also want tools that make production faster and cheaper without looking like a content vending machine.

The Deets

  • Google gets its first studio stake.
  • A24 gets access to DeepMind research support across several projects.
  • A24 is not handing over its film library or data.
  • The deal follows A24’s “Backrooms” success, even as director Kane Parsons has criticized AI’s broader cultural impact.

Key takeaway

Google and A24 are trying to make AI feel like a creative assistant rather than a replacement director, though audiences may still bring tomatoes.

🧩 Jargon Buster - AI storyboards: Visual planning tools that use AI to help filmmakers sketch scenes, shots and sequences before production.


Groq Gets A Reset Button

Groq raised $650M after Nvidia licensed its IP and hired key leaders, according to AI Secret. The company is shifting focus toward its inference cloud while rebuilding its team.

Why it matters

Inference is becoming a giant market, but it is also brutally competitive. Groq’s new chapter suggests AI chip and cloud companies are still searching for the right mix of hardware, software and customer demand.

The Deets

  • Groq raised $650M.
  • Nvidia licensed Groq IP and hired key leaders.
  • Groq is refocusing on inference cloud.
  • The company is rebuilding after leadership and strategy changes.

Key takeaway

Groq’s future now depends on whether it can turn specialized inference into a durable cloud business.

🧩 Jargon Buster - Inference cloud: A cloud service built to run AI models quickly and efficiently for customers.


🔬 Research & Models

Frontier AI Still Has A Cash-Burn Problem

A SemiAnalysis report cited by AI Secret stress-tested top AI subscriptions and found ugly unit economics. A $200-per-month ChatGPT Pro plan could burn about $14,000 in compute at API rates under heavy use, while Anthropic’s $200 Claude Max could run near $8,000. OpenAI reportedly goes negative past 11.4% utilization.

Why it matters

Flat-rate AI subscriptions were designed for chat. Agents can use far more compute because they plan, call tools, retry tasks and run longer workflows. Meanwhile, cheaper open models are pressuring API revenue, making it harder for frontier labs to price their way to profit.

The Deets

  • Heavy users can cost far more than they pay in subscription fees.
  • Agent workloads can consume vastly more compute than simple chatbot prompts.
  • Microsoft, Meta and Amazon are reportedly capping internal AI spend.
  • Lindy moved all traffic to DeepSeek V4, saving millions, according to AI Secret.

Key takeaway

Frontier AI companies have users, hype and capital, but their pricing math still looks like a magic trick.

🧩 Jargon Buster - Unit economics: The profit or loss a company makes from one customer, product or transaction after accounting for costs.


⚡ Quick Hits

  • OpenAI expanded its Daybreak cyber program with a Codex Security plugin, GPT-5.5-Cyber and a “Patch the Planet” effort with Trail of Bits to fix open-source vulnerabilities.
  • The Five Eyes cyber agencies warned that AI is changing cyber risk in “months, not years,” urging executives to harden defenses.
  • Micron signed a strategic deal with Anthropic to supply memory and storage chips, co-design AI infrastructure and invest in Anthropic’s Series H.
  • California Rep. Sam Liccardo introduced the SKILL Act, which would offer companies up to $5,000 per worker in tax credits for AI job training at colleges.
  • OpenAI is sunsetting Pulse, its personalized ChatGPT daily-digest feature, after the product failed to stick.

🧰 Tools Of The Day

  • Sakana Fugu: Sakana’s new orchestration model routes tasks across multiple models through one API, aiming for near-frontier performance with less dependence on any single provider.
  • Typeless: An AI voice-command tool for dictating, researching, summarizing and drafting without living on the keyboard.
  • Codex Security: OpenAI’s plugin for finding and patching software vulnerabilities as part of its broader cyber push.
  • Eve: Vercel’s open-source framework for turning a file directory into an agent.
  • MyClaw: A cloud service that connects OpenClaw to apps like Gmail, Slack, Docs, Sheets, Notion, GitHub, Telegram and WhatsApp.
  • Tely AI: A health care-focused AI tool pitched for helping providers answer patient questions, retarget leads and book appointments.

Today’s Sources: The Internet, The Rundown AI, AI Secret

Subscribe to AI Slop

Sign up now to get access to the library of members-only issues.
jamie@example.com
Subscribe