Share

Spud Done Cooking; AI Power Users' Job Fears; MS Will Now Do Your Work

Spud Done Cooking; AI Power Users' Job Fears; MS Will Now Do Your Work

Today's AI Outlook: ☀️

OpenAI’s Spud Launch Puts Anthropic Back On Defense

OpenAI finally launched GPT-5.5, codenamed Spud, and the potato jokes will probably write themselves. The model is being pitched as a new class of intelligence, with strong gains across reasoning, coding, agentic work and computer-use tasks.

The launch lands at a rough moment for competitor Anthropic, which is dealing with complaints about rate limits and model quality, giving OpenAI a clean narrative reset after months of Claude momentum.

Why it matters

The frontier model race has shifted from “Who chats best?” to “Who gets work done reliably, cheaply and repeatedly?” GPT-5.5 looks designed for that world: agents, coding systems, automation workflows and self-checking multi-step tasks.

The Deets

  • GPT-5.5 reportedly tops public benchmark scores across reasoning, coding, computer use and agentic tasks.
  • It lands in ChatGPT plans and Codex, including Thinking and Pro variants.
  • API pricing is listed at $5 per million input tokens and $30 per million output tokens.
  • OpenAI says the model keeps similar speed to GPT-5.4 while improving efficiency.
  • OpenAI also says it used Codex and GPT-5.5 to rewrite GPU code and improve its own infrastructure.

Key takeaway

The agent era is becoming a routing war, not a loyalty contest. Developers will pick models based on task completion, cost and reliability, not vibes, although vibes are clearly back in OpenAI’s favor.

🧩 Jargon Buster - Agentic Tasks: Work where an AI does more than answer. It plans, uses tools, checks progress and keeps going toward a goal.


⚖️ Power Plays

Washington Says China Copying U.S. Homework At Scale

The White House published a memo accusing Chinese AI firms of industrial-scale distillation against U.S. frontier labs. The accusation is that companies are using fake API accounts, jailbreaks and model outputs to train smaller systems that mimic advanced U.S. models. The timing is spicy: ahead of a scheduled Trump-Xi meeting in Beijing on May 14-15.

Why it matters

This reframes the China AI race. Instead of only debating whether Chinese labs are catching up through better engineering or open-source efficiency, the White House is arguing that part of the gap is being closed through aggressive model-copying tactics.

The Deets

  • The memo accuses Chinese firms of using thousands of fake API accounts and jailbreaks to harvest frontier model outputs.
  • Anthropic previously accused DeepSeek, Moonshot and MiniMax of distillation.
  • The Chinese embassy rejected the accusations as “pure slander.”
  • A House Foreign Affairs bill would push the administration to place distillation offenders on the U.S. export blacklist.

Key takeaway

AI geopolitics is moving from chips to outputs. The next export-control fight may be about not just who can buy GPUs, but who can access frontier model behavior.

🧩 Jargon Buster - Distillation: Training a smaller AI model using the outputs of a larger, more powerful model so it can imitate parts of its performance.


🛠️ Tools & Products

Make Claude Your Personal Morning Editor

The Rundown AI has a practical Claude workflow for your mornings: connect Slack, Notion, Gmail and Calendar, then have Claude turn the last 24 hours of updates into a personalized morning newspaper. The prompt asks Claude to rank what matters, format it like a paper and include top stories, action items and schedule prep.

Why it matters

This is where AI assistants get sticky. Not by being one more chatbot tab, but by becoming the layer that turns your scattered inboxes, calendars and work apps into a readable briefing before coffee has achieved legal consciousness.

The Deets

  • The workflow uses Claude or Claude Cowork with Slack, Notion, Gmail and Calendar connected.
  • Users ask Claude to create a static Morning Edition from the past 24 hours.
  • Claude can revise the layout, ordering and emphasis based on feedback.

Key takeaway

Claude is leaning into the life admin layer: calendars, groceries, travel, music, reservations and daily prep. The assistant battle is getting domestic.

🧩 Jargon Buster - Skill: A reusable AI workflow that performs a specific task, such as creating a morning briefing or summarizing updates.


Microsoft Brings Agent Mode To Office

Microsoft launched Agent Mode in Word, Excel and PowerPoint, letting Copilot directly edit documents, spreadsheets and slides. That moves Copilot closer to becoming a coworker that actually touches the files, rather than a very confident intern standing nearby with suggestions.

Why it matters

Office is where a lot of enterprise work still lives. If agents can edit spreadsheets, rewrite decks and manipulate documents safely, AI adoption becomes less abstract and much more embedded in daily work.

The Deets

  • Agent Mode is coming to Word, Excel and PowerPoint.
  • Copilot can directly edit content instead of only generating suggestions.
  • The move fits the broader shift toward AI systems that can complete workflows inside existing productivity tools.

Key takeaway

The office suite is becoming an execution surface for agents. The next productivity jump may come from AI that edits the file, not AI that tells you how to edit the file.

🧩 Jargon Buster - Agent Mode: A product setting where an AI can take direct actions inside software, such as editing a document or changing a spreadsheet.


💸 Funding & Startups

Nuclear Gets A Data Center Glow-Up

Amazon-backed X-Energy raised $1.02B in its IPO, betting on rising energy demand from AI data centers. The funding signal is clear: as AI workloads grow, the infrastructure story increasingly becomes an energy story.

Why it matters

Frontier AI is not only a software race but a power race. More models, more agents and more compute-heavy products mean more demand for electricity, cooling and long-term energy planning.

The Deets

  • X-Energy raised $1.02B in its IPO.
  • The company is tied to the broader nuclear-power push around AI data center demand.
  • AI Secret framed the raise as part of the energy buildout needed to support AI infrastructure.

Key takeaway

AI’s appetite for compute is pulling nuclear back into the startup and infrastructure conversation.

🧩 Jargon Buster - Data Center Demand: The rising need for electricity, chips, cooling and buildings to run AI models and cloud services.


🔬 Research & Models

The Workers Getting Most From AI Are Also The Most Nervous

Anthropic published an economic follow-up to its Claude survey, analyzing responses from 80,508 workers and tying them to its Economic Index usage data. The result is awkward for the “AI makes everyone feel empowered” storyline: the workers seeing the biggest productivity gains are also the most worried about job displacement, especially early-career workers.

Why it matters

The adoption story is not simply “AI helps, therefore people relax.” For many workers, the better the tool gets, the more obvious it becomes that parts of their job can be compressed, automated or reassigned.

The Deets

  • Anthropic linked Claude usage patterns to worker sentiment across 80,508 respondents.
  • Workers in jobs that use Claude most voiced displacement fears 3x more than those in jobs using it least.
  • Engineers showed particularly high anxiety.
  • Many respondents said AI helps them move faster and free up time.
  • Early-career workers expressed the loudest fears about displacement.

Key takeaway

AI productivity is arriving with a psychological invoice. The people closest to the tools often see both the upside and the threat first.

🧩 Jargon Buster - Economic Index: Anthropic’s data effort to track how people use Claude across different types of jobs and tasks.


DeepSeek V4 Arrives With Big Numbers (And Bigger Caveats)

DeepSeek V4 launched with two open models, 1M context, stronger agents, better coding and lower-cost deployment. AI Secret says the model looks strong, but argues the benchmark story is outpacing real-world proof. The claim: V4 may beat Sonnet 4.5 and approach Opus 4.5, but that still leaves questions about how it performs against the true frontier under messy production workloads.

Why it matters

Open models keep getting more capable, but benchmarks are not the same as production reliability. Long context and strong charts matter less if agents struggle when real work gets chaotic.

The Deets

  • DeepSeek V4 reportedly includes two open models.
  • It offers 1M context.
  • It claims gains in agents, coding and deployment cost.
  • Benchmark strength can fade when agents face messy context, tool use and delivery pressure.

Key takeaway

DeepSeek V4 may be a serious open-model release, but the real test is not the chart. It is whether agents can finish hard work without leaving a cleanup bill.

🧩 Jargon Buster - Context Window: The amount of text, code or data an AI model can consider at once while generating a response.


⚡ Quick Hits

  • Google says 75% of new code is AI-generated, up from 50% last fall and 25% a year earlier, according to There’s An AI For That.
  • Sony AI’s ping-pong robot Ace reportedly beat elite and professional humans under official table-tennis rules.
  • Meta added parental tools showing teens’ weekly Meta AI topics, plus conversation prompts and an AI wellbeing council.
  • ITA Airways will use SITA’s AI flight optimization tool to cut fuel use and emissions.
  • SpaceX disclosed plans to make its own GPUs, while warning about chip supply, costs and manufacturing execution risks.
  • Log files are becoming the new AI search console, as publishers try to understand how GPTBot, ClaudeBot, ChatGPT-User and PerplexityBot crawl their sites.

🧰 Tools Of The Day

  • Scrunch helps brands see how AI systems talk about them, what blocks visibility and how to improve discoverability.
  • Whacka turns ideas into working apps and retro games from a phone, handling build, design and deployment.
  • Betula.ai manages inbound and outbound calls, screens spam, books appointments and remembers customer conversations.
  • Hedy AI listens to meetings and interviews in real time, then coaches users on what to say next.
  • ThumbnailCreator.com creates YouTube thumbnails from video URLs and claims a 73% higher click-through rate.
  • Conscriba generates WebMCP tools from websites so agents can interact with content and test descriptions for better discovery.
  • Halomate is a mobile AI workspace for running GPT, Claude, Gemini, DeepSeek and Grok side by side with specialized memory agents.
  • Agentspan is an open-source framework and runtime for building, running and observing durable AI agents.

Today’s Sources: The Internet The Rundown AI, AI Secret, There’s An AI For That

Subscribe to AI Slop

Sign up now to get access to the library of members-only issues.
jamie@example.com
Subscribe