The Rundown AI · Industry

Anthropic's Mythos Too Powerful to Release, Chinese Open Source Model Tops Coding Benchmarks

😱Anthropic's Project Glasswing Reveals Mythos AI

Anthropic introduced Project Glasswing, a cybersecurity coalition with AWS, Apple, Google, Microsoft, Nvidia, and seven other partners, built around Claude Mythos Preview, an unreleased frontier model that the company considers too powerful for general release.

  • Mythos flagged thousands of security flaws across every major OS and browser, including bugs that survived 27 years of review and millions of scans
  • Benchmarks show major improvements over both Opus 4.6 and other frontier rivals across coding, reasoning, and nearly every other domain
  • The model will not be released publicly; access is limited to 12 launch partners and 40+ other organizations for defensive security work, backed by $100M in credits
  • Anthropic's Sam Bowman called it "an uneasy surprise" after Mythos emailed him from a test instance that wasn't supposed to have internet access
  • Mythos leaked last week when a draft blog post was found in unpublished files; Anthropic has been using the model internally since February

Why it matters: Mythos offers a glimpse of what top labs have under wraps. Anthropic considers the model too powerful to release publicly, and is instead giving itself and its partners time to work on cybersecurity and safety rollouts for future Mythos-level general models.

🚀Open Source AI Advances with Z AI's GLM-5.1

Chinese AI lab Z AI released GLM-5.1, a new open-source coding model that competes with frontier rivals on coding benchmarks and is built for marathon autonomous sessions of up to 8 hours straight.

  • GLM-5.1 hit 58.4 on SWE-Bench Pro, topping both GPT-5.4 and Opus 4.6 and marking a rare moment for open source at No. 1 on a top coding benchmark
  • Z AI said the model can "stay effective on agentic tasks over much longer horizons", showing strong results over longer, complex problems
  • In tests, Z AI had GLM-5.1 build a working Linux desktop as a web app over 8 hours, including a file browser, terminal, and games, without human guidance
  • The model also shows top performance in Arcada Labs' Design Arena, coming in second for creative web design after Claude Opus 4.6

Why it matters: Top Chinese labs continue to chase the frontier, with GLM-5.1 showing the strongest open-source coding performance yet, along with long-horizon capabilities that Z AI called the "most important curve after scaling laws". An open-source model at this level shows how fast the gap with closed frontier models is closing.

📧Get to Inbox Zero with This Claude Prompt

In this guide, you will learn how to run email triage once in Claude, then have Claude write a recurring task prompt for your inbox—ending up with a daily cleanup workflow based on your real rules, not a generic prompt.

  • Open Claude with Gmail access and run one triage session on your unread inbox to show Claude what matters before automating the process
  • Prompt: "Generate an interactive email triage report for the last 24 hours. Sort each email into: Needs response, Needs attention, Archive, Archive and unsubscribe. Include sender, subject, one-line reason, direct link, and item number. Add labels to approved emails"
  • Review, correct misfires, and prompt: "Turn this workflow into a recurring task prompt for my inbox, with my common senders, archive, and unsubscribe rules"
  • Save that prompt as a Claude Cowork scheduled task so it can run every morning without rebuilding the logic

Why it matters: Set up Gmail rules around the labels Claude adds. Claude can apply labels through the Gmail connector, so emails labeled "Needs response" and "Needs attention" can be auto-starred, and emails labeled "Archive" can be auto-archived.
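
To make the shape of the triage report concrete, here is a minimal Python sketch of the sorting step described above. It is an illustration only, not how Claude or the Gmail connector works internally: the `Email` class, the `SENDER_RULES` table, the example addresses, and the `triage` function are all hypothetical names invented for this example. The four categories and the report fields (sender, subject, one-line reason, direct link, item number) come from the prompt in the guide.

```python
from dataclasses import dataclass

# The four buckets named in the triage prompt.
CATEGORIES = (
    "Needs response",
    "Needs attention",
    "Archive",
    "Archive and unsubscribe",
)


@dataclass
class Email:
    sender: str
    subject: str
    link: str


# Hypothetical personal rules: sender substring -> (category, one-line reason).
# In the guide, these are the rules Claude learns from your corrections.
SENDER_RULES = {
    "boss@example.com": ("Needs response", "direct request from manager"),
    "billing@": ("Needs attention", "invoice or payment notice"),
    "newsletter@": ("Archive", "routine newsletter"),
    "promo@": ("Archive and unsubscribe", "marketing that is never opened"),
}


def triage(emails, rules=SENDER_RULES,
           default=("Needs attention", "no rule matched")):
    """Return one report row per email, numbered from 1."""
    report = []
    for number, email in enumerate(emails, start=1):
        category, reason = default
        # First matching sender rule wins; unmatched mail falls to the default.
        for pattern, outcome in rules.items():
            if pattern in email.sender:
                category, reason = outcome
                break
        report.append({
            "item": number,
            "sender": email.sender,
            "subject": email.subject,
            "category": category,
            "reason": reason,
            "link": email.link,
        })
    return report


inbox = [
    Email("boss@example.com", "Q2 plan?", "https://mail.example.com/1"),
    Email("promo@shop.example", "50% off", "https://mail.example.com/2"),
    Email("friend@example.org", "Lunch", "https://mail.example.com/3"),
]

for row in triage(inbox):
    print(f'{row["item"]}. [{row["category"]}] {row["sender"]}: '
          f'{row["subject"]} ({row["reason"]})')
```

Unmatched senders default to "Needs attention" here so that nothing is archived without an explicit rule; that default is a design choice of this sketch, not something the guide specifies.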

💰Anthropic Continues Rise, Locks in 3.5GW Compute

Anthropic signed a multi-gigawatt compute deal with Google and Broadcom, locking in 3.5GW of TPU capacity for 2027, while also sharing new surging revenue numbers and enterprise growth despite its battle with the U.S. government.

  • Since January, Anthropic's run-rate revenue tripled to $30B, and its $1M+ enterprise customer base doubled to 1,000+, forcing the compute expansion
  • Broadcom will supply 3.5GW of Google's TPUs starting in 2027, nearly all US-based—adding to the $50B Anthropic pledged for domestic AI buildout
  • The run-rate figure puts the company ahead of rival OpenAI's recently reported $2B/month in revenue, as both race toward an IPO
  • The growth comes despite the Pentagon labeling Anthropic a supply-chain risk, a move the company says rattled over 100 enterprise clients

Why it matters: Tripling run-rate revenue while carrying a Pentagon supply-chain-risk label shows demand for Claude remains very strong. Given recent rate-limit issues, more compute is welcome, especially with behemoth models like Mythos waiting in the wings.
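
For readers comparing run-rate and monthly revenue figures: run-rate revenue is conventionally a recent period's revenue annualized, most often one month multiplied by 12. The short Python sketch below works through that arithmetic using only the Anthropic numbers quoted in this piece. The function names are illustrative, and the January baselines are back-calculated from "tripled" and "doubled" as an assumption; they are not figures reported in the source.

```python
def monthly_to_run_rate(monthly_billions: float) -> float:
    """Annualize one month of revenue (in $B) into a run-rate."""
    return monthly_billions * 12


def run_rate_to_monthly(run_rate_billions: float) -> float:
    """Convert an annualized run-rate (in $B) to the implied monthly figure."""
    return run_rate_billions / 12


current_run_rate = 30.0  # $B, the run-rate quoted in the article

# Implied monthly revenue behind a $30B run-rate.
implied_monthly = run_rate_to_monthly(current_run_rate)

# "Tripled since January" implies roughly a third of today's run-rate then.
implied_january_run_rate = current_run_rate / 3

# "Doubled to 1,000+" implies roughly 500 customers at the $1M+ tier before.
implied_prior_customers = 1000 // 2

print(f"Implied monthly revenue: ${implied_monthly:.1f}B")
print(f"Implied January run-rate: ${implied_january_run_rate:.0f}B")
print(f"Implied prior $1M+ customers: about {implied_prior_customers}")
```

To compare a rival's monthly figure on the same basis, pass it through `monthly_to_run_rate` first so both numbers are annualized.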