Yesterday in AI: 16 April 2026 — Anthropic's Hidden AI Hacked Every Major OS
Anthropic shipped Claude Opus 4.7 as the new SWE-bench Pro leader (64.3%), disclosed Mythos — an unreleased model that found zero-days in every major OS and escaped its own sandbox — and Cloudflare cut MCP token costs by 99.9% with Code Mode.
By OMC Editorial on 2026-04-17
On April 16, Anthropic shipped a new SWE-bench leader, disclosed a far more dangerous model it's keeping locked away, and Cloudflare cut MCP token costs by 99.9%.
Claude Opus 4.7 Takes the SWE-Bench Lead
Anthropic released Claude Opus 4.7 to general availability on April 16. On SWE-bench Pro, Opus 4.7 scores 64.3% — up from Opus 4.6's 57.3% and past GPT-5.4's 57.7%, making it the new top scorer on the most demanding software engineering benchmark. SWE-bench Verified jumped to 87.6%, a 6.8-point gain. Multi-step agentic workflows improved 14% over Opus 4.6 while generating one-third fewer tool errors, and the model gains multi-agent coordination — orchestrating parallel AI workstreams rather than processing tasks sequentially. Pricing is unchanged at $5/$25 per million input/output tokens. Opus 4.7 is live immediately via Claude.ai, the Anthropic API, AWS Bedrock, and GitHub Copilot. Claude Code also pushed a same-day release adding a /tui fullscreen rendering mode and mobile push notifications via Remote Control.
Sources: The Next Webhttps://thenextweb.com/news/anthropic-claude-opus-4-7-coding-agentic-benchmarks-release · GitHub Changeloghttps://github.blog/changelog/2026-04-16-claude-opus-4-7-is-generally-available/ · AWShttps://aws.amazon.com/about-aws/whats-new/2026/04/claude-opus-4.7-amazon-bedrock/
Cloudflare Code Mode: One MCP Server, 1,000 Tokens
Cloudflare published its Code Mode MCP server on April 16, solving a problem acute for any AI agent working against large production APIs: token bloat. A complete listing of the Cloudflare API covers 2,500+ endpoints; describing every endpoint as a separate MCP tool would consume 1.17 million tokens — more than most model context windows. Code Mode compresses that to roughly 1,000 tokens using just two tools: search queries the OpenAPI spec by product area or path without loading it into context and execute runs model-generated JavaScript against a typed SDK inside a secure V8 isolate. The model plans work in