πŸ”¨ The Anvil

Monday, May 11, 2026

15 stories · Standard format

Generated with AI from public sources. Verify before relying on for decisions.

🎧 Listen to this briefing or subscribe as a podcast →

Today on The Anvil: a $10B xAI–Cursor bet, a US-Iran peace proposal that died on arrival, and a sober GEP/Darden report on why most supply-chain AI pilots never scale. Plus a Hugging Face supply-chain attack that hit 244K developers and the slow, expensive work of cleaning PFAS out of West Plains wells.

AI Coding & Design Tools

xAI Puts $10B Into Cursor With $60B Acquisition Option; Composer Already at $2B ARR

xAI invested $10B in Cursor with a $60B acquisition option exercisable in 2026, and granted Cursor direct access to Colossus for model training. Cursor's Composer feature is reportedly already at $2B annualized revenue less than six months after launch. This lands on top of Cursor's confirmed $2B raise at $50B+ valuation last month, the Cursor 3.3 / Background Agent GA shipped May 7, and 70% Fortune 1000 penetration β€” the platform is now simultaneously the most-capitalized AI coding tool and a conditionally independent one.

The $60B acquisition option is the new signal here β€” it puts a hard strike price on Musk's optionality, meaning Cursor's independence is now priced and time-bounded rather than assumed. Colossus compute access (220,000+ NVIDIA GPUs, already connected to Anthropic via the SpaceX deal covered May 7–8) changes Cursor's fine-tuning cadence for Composer 2 materially. For teams that have tracked Cursor's ARR run from $2B raise to $2B Composer revenue in parallel, the practical question shifts: what does a Musk-aligned roadmap look like for a tool that 35% of its own PRs already run through cloud agents?

Verified across 1 sources: Crypto Briefing

OpenAI Ships Codex Auto-Review: Separate GPT-5.4 Agent Approves Sandbox Actions, Cuts Interruptions 200x

OpenAI's Codex Auto-Review (launched April 30, documented in detail this week) routes sandbox approval requests to a structurally separate GPT-5.4 Thinking agent rather than interrupting the developer. Reported ~200x reduction in approval prompts with 9.7% miss rate on overeager actions. The split-agent architecture is the design choice β€” keeping the task-completion agent away from its own safety decisions to prevent the well-known 'approve-everything' drift.

This is the cleanest architectural answer yet to the friction-vs-safety tradeoff that's plagued every coding agent shipping into production. The 9.7% miss rate is the number to benchmark against TrustFall-class supply-chain attacks and the 56 AI-code CVEs filed in Q1. Worth pairing with the Cursor Background Agent GA and OpenAI Symphony β€” three different vendors are now converging on async, separately-governed agent runtimes as the production pattern.

Verified across 1 sources: New Claw Times

Forbes Tech Council: AI Writes the Code, Nobody's Testing It β€” QA Atrophy Is the Hidden Liability

Software quality engineer Khurram Mir argues organizations are cutting QA investment while AI-generated output climbs, and that 'syntactically fluent but contextually risky' code is compounding defect density invisibly. Using AI to test AI creates circular validation. Lands alongside this week's ServiceNow Build Agent GA and Q1 2026 data showing 56 CVEs attributed to AI-generated code plus eight-figure breach losses tied to coding-assistant misuse.

The vibe-engineering / governance thread keeps maturing in the same direction: velocity metrics decoupled from quality metrics produce false confidence. For a product leader, the actionable point is that test strategy needs to be human-led even when code generation is agent-led β€” and that 'looks right' is now a measurable failure mode, not a vibe. Pair this with the OpenAI Auto-Review architecture above: separating generation from review is the same principle at two different layers of the stack.

Verified across 1 sources: Forbes Technology Council

ServiceNow Build Agent GA Across Cursor, Windsurf, Claude Code, and Copilot β€” Governance Layer for AI Coding Lands

At Knowledge 2026, ServiceNow shipped Build Agent GA across the four leading AI coding tools, positioning itself as a cross-IDE enterprise governance and audit layer. The launch is explicitly framed against Q1 2026's documented losses β€” a $3.5M HIPAA fine at a healthcare company and a $54M loss at a manufacturer, both tied to coding-assistant breaches and data leaks.

Notable because it doesn't try to replace developer tool choice β€” it overlays compliance and audit trails on top. That's the right shape for enterprises that lost the 'standardize on one IDE' fight years ago. The unanswered question, as the Byte Iota piece notes, is whether governance infrastructure actually prevents breaches or just produces a better paper trail after them. Worth watching against the TrustFall MCP class vulnerabilities β€” governance layers don't patch the agents themselves.

Verified across 1 sources: Byte Iota

AI Developments

Sakana + NVIDIA TwELL: Custom CUDA Kernels Convert FFN Activation Sparsity Into 20%+ Inference and Training Speedup

Sakana AI and NVIDIA released TwELL, a sparse data format plus custom CUDA kernels exploiting activation sparsity in feedforward layers β€” 20.5% inference and 21.9% training speedup on 2B-param models, achieved via ReLU activations and L1 regularization that induce ~99.5% sparsity with no accuracy loss on standard benchmarks. No architectural changes required.

FFN layers consume 80%+ of LLM compute and have been carrying massive activation sparsity that GPU kernels couldn't exploit. Turning that latent sparsity into real wall-clock gains is the kind of below-the-API improvement that quietly lowers everyone's bill. Sits alongside Star Elastic (covered May 10) and ERNIE 5.1's training-cost compression as the third efficiency thread this week β€” the model economics story keeps shifting from 'scale up' to 'extract waste'.

Verified across 1 sources: MarkTechPost

AI in Supply Chain & Logistics

GEP/Darden: 90% of Supply-Chain AI Pilots Never Scale β€” 74% Still in Planning

A GEP and UVA Darden survey of 180 supply-chain executives finds fewer than 1 in 10 have scaled AI pilots enterprise-wide; 74% remain in planning. The diagnosis: the failure isn't model quality, it's lack of process transformation, change management, and organizational design. Successful deployments require on-site AI PhDs, frontline process experts, automated data cleansing, and steering committees β€” not just procurement of a vendor platform.

This is the most quantified version yet of the 'decision-latency consensus' thread you've been tracking from ARC, Gartner, and RELEX. It reframes what looks like an AI maturity problem as a data-and-process governance problem β€” and matches the entity.biz argument the same day that SMEs need a 90-day data-layer roadmap before any AI tooling. The practical implication: every supply-chain AI vendor story (Descartes, Avanade, DHL, FourKites) needs to be read against this baseline of how many comparable deployments actually stuck.

Verified across 2 sources: Supply Chain Brain · entity.biz

Descartes Ships Fleet Data Intelligence with 'RenΓ©' Agent: 30% Route-Density Gains Without Adding Vehicles; Rockwell Cuts Refrigeration Energy 17%

Descartes launched Fleet Data Intelligence with a natural-language agent (RenΓ©) sitting on top of its Global Logistics Network β€” early deployments show 30% route-density improvement and better service-time predictions without expanding fleet. Same week: Rockwell Automation + Actemium deployed Real-Time Coefficient of Performance on PlantPAx for a frozen-food manufacturer, hitting 17% refrigeration energy reduction ($130K/site/year), and Avanade demoed inventory cuts of 20% at a snack brand and $35M cost recovery at an electronics maker at Hannover Messe.

Three concrete execution-layer wins to set against the GEP/Darden pilot-purgatory finding above. The unifying pattern is narrow scope (one workflow), measurable physical-world KPIs (energy, density, inventory), and AI as orchestration layer over operational data the customer already trusts. For a product builder thinking about where supply-chain AI actually lands, this is the shape: pick a bounded execution loop with clean instrumentation, not a 'transformation.'

Verified across 3 sources: Warehouse News · Rockwell Automation / MarketScreener · IIoT World

Design Engineering

Agentic Design Systems and Design Tokens as AI Substrate β€” Components Become Contracts

Two complementary essays this week argue the same point from different ends: a design-systems analysis frames components as 'contracts' (intent, constraints, anti-patterns, approval rules) that agents can read and safely act on, while a separate piece argues semantic design tokens β€” not better AI tools β€” are what determine whether AI-generated UI stays consistent at scale. Both land alongside the 'why every AI-built app looks the same' multi-agent critique and Filament Studio v1.3.0's MCP foundation for dynamic data models.

For someone living at the design/code seam, this is the operationally relevant version of the broader 'agents need structured context' argument. The implication is concrete: token systems, component metadata, and contract definitions are now the leverage points where agent quality is actually determined. The Knak case study from last week (designers prototyping in production via Claude Code) only works if the design system is structured enough for agents to reason about β€” otherwise you get the 'generated, not designed' polish-without-thinking failure mode the Nervegna essay describes.

Verified across 3 sources: The Design System Guide · Medium / Ariansyah Ricky · Medium / Towards Data Engineering

Cross-Repo Context for Coding Agents: Three Teams Independently Built the Same Primitive

Three independent teams (Neilos, Mabl, Meta) published variations on the same solution within weeks: dependency graphs as queryable substrates that live outside any single agent's context window, ranging from lightweight coordination layers to 850-line registries. Companion data point: Mnemo shipped this week as an MCP server that gives Cursor / Claude Code / Amazon Q persistent AST-based codebase memory, decoupling context from any single vendor.

This is the cross-repo equivalent of the context-engineering thread from earlier this week β€” agent productivity is bounded by what the agent can see, and naive context management fails at scale. When three serious teams converge on 'queryable dependency graph as runtime infrastructure' inside a month, that's a primitive emerging. For anyone running multi-repo systems, the question is whether you build this yourself or wait for it to standardize as a managed layer; the MCP-based approach (Mnemo, Filament) suggests the standardization will happen at the protocol layer first.

Verified across 2 sources: Dev.to / Daniel We · Dev.to / Nikhil102

Spokane / North Idaho

Spokane Distributes Free Water Filters to ~4,000 West Plains Homes After PFAS Contamination from Airport Firefighting

The City of Spokane, in coordination with the county and Spokane International Airport, distributed free water filters to nearly 4,000 West Plains households whose wells were contaminated by PFAS from airport firefighting operations. A water station is operating on Garden Springs Road while permanent remediation is pursued β€” municipal water-line extensions and point-of-entry treatment systems funded by a $7.5M county grant. Same news cycle: Idaho's statewide drought declaration (all 44 counties, second-warmest winter on record) and Premera/MultiCare negotiations risking in-network coverage for 100K+ Washington patients on June 1.

The PFAS story connects directly to the Idaho statewide drought declaration (all 44 counties, second-warmest winter on record) landing the same week β€” both are pressure on the same West Plains aquifer system. That aquifer connection also ties back to last week's Idaho HB 895 closed-loop cooling mandate for data centers, which was explicitly framed as a drought safeguard. The $7.5M county grant and the city/county/airport cost-sharing model will likely become the template if other Inland Northwest airfields surface similar contamination β€” federal liability for PFAS from military/aviation firefighting remains unresolved nationally.

Verified across 3 sources: FOX 28 Spokane · Idaho Press · FOX 28 Spokane / MultiCare

Spokane Local Pulse: Habitat Gets $6.5M State Boost, Street-Racing Ordinance Filed, Kootenai GOP Splits Three Ways Ahead of May 19

Habitat for Humanity Spokane received $6.5M in state funding for infill, Spokane Valley new construction, and Highline Village expansion in Airway Heights. Councilmember Paul Dillon proposed a new street-racing ordinance with escalating penalties and vehicle seizure for repeat offenders, addressing the gap between state highway law and city streets. North Idaho: the May 19 Kootenai County GOP precinct election splits 141 candidates across three factions (North Idaho Republicans, Kootenai Freedom Caucus, and a Candlelight Christian Fellowship-aligned bloc) for 74 committeeman seats β€” a structural fight that determines how candidates get endorsed across the region.

Habitat's state allocation is a meaningful signal that Washington's housing capital is reaching the Inland Northwest at scale; pair with last week's Charlie's Produce and MLK Center mixed-use updates. The Kootenai GOP factional split matters beyond party politics β€” it determines which voices control candidate selection for state legislative races, county commission, and ultimately land-use and growth policy across one of Idaho's fastest-growing counties.

Verified across 3 sources: Spokane Public Radio · KHQ · Spokesman-Review

Newport Beach

Newport Beach: Lido Theater Wins Art Deco Preservation Award; OC Cities Split Sharply on ICE Cooperation

The Lido Theater β€” a 520-seat 1939 Art Deco cinema on Balboa Peninsula β€” received the 2026 Preservation Award from the Art Deco Society of California following a five-year restoration completed October 2024 (recovered Catalina-style tile, underwater ocean murals, original neon). Separately, a Voice of OC investigation found 24 of 34 OC cities have taken zero official action on intensified federal immigration enforcement since June 2025, while Anaheim and Santa Ana stood up aid funds and Huntington Beach declared itself a non-sanctuary city actively supporting ICE coordination.

Two different shapes of municipal posture in the same county. The Lido recognition is a marker of how Newport's preservation politics are getting validated externally even as the city greenlit demolition of the Regal Edwards Big Newport last week for two 22-story condo towers β€” the line between 'preserve' and 'redevelop' is now being drawn at specific buildings, not as a citywide doctrine. The ICE-response disparity matters for anyone doing business across OC city lines: legal environment, labor risk, and community-relations expectations now vary materially within a 20-mile radius.

Verified across 2 sources: Newport Beach Indy · Voice of OC

Iran Conflict

Iran's 14-Point Counterproposal Dies Same Day: Trump Rejects, Netanyahu Demands Uranium Removal, Drone Attacks on UAE/Kuwait/Qatar

On May 10, Iran formally submitted a counterproposal via Pakistani mediators: end the war, gradual Hormuz reopening, lift the US naval blockade, dilute highly enriched uranium with third-party transfer, release frozen assets, and a 30-day nuclear negotiation window. Trump rejected it the same day as 'totally unacceptable.' Netanyahu publicly stated the war 'is not over' and demanded uranium removal and facility dismantling as preconditions. Iran ran fresh drone attacks against UAE, Kuwait, and a bulk carrier near Qatar; a parliamentary security official declared 'our restraint is over.' Brent rose 4.1% to ~$105.50. UK announced plans to deploy a warship for Hormuz shipping security; Iran threatened a 'decisive response' to French/British deployment. The Ocean Koi seizure on May 8 β€” the third since the April ceasefire, after Epaminondas and MSC Francesca β€” preceded this breakdown.

The substance gap is now public and specific: Iran wants sanctions and the blockade lifted before nuclear concessions; the US and Israel want uranium removal first. That's structurally the same impasse that collapsed the Islamabad talks, but now with Iran's 440kg of 60%-enriched uranium and the 14-point US MOU both on the table simultaneously. NBC's assessment that Iran can absorb the blockade for months β€” consistent with the CIA estimate you've seen β€” directly undercuts the administration's compressed-pressure timeline. The Trump-Xi Beijing meeting is now the next meaningful inflection; China's Caspian drone-resupply corridor to Iran (flagged May 10) means Beijing already has leverage it hasn't deployed diplomatically.

Verified across 6 sources: Institute for the Study of War · BBC · Reuters · NBC News · CNBC · AP

OSINT & Intelligence

Fake OpenAI 'Privacy Filter' Hit #1 on Hugging Face β€” 244K Downloads of a Rust Infostealer Linked to ValleyRAT/Silver Fox

HiddenLayer disclosed a typosquatted Hugging Face repo (Open-OSS/privacy-filter) impersonating OpenAI's Privacy Filter that hit #1 trending with 244,000 downloads before removal. A Python loader deployed the sefirah Rust infostealer (browser creds, crypto wallets, SSH keys, Discord tokens). Infrastructure overlaps with a prior npm typosquat (trevlo) tied to ValleyRAT and Chinese threat actor Silver Fox β€” same C2 domain (welovechinatown.info), suggesting a coordinated cross-ecosystem operation.

The Hugging Face supply-chain attack surface is now operationally similar to npm/PyPI β€” typosquatting plus metrics-padding to game trending lists, then exfiltrating developer credentials at scale. The cross-ecosystem infrastructure reuse is the OSINT-relevant signal: defenders correlating C2 domains across registries can now attribute campaigns across npm, Hugging Face, and PyPI as one operation. For any team pulling models from HF, this is the second major typosquat campaign in two months β€” repo provenance and signature checks need to be part of the model-loading pipeline, not a nice-to-have.

Verified across 3 sources: The Hacker News · Lyrie.ai / HiddenLayer · GBlock

Vantor Lands $70M NGA Contract for Multi-Source GEOINT Platform; SOCOM Tests SkyFi on ATAK for Direct Satellite Pulls

Vantor (the rebranded Maxar Intelligence satellite-imagery arm) secured a $70M NGA contract for a source-agnostic platform integrating petabyte-scale commercial electro-optical, SAR, and hyperspectral data with viewing/streaming/analysis at enterprise scale. Companion development: US Special Operations Command is field-testing SkyFi's mobile platform, which delivers commercial satellite imagery and analytics directly to operators' ATAK devices β€” letting tactical-edge users pull archived imagery and task new collections without going through a centralized processing center.

Two converging moves push commercial GEOINT closer to the operator and farther from the analyst pipeline. The Vantor contract reinforces NGA's AI Blueprint thread (announced last week) β€” multi-source data fusion is becoming the agency's procurement priority, and commercial suppliers are absorbing more of the integration layer. The SkyFi/ATAK test is the more interesting structural signal: tactical-edge satellite tasking democratizes capability that used to require analyst intermediation, with obvious downstream implications for civilian responders, commercial OSINT shops, and the maritime/conflict monitoring work Windward and Bellingcat have been publishing.

Verified across 2 sources: BizWest · SpaceNews


The Big Picture

Governance layers chase the agent stampede ServiceNow Build Agent goes cross-IDE, OpenAI ships Auto-Review for Codex, and Forbes warns QA is being gutted under AI velocity. The pattern: every productivity gain is racing an audit/security gap, and the response is meta-tooling (separate review agents, enterprise governance overlays) rather than slowing down.

Pilot purgatory is the supply-chain AI story The GEP/Darden survey of 180 CSCOs (74% still in planning, <10% scaled) lands the same week Descartes, Rockwell, Avanade, and DHL ship narrow execution-layer wins. The lesson is consistent with last week's RELEX Open thesis: the constraint isn't model quality, it's process redesign and data hygiene.

Diplomacy theater around the Strait Iran's 14-point counterproposal, Trump's same-day rejection, Netanyahu's uranium-removal red line, and fresh drone attacks on Gulf shipping all on May 10. Both sides are negotiating in public to set narrative β€” and Brent at $105 says markets are pricing the breakdown, not the deal.

Supply-chain attacks weaponize trust surfaces TrustFall in MCP-enabled coding agents, the fake OpenAI Privacy Filter on Hugging Face (244K downloads), and US Treasury sanctioning Chinese satellite imagery firms feeding Iran. The common thread: trusted distribution channels (MCP, model hubs, commercial GEOINT) are now the attack surface.

Design systems become agent contracts The 'agentic design system' essay, the design-tokens-as-foundation argument, Mnemo's MCP memory layer, and three independent solutions to cross-repo agent context all point the same direction: components and codebases must be machine-readable contracts, not just human-readable artifacts.

What to Expect

2026-05-13 Weegloo launches β€” AI agent that infers content models from generated UIs and wires them to CMS/APIs.
2026-05-19 Kootenai County GOP precinct elections (141 candidates, 74 seats, three factions); Idaho House District 4B primary (Price vs. Hazel).
2026-05-28 Huntington Beach housing-mandate compliance deadline; ruling on $50K/month retroactive fines expected before May 15.
2026-06-01 Premera–MultiCare contract deadline β€” 100K+ Washington patients face out-of-network costs if no deal; GitHub Copilot AI-credits billing change takes effect.
2026-07-04 WSDOT target reopening for North Cascades Highway (SR 20) after winter damage; Meadowlane Road J-turn project also completes mid-to-late July.

Every story, researched.

Every story verified across multiple sources before publication.

🔍

Scanned

Across multiple search engines and news databases

789
📖

Read in full

Every article opened, read, and evaluated

161

Published today

Ranked by importance and verified across sources

15

β€” The Anvil

πŸŽ™ Listen as a podcast

Subscribe in your favorite podcast app to get each new briefing delivered automatically as audio.

Apple Podcasts
Library tab β†’ β€’β€’β€’ menu β†’ Follow a Show by URL β†’ paste
Overcast
+ button β†’ Add URL β†’ paste
Pocket Casts
Search bar β†’ paste URL
Castro, AntennaPod, Podcast Addict, Castbox, Podverse, Fountain
Look for Add by URL or paste into search

Spotify isn’t supported yet β€” it only lists shows from its own directory. Let us know if you need it there.