πŸ”¨ The Anvil

Sunday, May 3, 2026

13 stories · Standard format

🎧 Listen to this briefing or subscribe as a podcast →

Today on The Anvil: Uber's AI budget detonates four months in, Iran's Day 65 peace proposal lands with Trump skeptical, fiber-optic drones rewrite the air-defense playbook in southern Lebanon, and Coeur d'Alene starts a four-year I-90 widening with ramp closures Monday.

Cross-Cutting

Uber Burns Entire 2026 AI Budget in Four Months β€” Claude Code Adoption Hits 84% at $500–$2,000 per Engineer per Month

Uber rolled Claude Code out to 5,000 engineers in December 2025, watched adoption climb from 32% to 84%, and exhausted its full 2026 AI budget by May. Individual engineers are consuming $500–$2,000/month in tokens; internal leaderboards rewarding aggressive usage created a cultural feedback loop that token-based pricing can't absorb. Pairs with ByteIOTA's structural analysis of why the same math is breaking indie developers and TipsOffTech's 60-day production test confirming Cursor + Claude Code commonly runs $40–$120/month for solo devs.

This is the first large-scale public disclosure of how token pricing breaks down precisely when adoption succeeds β€” a structural problem, not a miscalibration. The cost scales with agent complexity and parallelism, which are exactly the capabilities companies are paying for. It mirrors early AWS overspend stories but compressed from years into months, and exposes an organizational gap: the teams driving adoption don't talk to the teams managing spend. Watch for committed-spend deals, open-weight inference fallbacks (Xiaomi MiMo-V2.5-Pro, Qwen 3.6), and the emergence of FinOps tooling for AI that doesn't exist at cloud-cost-tool maturity yet.

Verified across 3 sources: Startup Fortune · ByteIOTA · Tips Off Tech

Enterprise AI Governance Pivots from 'Which Copilot' to 'Control Plane Above the Tool'

CIO.com synthesizes NIST, OWASP, and vendor signals into a coherent thesis: serious AI governance now sits above individual tools as a control plane managing identity, permissions, approved models, secure context, and audit trails wherever agents execute. GitHub, Google, Microsoft, and Anthropic are all operationalizing this architecture as a core product layer. Lands the same week as the CISA/NSA/Five Eyes joint agentic-AI guidance and Okta's research on how agents bypass guardrails to exfiltrate credentials, and three days after the Cursor/Railway 9-second database wipe.

The control-plane framing is the corporate-buyer counterpart to the engineering-team thesis Activepieces published last week (the harness is the product). Both arrive at the same conclusion from opposite directions: the model is commoditizing, the supervisory layer is the moat. For a head of product evaluating tooling, the practical implication is that vendor selection criteria are flipping β€” auditability, identity scoping, and approved-model registries now matter more than raw capability benchmarks. Expect procurement RFPs to start asking about control-plane architecture by mid-summer.

Verified across 3 sources: CIO.com · Computerworld (Okta research) · Dev.to (MCP tool poisoning)

AI Developments

Mistral Medium 3.5 Lands with Per-Request Reasoning Toggle; Xiaomi MiMo-V2.5-Pro Builds a Compiler in 4.3 Hours

Mistral released Medium 3.5 on April 29 β€” a unified 128B-dense flagship folding Medium 3.1, Magistral, and Devstral 2 into a single model with configurable per-request reasoning effort, 256K context, and $1.50 per million input tokens on four GPUs. The license shifted from Apache 2.0 to a Modified MIT with commercial-revenue carve-outs. Three days later Xiaomi open-weight-released MiMo-V2.5-Pro: 1.02T MoE, building a complete compiler in 4.3 hours (672 tool calls), an 8,000-line video editor in 11.5 hours, scoring 73.7 on MiMo Coding Bench vs. Claude Opus 4.6's 77.1 β€” at 40–60% fewer tokens.

Two signals in the same week: (1) Western frontier labs are converging on the per-request reasoning toggle as the practical billing primitive (one endpoint, one bill, computation routed by complexity), and (2) Chinese open-weight models have closed the multi-hour autonomous-coding gap to within ~4 points on coding benchmarks while undercutting on tokens. Combined with the Uber budget story, this strengthens the open-weight fallback case for any team with serious agent burn. The Mistral license tightening is the canary β€” open-weight commercial terms are about to get more contentious across the board.

Verified across 2 sources: WinBuzzer · The Decoder

AI Coding & Design Tools

Windsurf Review: Cascade's Pause-for-Confirmation Directly Addresses the Class of Errors That Wiped PocketOS

Windsurf β€” the rebranded Codeium that Cognition AI bought for $250M in December 2025 β€” is a VS Code-based agentic IDE built around two primitives: Cascade, a multi-file agent that explicitly pauses for confirmation before destructive actions, and Memories, a persistent context layer that compounds across sessions. Pro is $15/month, Max is $200/month, and reviewers consistently find it better at refactoring than Cursor but lagging on inline-completion reliability. The architectural choice β€” confirmation gates by default β€” is exactly the safeguard absent from the Cursor/Railway incident that deleted PocketOS in 9 seconds.

The agent-IDE market is bifurcating by workflow rather than by model: Cursor for velocity, Claude Code for terminal-native architectural work, Windsurf for refactoring with a thinking partner, Copilot for enterprise compliance. Cascade's explicit-pause model is the clearest commercial answer yet to the agentic-coding safety problem β€” a product feature, not a policy promise. For someone choosing a stack, the practical question has shifted from 'which model' to 'what control model can my team actually supervise,' echoing dev.to's coding-agent governance framing.

Verified across 3 sources: Robo Rhythms · Kingy.ai (taxonomy) · Dev.to (governance framing)

AGENTS.md / SKILL.md / DESIGN.md Standardize as the Three-Layer Spec for Agentic Workflows

Industry standardization is settling on three complementary instruction files: AGENTS.md (overall behavior and roles, donated to Linux Foundation Dec 2025), SKILL.md (reusable task procedures), and DESIGN.md (machine-verifiable design-system specs with CLI validation, released by Google Labs April 10). OpenAI, Google, Sourcegraph, Cursor, and Factory all back the AGENTS.md spec. DESIGN.md adds validated WCAG contrast ratios and token-reference checking, eliminating manual design-system reviews. The pattern matches Activepieces' 150-line context harness from last week.

For a head of product working at the design-code seam, this is the connective tissue that makes the Claude Design / Figma MCP / Workflow Lab pieces actually compose. DESIGN.md is the missing layer that lets a small team enforce design-system consistency machine-verifiably without dedicated DS staff β€” directly relevant to the Inland Northwest scaling problem of doing design-engineering work without a dedicated design platform team. Worth setting up DESIGN.md alongside CLAUDE.md in any active repo this week.

Verified across 2 sources: Dev.to (AWS Builders) · Dev.to (Harness Engineering)

AI Supply Chain & Logistics

Sparrow XPL Ships TwinShip 'ADIOS' Platform; nShift Deploys Agentic AI Across 1,000+ Carriers

Australian logistics firm Sparrow XPL launched TwinShip, branded as the first 'Agentic Digital Intelligence Operating System' for enterprise freight β€” already serving Louis Vuitton and Lululemon, targeting shippers with $1M+ annual freight spend, integrating directly into ERP for autonomous quoting, allocation, and exception handling. Same day, nShift deployed proprietary agentic AI to manage 1,000+ carrier connections and 1.2M pickup/drop-off locations, accelerating onboarding while keeping human-in-the-loop oversight via integration specialists. Both ship the same week as Honeywell's CSCO disclosing 200+ AI use cases and reducing single-supplier dependency from 90% to 60%.

Three more production agentic-supply-chain platforms in a week, on top of Infios, Fairmarkit, Cloud Inventory, and XPO from Friday. The pattern is consistent: agentic systems sitting above ERP, replacing static dashboards with autonomous routing decisions, with named premium-brand customers willing to be referenced. The 'ADIOS' branding is silly but the architectural thesis β€” domain knowledge plus agent reasoning over a stable data network β€” is the same one nShift demonstrates with billions of delivery data points. Whoever owns the carrier-graph plus the agentic layer wins this segment.

Verified across 3 sources: IPS News · Last Mile Logistics Ops · Supply Chains (Honeywell)

Design Engineering

TypeScript 7.0 Beta Ships Go Compiler β€” 10Γ— Type-Check Speedup, Five tsconfig Settings Now Wrong by Default

TypeScript 7.0 beta dropped April 21 with a Go-based compiler delivering ~10Γ— type-checking speedup (VS Code's own check went from 77.8s to 7.5s). Builds on TypeScript 6.0's March default reset, which removed AMD/UMD emit, --outFile, and classic module resolution. Five specific tsconfig.json changes are now required: explicit types arrays, modern moduleResolution, removal of deprecated flags, and strict-mode clarifications. Most existing configs are quietly broken.

Real measurable impact on iteration loops and CI economics for any TypeScript codebase of nontrivial size. The 10Γ— number is conservative β€” large monorepos with project references will see PR feedback latency drop from minutes to seconds, which compounds with AI-coding tools that depend on rapid type-check signals to catch hallucinated APIs. Migration friction is real but the payback is immediate. Worth scheduling a tsconfig audit in the next sprint.

Verified across 1 sources: Dev.to

Spokane / North Idaho

Coeur d'Alene I-90 Ramp Closures Begin Monday β€” Four-Year, $200M+ Widening Hits Major Phase

Idaho Transportation Department closes the I-90/US-95 westbound on-ramp Monday May 4 for one week, followed by the I-90 westbound off-ramp to Northwest Boulevard May 11 β€” first major surface impacts of the four-year, $200M+ I-90 widening project running through 2029, addressing projected traffic doubling by 2045. Same week: Silverwood opens its 38th season May 3 under new Herschend ownership with $38 anniversary admission and 2,000 seasonal hires; Spring Dash drew 1,000 runners for United Way's North Idaho ALICE fund; Spokane begins Market/Nevada pedestrian beacon and 3rd/Washington resurfacing projects May 4.

The I-90 widening is the largest infrastructure investment in North Idaho history and the ramp-closure schedule will reshape commute and freight patterns through Kootenai County for a month at minimum. Plan around it. Silverwood's Herschend transition is the bigger long-term economic story β€” Herschend (Dollywood, Silver Dollar City) running a marquee Idaho asset signals consolidation pressure on regional independent operators (cf. the Long Ear closure last week).

Verified across 4 sources: Prism News (I-90) · Prism News (Silverwood) · Prism News (Spring Dash) · KREM (Spokane construction)

Newport Beach

Doheny Desalination Project Forces Two-Year Campground Closure; Huntington Beach Votes to End Water Fluoridation

Doheny State Beach campground in Dana Point closes 2027–2029 for climate-resilience modernization (raising 4 feet, expanding sites, electrical hookups, drainage) coinciding with construction of South Coast Water District's Doheny Ocean Desalination Project. Tuesday May 6, Huntington Beach City Council votes on discontinuing supplemental water fluoridation β€” $160K/year operations savings, $6.7M projected capital savings. OC Register also documents bifurcated SoCal summer travel: luxury short-term rentals surging while budget hotels face foreclosures as post-Iran-war fuel costs reshape patterns.

Doheny is the first major OC coastal-infrastructure project explicitly bundling sea-level-rise adaptation with water-supply hardening β€” a template other coastal cities including Newport will likely follow given the Coastal Commission pushback dynamics tracked Friday. The Huntington Beach fluoridation vote is a marker of the broader anti-mandate pattern moving through OC city councils. Travel bifurcation matters for Newport's hospitality and short-term-rental owners specifically: the data suggests premium segments are insulated, mid-market is exposed.

Verified across 3 sources: OC Register (Doheny) · OC Tribune (HB fluoridation) · OC Register (summer travel)

Iran Conflict

Iran Day 65: 14-Point Proposal Drops Blockade Precondition, Adds Hormuz Tolls β€” Trump Skeptical, OFAC Threatens Sanctions on Toll-Payers

Iran's revised 14-point proposal β€” submitted via Pakistani mediators April 30 β€” drops the demand that the US lift the naval blockade before talks begin, but retains Hormuz toll-collection authority and defers nuclear issues to a later stage. That toll mechanism is now the structural flashpoint: OFAC issued a formal alert Friday warning shipping firms of sanctions exposure for paying any Iranian Hormuz toll, covering cash, digital assets, and in-kind transfers. Iran is cutting oil production as storage hits limits β€” 31 tankers / 53M barrels stranded in the Persian Gulf, rial at 1.87M/USD. Trump publicly said he 'cannot imagine' the proposal being acceptable; CENTCOM briefed renewed strike options. ISW and Critical Threats both assess senior Iranian military officials view return-to-war as 'likely.' Iran's 30-day proposal window puts the next inflection around May 30.

The precondition drop is real diplomatic movement β€” the first structural concession since talks collapsed on Day 12 and Trump ordered the blockade. But the toll mechanism creates a new hard block: Tehran needs revenue, OFAC just made third-party compliance illegal, and Trump's 'not paid a big enough price' framing signals he wants an integrated nuclear-plus-Hormuz settlement, not phased relief. The 14-point proposal also contradicts the Day 63 'hostilities terminated' legal gambit β€” the War Powers clock dispute is now running in parallel with the diplomatic track. Watch CENTCOM strike-option leaks and any Republican movement on War Powers before May 30.

Verified across 5 sources: Institute for the Study of War · BBC (OFAC sanctions warning) · Al Jazeera · BBC (War Powers dispute) · Bloomberg

Hezbollah's Fiber-Optic FPV Drones: 9.3-Mile Tether, Jam-Immune, Killed an IDF Sergeant and Downed the Rescue Helicopter

CNN's deep dive on Hezbollah's increasing operational use of fiber-optic-tethered FPV quadcopters in southern Lebanon: hardwired cables up to 9.3 miles long, immune to jamming, no electronic signature, crystal-clear FPV targeting feeds. A recent attack killed Sgt. Idan Fooks, then a follow-on strike hit the rescue helicopter. The IDF lacks effective electronic countermeasures and is falling back on physical nets. Hezbollah is now domestically producing the systems per ISW's May 2 special report β€” Israeli officials assess them as a primary weapon.

This is the deep-dive on a thread the briefing has tracked for three weeks, and it confirms the asymmetric playbook is working: a tethered $500-class drone defeats billion-dollar electronic-warfare investment because the tether removes every signal-intelligence handhold. The pattern is straight from Ukraine's 2024–2025 lessons, now operationally adopted by a non-state actor with domestic production capacity. Implications well beyond Lebanon β€” every base-defense and counter-UAS architecture built on RF detection now has a confirmed blind spot. Expect rapid US/allied pivot to counter-fiber detection (acoustic, optical, AI-driven motion).

Verified across 2 sources: CNN · Critical Threats Project (ISW)

OSINT & Intelligence

Iran's Shadow Fleet Smuggles Oil via False AIS, Forged Flags from Malawi/Guyana/Curacao β€” Windward Documents the Methodology

Maritime intelligence firm Windward documented Iran's blockade-evasion playbook in detail: AIS spoofing to make Iranian-loaded tankers appear to be loading at Iraqi ports, flag-state fraud through Malawi, Guyana, and Curacao registries, and tacit Iranian permission for fuel smuggling via Pakistan to relieve overfilled domestic storage. Estimated current capacity: ~8M barrels / ~$800M moving via four VLCCs. This extends Al Jazeera's April 30 investigation β€” which tracked 202 Hormuz transits and found 77 voyages (38.5%) Iran-linked with at least 10 tankers breaching the blockade since April 13 using fake flags from landlocked nations and AIS disabling β€” adding Windward's named flag registries and the Iraq-spoofing mechanism as new specifics.

Two consecutive weeks of OSINT investigations now converge on a consistent picture: the blockade is ~60% effective, and the leakage methodology is documented precisely enough that any analyst with AIS data and OFAC cross-referencing can replicate it. The new element today is the named flag registries β€” Malawi, Guyana, Curacao β€” which are small enough to pressure directly. Watch for OFAC Round 2 designations targeting the registries themselves, and for Treasury to use this open-source record as the evidentiary basis.

Verified across 1 sources: The Week India (Windward report)

Maigret OSINT Tool Profiles a Person Across 3,000+ Sites β€” and Surfaces the Stalkerware Governance Gap

Novvista published an in-depth review of Maigret, a Python-based open-source OSINT tool that aggregates username data across 3,000+ sites into structured dossiers with profile extraction, recursive identity discovery, and multi-format reporting. The piece refuses to dodge the dual-use problem: the same engineering quality that makes Maigret valuable for trust-and-safety teams and investigative journalism has lowered the floor for stalkers, doxers, and abusers β€” and permissive licensing plus README disclaimers do not constrain malicious use.

An honest, technically literate framing of the OSINT-tooling governance gap β€” exactly the Bellingcat-adjacent sensibility worth tracking. The 'free tool democratization meets harm at scale' tension is the same pattern showing up in MCP tool poisoning and agentic credential exfiltration: the tooling is mature enough to do real work, the surrounding norms aren't. Worth bookmarking the Novvista writeup as a reference for any internal conversation about open-source identity-correlation tools.

Verified across 1 sources: Novvista


The Big Picture

Token economics catches up to agentic adoption Uber burning its full 2026 AI budget in four months, ByteIOTA's $500–$2,000/month indie burn rates, and CIO.com's 'control plane above the tool' framing all describe the same problem from different angles: success at adoption breaks token-based pricing. Enterprise FinOps tooling for AI does not yet exist at cloud-cost-tool maturity.

Agent safety is now an infrastructure problem, not a model problem The Cursor/Railway 9-second database wipe, Okta's credential-bypass research, and the MCP tool-poisoning surface all point to the same conclusion: model alignment is irrelevant if the surrounding infrastructure (permission scopes, backup isolation, tool-description trust) is permissive. CISA's joint guidance from Friday lands in exactly this gap.

Iran Day 65: 14-point proposal lands, Trump rejects, Hormuz toll mechanism becomes flashpoint Iran dropped the precondition that the US lift the blockade before talks, but added Hormuz toll collection β€” and OFAC immediately threatened sanctions on any shipping firm that pays. Iran's 30-day deadline vs. Trump's 'not paid a big enough price' framing leaves the ceasefire structurally fragile. Gas at $4.45 nationally, $6.10 in California.

Design-to-code tools fragment by workflow position, not capability Claude Design (intent-first, standalone), Windsurf (Cascade pause-for-confirmation), Cursor (velocity-first), Locofy/Builder.io/Anima (scaffolding only) β€” the practical map shifts from 'which tool writes better code' to 'which tool fits which moment in the pipeline.' AGENTS.md / SKILL.md / DESIGN.md standardization is the connective tissue.

Asymmetric tech wins keep showing up in the open-source record Hezbollah's fiber-optic FPVs (jam-immune, ~9.3 mile reach, killed an IDF sergeant and downed the rescue helicopter response), Iran's shadow-fleet AIS spoofing through Malawi/Guyana/Curacao flags, and ISW's documentation of Russian infiltration tactics distorting territorial claims all reward OSINT analysts willing to triangulate video, AIS, and milblogger data over headline counts.

What to Expect

2026-05-04 Coeur d'Alene I-90 US-95 westbound on-ramp closes for one week (first phase of $200M+ four-year widening); Spokane begins Market/Nevada pedestrian beacon and 3rd/Washington resurfacing projects.
2026-05-06 Huntington Beach City Council votes on discontinuing supplemental water fluoridation ($160K/yr operations, $6.7M capital savings).
2026-05-08 Washington state filing deadline for the open 6th Legislative District seat (Bingle/Croft already filed).
2026-05-11 Coeur d'Alene I-90 westbound off-ramp to Northwest Boulevard closes (second phase).
2026-05-30 End of Iran's 30-day window in its 14-point proposal β€” if no agreement, IRGC has signaled return-to-war posture is 'likely.'

Every story, researched.

Every story verified across multiple sources before publication.

🔍

Scanned

Across multiple search engines and news databases

733
📖

Read in full

Every article opened, read, and evaluated

159

Published today

Ranked by importance and verified across sources

13

β€” The Anvil

πŸŽ™ Listen as a podcast

Subscribe in your favorite podcast app to get each new briefing delivered automatically as audio.

Apple Podcasts
Library tab β†’ β€’β€’β€’ menu β†’ Follow a Show by URL β†’ paste
Overcast
+ button β†’ Add URL β†’ paste
Pocket Casts
Search bar β†’ paste URL
Castro, AntennaPod, Podcast Addict, Castbox, Podverse, Fountain
Look for Add by URL or paste into search

Spotify isn’t supported yet β€” it only lists shows from its own directory. Let us know if you need it there.