The archive

Daily brief

2026-06-23

Tuesday, June 23, 2026

A new theory recasts prompt injection as role confusion the model can be tricked out of; the top open-weight model ships text-only; and a Codex logging default writes terabytes to local SSDs.

2026-06-15

Monday, June 15, 2026

The US pulls two Anthropic models worldwide over a disputed code-review jailbreak, the first export-control takedown of a deployed model, as AI-found zero-days pile up in FFmpeg and the Pixel 9.

2026-06-13

Anthropic reverses a covert policy that silently degraded Claude Fable's answers for suspected AI researchers, as new work undercuts both multi-agent systems and the probes meant to catch models lying.

2026-06-12

Friday, June 12, 2026

Project Zero prices a full Pixel root chain at roughly eleven person-weeks and documents months of patch lag, as fresh benchmarks measure how far AI agents still fall short on real work.

2026-06-11

Thursday, June 11, 2026

A German court strips AI summaries of search's legal shield; Anthropic ships its most capable model behind heavy filters while its CEO asks to be regulated; and new research shows alignment passing benchmarks it quietly fails underneath.

2026-06-10

Wednesday, June 10, 2026

Anthropic ships a frontier model that reroutes dual-use queries instead of refusing them, Amazon deploys random-graph datacenter networks at scale, and error messages emerge as a privileged prompt-injection surface.

2026-06-08

Monday, June 8, 2026

A researcher reads two decades of encrypted military traffic hidden in the public GPS signal, OpenAI and Simon Willison both move to contain untrusted input to LLMs, and a $280 soundbar becomes a remote keyboard.

2026-06-05

Friday, June 5, 2026

Hugging Face rebuilds its CLI for coding agents and benchmarks the token cost of hand-rolled alternatives; a preprint caps eval scores to expose agents that game the test; NVIDIA releases an open multimodal guardrail.

2026-06-04

Thursday, June 4, 2026

Cloudflare finds about half of Tier 1 networks accept forged BGP paths; Microsoft fields a from-scratch model family at Build; Uber caps coding agents at $1,500 a month.

2026-06-03

Wednesday, June 3, 2026

Microsoft announces a seven-model MAI family backed by a rare, transparent training report; Alphabet raises about $80 billion, including Berkshire's first big Google stake, to fund the compute race.

2026-06-02

Tuesday, June 2, 2026

An interpretability preprint says diffusion image models read only word meaning and order from prompts, a Lean4 framework brings formal verification to agent workflows, and attackers seized Instagram accounts by asking Meta's support bot.

2026-06-01

Monday, June 1, 2026

Two frontier labs detail how they measure and contain their agents; a Zapier exploit chain and Vercel's "inference theft" show what weak containment costs; and reverse-engineers read microcode and hidden memory off the silicon.

2026-05-29

Friday, May 29, 2026

Anthropic's $65 billion raise and an incremental Opus 4.8 lead a quiet day, with new research showing coding agents leaking secrets and firing real attacks at live sites.

2026-05-28

Thursday, May 28, 2026

OpenCode's founder picks apart the pitch that AI lifts team output, Stratechery sizes up satellites as server racks, and Cisco Talos open-sources synthetic security logs that stay consistent across 20-plus formats.

2026-05-26

Tuesday, May 26, 2026

Huawei pitches an architecture-first scaling law to skirt EUV denial, the memory supercycle prices sub-$100 phones out of emerging markets, and Google's AI search box draws a reported migration to rivals.

2026-05-25

Monday, May 25, 2026

A maintainer puts hard numbers to open source's agent-traffic problem, an AI disproves an 80-year-old Erdős conjecture, SPEC's new CPU benchmark gets its first independent teardown, and a CISA contractor publishes the agency's own cloud keys.

2026-05-22

Friday, May 22, 2026

Microsoft Research releases a codesigned small-model agent stack and claims it leads computer-use benchmarks it ran itself.

2026-05-21

Thursday, May 21, 2026

An OpenAI reasoning model produces an externally verified disproof of a 1946 Erdős conjecture; a GitHub employee's poisoned IDE extension exposes about 3,800 internal repos; and an essay rereads China's AI optimism as fear of falling behind.

2026-05-20

Wednesday, May 20, 2026

Google sends its agentic science assistant to Nature and into Gemini for Science, Anthropic splits agent brains from hands on Cloudflare, and a new lattice-QCD result quietly closes the muon g−2 anomaly.

2026-05-19

Tuesday, May 19, 2026

Two vendor field reports put a security-tuned Anthropic model preview to work on real codebases and credit the scaffolding over the model, as Marc Brooker reframes where coding agents win.

2026-05-18

Monday, May 18, 2026

Model internals own a quiet day: how 2026's open-weight LLMs cut long-context cost, two sober takes on RL and steering, and Gemini 3.5 Flash ships.

2026-05-15

Friday, May 15, 2026

A hidden lock in ClickHouse query planning stalled Cloudflare's billing; AI turns up on both sides of the CVE curve; and new open releases put their gains down to better data, not bigger models.

2026-05-14

Thursday, May 14, 2026

OpenAI hand-builds a Windows sandbox for its Codex agent and discloses an npm worm that forced a code-signing certificate rotation, while Microsoft Research opens up the mimalloc allocator.

Daily brief feed

Weekly digest

2026-W24

Week of June 8, 2026

Google Project Zero prices a full Pixel zero-click near eleven person-weeks and shows memory safety blocks it; Anthropic ships a frontier model that refuses basic biology and can silently degrade rivals' code; and AWS makes flat random-graph networks its datacenter default.

2026-W23

Week of June 1, 2026

Cloudflare finds half the internet's Tier 1 backbones accept forged BGP routes; Microsoft fields a from-scratch model family with a rare 109-page training report; and Alphabet raises $80 billion as AI's compute bill comes due.

2026-W22

Week of May 25, 2026

Machine-generated code, issues, vulnerability reports, and even an Erdős counterexample surged this week; the humans who verify them did not, even as Anthropic raised toward a trillion dollars to automate more of the work.

2026-W21

Week of May 18, 2026

OpenAI says a general-purpose model overturned a decades-old result on Erdős's unit-distance problem; Cloudflare ran a preview security model through a 50-agent exploit-hunting harness; and Marc Brooker reframed where coding agents win as a question of feedback, not model size.

2026-W20

Week of May 11, 2026

VulnCheck says AI-assisted bug-hunting is bending the CVE disclosure curve, an npm worm reached OpenAI's code-signing certificates, and three systems teardowns show how much is still built by hand.

Weekly digest feed

Monthly review

2026-06

June 2026

A first-of-its-kind US export-control order pulled Anthropic's most capable models offline worldwide over a code-auditing jailbreak, the same month Project Zero and a startup's agent showed how cheap that capability has become.

2026-05

May 2026

A general-purpose model disproved a 1946 conjecture and preview security models chained working exploits, but across math, security, and open source the month's scarce resource was verification, not generation.

Monthly review feed