Claude Code Team Writes 65% of Product Code, Including Claude Tag Itself
Anthropic reveals Claude Tag has been powering its own development internally all year — now it writes nearly two-thirds of the product team's code.
The Claude Code team has been shipping with Claude Tag internally all year, and the results are striking. The AI now writes 65% of the product team's code, including most of what built Claude Tag itself. The system handles tasks ranging from feature development to bug fixes, operating as a first-class member of the engineering team. Anthropic shared a thread detailing how the team uses it every day — from code review assistance to generating entire pull requests that pass review with minimal changes. The milestone underscores the accelerating trend of AI systems not just assisting developers but becoming core contributors to production software.
OpenAI Ships 30+ Models and API Upgrades in Six Months
New models include GPT-5.5, GPT-5.4 mini, GPT-5.4 nano, GPT-Realtime-2, and GPT-Realtime-Whisper. Alongside the model launches, OpenAI shipped dozens of feature upgrades for developers building on the API, from enhanced function calling to expanded context windows and lower latency endpoints.
Mistral OCR 4 Covers 170 Languages With Bounding Boxes and Confidence Scores
Mistral OCR 4 introduces structured output with bounding boxes, block classification, and inline confidence scores across 170 languages. The new version represents a significant leap in document understanding, providing developers with richer machine-readable outputs that go far beyond simple text extraction.
"This is a new paradigm for interacting with Claude that is significantly more inline with all the other human activity org-wide."
— Andrej Karpathy on Claude Tag
NVIDIA Powers 81% of TOP500 Supercomputers
NVIDIA-powered systems account for 81% of the TOP500 and 89% of all new systems, with AI training throughput double the combined total of all other platforms and inference throughput triple. The top eight most energy-efficient systems on the Green500 are also NVIDIA-powered.
OpenAI DevDay 2026 Opens Applications
DevDay returns to San Francisco on September 29, featuring tech talks, hands-on demos, workshops, and direct access to the developer tools team. Applications close July 10.
OpenAI Funds Maintainers, Invests in Rust, Expands Codex for OSS
OpenAI directly funded open-source maintainers, invested in the Rust ecosystem, launched the Patch the Planet security initiative, and expanded Codex for OSS to more maintainers in just one week.
Engram Founded: PhD Dropout Launches Continual Learning Startup
jxmnop left a PhD at the end of 2025 and co-founded Engram, a startup dedicated to training AI models that learn over time, remember details, and adapt — rather than static one-shot models.
Chollet: 2040 AI Will Be Built on Symbolic Learning
François Chollet believes current AI stacks have 3–4 orders of magnitude data inefficiency and 4–5 orders of compute inefficiency. Near-optimal AI will be achieved through symbolic learning, not scaling alone.
Cornell Recovers $100K in Unidentified Payments Using Claude
Cornell's finance and AI teams built a custom Claude treasury skill that automated manual investigations, resulting in $100,000 recovered in back payments — a case study in combining grassroots AI adoption with dedicated builder teams.
DFlash on Blackwell: Up to 15x Throughput, SGLang Integration
SGLang now supports block-diffusion drafting (DFlash) on NVIDIA Blackwell, achieving up to 15x throughput improvement. Migrating from EAGLE requires only a configuration change, not code edits.
NVIDIA Agent Toolkit Brings Nemotron Models to Enterprise
NVIDIA's new Agent Toolkit integrates open Nemotron models, tools, skills, and a secure runtime, helping teams build domain-specific AI agents tuned to their own workflows.
Baidu Releases Unlimited-OCR, Joining the OCR Model Wave
Baidu open-sourced Unlimited-OCR, adding to a growing field alongside Mistral OCR 4, as OCR capabilities see rapid advancement across multiple labs.
Seedance 2.0 Touted as First Production-Grade AI Video Model
Higgsfield claims Seedance 2.0 in native 4K is the first truly production-ready AI video model, crossing the threshold from demo to deployable output.
GLM Now Available on Perplexity Agent API
Perplexity's Agent API now supports the GLM model, confirmed by Arav Srinivas. The Zhipu-developed model has drawn attention for its competitive performance against frontier models.
Ling and Ring 2.6: Agentic Intelligence at Trillion-Parameter Scale
The Ling and Ring 2.6 technical report claims efficient and instant agentic intelligence operating at trillion-parameter scale, pushing the frontier of large-scale AI agents.
Runway Launches Seedance 4K, Mini, and Kling 3.0 Turbo
Runway integrates Seedance 4K, Seedance Mini, and Kling 3.0 Turbo into one platform, claiming "the world's best models, in one place."
Claude Tag: AI Joins Slack as a Team Member
Anthropic launches Claude Tag, letting teams assign tasks to Claude in Slack with step-by-step delivery in-thread. Available as Research Preview for Enterprise and Team customers.
Nathan Lambert's 7.4-Hour Distillation Lecture Drops
New lecture covers the full distillation literature from Hinton 2015 to modern multi-teacher on-policy strategies.
Krea AI Open-Sources Krea-2-Raw on Hugging Face
Krea AI released Krea-2-Raw and Krea-2-Turbo model weights to Hugging Face, with the open-source image model ranking first on Artificial Analysis.
Krea 2 Tops Text-to-Image on Artificial Analysis
Independent lab Artificial Analysis ranks Krea 2 as the top text-to-image model, marking a milestone for open-source AI image generation.
Ilya Sutskever: Pre-Training Scaling Is Hitting a Data Wall
Ilya Sutskever says the AI scaling era is running out of road because pre-training is about to hit a finite-data wall, forcing the field to find new approaches.
GLM Rivals Frontier Models in Blind Tests
Arav Srinivas says GLM matches frontier models in blind comparisons, potentially revitalizing the open-source AI movement.
Claude Tag's Channel-Based Context Design Explained
oran_ge breaks down Claude Tag's design: it uses channel-based context to define multi-agent boundaries, rather than traditional identity or team-wide context models.
Cursor Adds Team Plugin, Skill, and MCP Leaderboard
Cursor introduced a leaderboard for popular plugins, skills, and MCPs across teams, with one-click install from the new Customize page.
vLLM Integrates DFlash Speculative Decoding
vLLM now supports DFlash speculative decoding via the open-source Speculators library — switching from EAGLE-3 is config-only, no code changes required.
NVIDIA AI Infra Powers 700+ NAIRR Research Projects
NVIDIA's two-year contribution to the NAIRR pilot has helped power over 700 U.S. research projects via DGX reference architecture.
New Intelligent Voice Model Has a 'Seedance Moment'
oran_ge reports a new voice model that goes far beyond traditional TTS, describing it as a genuine intelligent voice system rather than a reading machine.
Huawei All-CPU Supercomputer Tops Top500
An all-CPU supercomputer using Huawei chips ranked first in the Top500, marking China's return to the top of the supercomputing leaderboard.
Mythos-Level Models Face Open-Source Risk Within 6–12 Months
emollick warns that all Mythos-level models invite similar risks, and open-source versions may appear within months — urging clearer government focus on risk preparation.
Frontier RL Environments Hackathon: 200 Builders, 24 Hours
MiniMax, hud_evals, and Y Combinator ran a hackathon turning browser-use, robotics, logistics, and worldsims into RL training environments.
Recovering World Models by Inverting the Bellman Equation
Google DeepMind researchers discovered that inverting the Bellman equation can recover an agent's world model from its value function — a finding with deep theoretical implications.
SGLang Serves DeepSeek-V4 on GB300, 5x Throughput
A new lmsys blog details using SGLang to serve DeepSeek-V4 on GB300, achieving 5x throughput with equivalent interactivity from day zero.
Claude Design One-Click Deploy to Vercel
Vercel CEO demonstrates one-click deployment from Claude Design directly to Vercel, shrinking the gap between AI-generated UI and deployment to near zero.
Gemini Vision Passes Black Ant Perception Test
fofrAI's test of Gemini's vision skills went viral with nearly 650K views, as the model impressively passed a challenging visual recognition scenario.
DeepSeek API Now Compatible with Claude Code Tools
DeepSeek's API models are now compatible with tools like Claude Code, broadening their interoperability across the AI development ecosystem.
Hugging Face Becomes Go-To Data Store for Robotics
Clement Delangue reports HF robotics datasets exploded from 1,000 in early 2025 to 60,000 today, as the platform becomes the preferred home for massive, append-only data.
Seedance 2.0 4K Handles Any Genre and Camera Movement
Seedance 2.0 at 4K resolution supports any genre, VFX, or camera movement, demonstrating broad versatility and high visual quality. The team teases that Seedance 2.5 will raise the bar further.
OpenAI Expands Daybreak Project for Machine-Speed Patching
OpenAI is expanding Daybreak to democratize vulnerable software patching at machine speed, using Codex security plugins and other tools.
Chollet: Complexity Is a Tax on Every Future Change
François Chollet argues that elegant code is about maintainability, not aesthetics — complexity is a tax you pay on every future change.
Google Employee Fired for Writing Workspace CLI in Rust
Google employee Justin Poehnelt, a seven-year veteran, was fired for building and publishing a Google Workspace CLI in Rust under the official GitHub org — sparking community controversy.
NVIDIA Brings 24/7 AI Agents to Telecom at DTW Ignite
NVIDIA showcased data, model, simulation, and security runtime stacks enabling telecom operators to deploy autonomous agentic workflows for network operations.
GitHub Copilot App Gains Bring Your Own Key (BYOK) Support
GitHub Copilot added BYOK functionality, compatible with Ollama, foundry, and any OAI-completions or Anthropic-compatible service.
Verification Stack: 2.4x Faster PRs from Agent-Generated Code
gneubig's team open-sourced a method that cuts garbage from agent-generated code, merging PRs 2.4x faster while maintaining code quality.
Waymo Zero Fatalities, Tesla FSD ~4x Safer Than Human Drivers
levelsio aggregated safety data showing Waymo has had zero fatalities, and Tesla FSD's fatal accident odds are about 1/4 of average human driving.
'Vibe Coding' Surges as AI-Assisted Programming Goes Mainstream
venturetwins declares that AI-assisted "vibe coding" is becoming as popular as playing video games, reflecting the casualization of software creation.
ByteDance Officially Announces Seedance 2.5, Updates Seedance 2.0
Seedance 2.0 now supports 4K resolution, and the newly announced Seedance 2.5 promises further advanced performance.
Phase Shift Predicted: Mythos-Class Models, GW Campuses by Late 2026
teortaxesTex predicts multiple US labs releasing Mythos-level models with RSI potential, gigawatt-level compute campuses, and Chinese startups entering the "Hopper" era by Q4 2026.
Hugging Face CEO Teases New Collaboration
Clement Delangue recorded a video teasing a new partnership, with details to be announced.
ByteDance Seed 2.1 Pro Strengthens Agent and Coding Abilities
op7418 tested Seed 2.1 Pro and reports notable improvements in agent and coding capabilities, positioning Doubao as a primary content creation model.
World Action Models Survey Published
A comprehensive survey paper covers methods and advances in world action models for AI.
PlanBench-XL Evaluates LLM Tool-Use Long-Horizon Planning
A new benchmark assesses LLM agent planning across large-scale tool ecosystems.
Lift4D: Single-View 3D to 4D Reconstruction
New paper reconciles single-view 3D estimation for in-the-wild 4D reconstruction.
Multi-Vector Embeddings Provably More Expressive
Proof confirms multi-vector embeddings are strictly superior in approximation capacity over single-vector.
CMU Launches Modern GPU Programming for ML Systems Course
CMU's SCS introduced a new mini-series on modern GPU programming tailored for ML practitioners.
Zhipu: The Company Behind GLM
Known previously as Zhipu, z.ai is the company behind the GLM model series, drawing attention from the global AI community.
AutoScientist Trains Finance-Specific Models on User Data
Adaption AI launches AutoScientist for domain-specific finance model training, arguing generic models fall short in specialized domains.
Codex Disk Writing Bug Fixed in CLI v0.142.0
The excessive disk writing issue has been resolved; users need to manually update the CLI.
New Visual Benchmark Goes Viral with 660K Views
fofrAI announces a new vision benchmark that rapidly accumulated over 660,000 views.
gdb Hints at OpenAI-Samsung Partnership
Greg Brockman posted "OpenAI for Samsung," hinting at a potential collaboration between the two companies.