
xAI Launches No-Code Voice Agent Builder
xAI released Voice Agent Builder, a platform enabling users to create human-like voice agents powered by Grok Voice in under two minutes. Targeted at customer support, sales, and telephony workflows, the service is priced at $0.05 per minute and available immediately.

vLLM v0.24.0 Ships With MiniMax-M3 and DeepSeek-V4 Support
The new release packs 571 commits from 256 contributors. Highlights include MiniMax-M3 support with FP8 and MXFP4 quantization across AMD GPUs, DeepSeek-V4 maturation with FlashInfer sparse index cache and prefill chunk-planning on SM120, and Model Runner V2 for quantized models.
Zhipu AI Ships ZCode, the Official IDE for GLM-5.2
ZCode integrates AI agents with existing tools for planning, coding, review, and deployment. Available on macOS, Windows, and Linux with BYOK support for existing subscriptions and APIs. GLM Coding Plan subscribers receive a 1.5× usage quota within the IDE.

OpenAI GeneBench-Pro: GPT-5.6 Sol Sets New Bar in Computational Biology
GeneBench-Pro tests whether models handle the judgment-heavy analysis real-world computational biology demands. Tasks would take a human expert 20 to 40 hours to complete. GPT-5.6 Sol achieved a major breakthrough on this benchmark.
It really is very impressive, but that shows off best in longer, harder tasks. It launched sub-agents, retrieved 2,200+ flight schedules, and sustained hours of autonomous work. When given feedback, it deployed adversarial agent groups for cross-validation.
— Ethan Mollick on Claude Fable 5
Runway Ships Nano Banana 2 Lite at Warp Speed

Runway launched Nano Banana 2 Lite, delivering high-quality images at blazing speed without sacrificing quality. The model is available in Runway Studio and can be invoked through Runway Agent. v0 by Vercel has also integrated NB2L for image generation directly in its prompt bar.
Ollama + MLX Boosts Gemma 4 on Apple Silicon by 90%
Multi-token prediction now on by default for Gemma 4. Ollama auto-tunes draft tokens as it runs, so performance never degrades.
Recraft Drops Two Models, Zero Waiting Lists
Both models are live in Recraft Studio from day one. The company emphasizes instant availability as a core principle.
vLLM Powers Qwen3.6-27B-NVFP4 on NVIDIA Blackwell
Memory drops ~2.5× on local GPUs. Scores MMLU Pro 86.3 with hybrid attention across 27B parameters, optimized for Blackwell.
ChatGPT Plus Rolls Out Personal Finance in the U.S.
The new feature expands AI applications into daily spending, budgeting, and financial planning for Plus subscribers.
NVIDIA Isaac ROS: Open Source Robotics Takes Off
Modular platform for autonomous mobile robots, robotic arms, and humanoids with a trusted, production-ready software stack.
Musk Tours Optimus Robot Assembly Line in Fremont
Footage from the Fremont factory demonstrates mass-production progress for Tesla's humanoid robot program.
GLM Excels in Next.js Code Generation
Rauch confirms GLM tops Next.js evals. The AGENTS.md packaged docs help agents pass additional benchmarks.
Agent Skills Directory: The New npm
Rauch predicts developers will stop cloning repos and instead fetch best-practice instructions for building directly.
Vercel Ships Content Agent Eve
Eve requires only an instructions.md file to run, with tools, skills, channels, and deep Next.js integration.
SMWM: Learning World Models via Inverse Dynamics
FAIR proposes inverse dynamics regularization to prevent representation collapse, training stably from offline reward-free trajectories.
Bloome: Put Claude, ChatGPT, Gemini in One Chat
Cross-agent feedback loops boost research, drafting, and review efficiency. François Chollet endorses the approach.
Cartridge Cuts KV Cache Memory 38×
Offline trained lightweight KV cache avoids full-text reprocessing, achieving 38.6× memory reduction on long-context benchmarks.
LiteResearcher: Agentic RL for Deep Research
Scalable framework training deep research agents through reinforcement learning for automated research.
Runway + Bertelsmann: Global Creative AI Deal
AI models integrated across RTL Group, BMG, and marketing services for advertising, content, and music promotion.
Fine-Tuning Crushes Prompting With Expert Data
John Schulman cites Bridgewater AIA Labs: fine-tuning on expert judgments far outperforms prompting-only approaches.
NVIDIA Releases Nemotron-Labs-TwoTower-30B-A3B
A two-tower architecture with 30B total and 3B activated parameters, posted on Hugging Face to advance AI democratization.
Claude Science Prompt & Skills Go Public
Repository contains prompts, skills, agents, and MCP server assets. Skills defined in YAML for multiple scientific domains.
Together AI Hits 400T Tokens per Month
Co-founder Tri Dao reports surging demand for open models, with inference volumes continuing to climb.
ASPIRE: Robot Physical Self-Improvement
From ENPIRE to ASPIRE, the team builds robot self-improvement components one skill at a time.
Hugging Face Demos Gemma 4 Voice App
Powered by Cerebras, Gemma 4 31B enables fast visual recognition and web search in voice conversations.
Chollet: AI Won't Cause Mass Unemployment
The current AI wave will mainly increase demand for software engineers, with minimal broader labor impact.
Kling AI Ad Wins Two Cannes Lions
'The Last Real Man' takes Silver and Bronze in Film Consumption and AI Craft categories at Cannes Lions 2026.
PixVerse Adds Lip-Sync Feature with Voice Cloning
Upload an image or video and use text or audio to generate realistic lip-synced content.
Sonnet 5's New Tokenizer Matches Opus 4.8 Cost
Tokenizer changes increased encoding costs; Sonnet leads in finance but coding costs may exceed Opus pricing.
Mollick: Benchmark Models for Your Own Use Case
Standard benchmarks cannot capture cascading judgment differences between models like Gemini 3.1 and GPT-5.5.
Anthropic System Prompt Data Collection Exposed
Reports of hidden user-information collection in system prompts spark transparency concerns; company says it will remove the practice.
John Schulman Hires Post-Training Hackers for Tinker
The OpenAI co-founder posted a job opening for post-training experts to enhance the Tinker model.
Vercel Adds Dry-Run Step for Agentic Deployments
New step lets AI agents inspect work before deployment, lowering cost and risk.
'The Art of Debugging' Free eBook Hits 161 Pages
Now in PDF/EPUB with focus on Unix, Python, and PyTorch debugging methodology.
SKILL.md Teaches AI Assistants Better Debugging
Based on 'The Art of Debugging' open book; helps AI coding assistants tackle Unix/Python/PyTorch bugs.
Hugging Face CEO: Open Source Is Safer AI
Clement Delangue argues sunlight is the best disinfectant: more eyes on code means better accountability.
Higgsfield Compares Sonnet 5 vs Opus 4.8 Video Pipeline
Same quality output through GPT Image 2.0 and Seedance 2.0, but each model directs Seedance very differently.
AI Agent Evaluation Becomes Independent Discipline
Multiple papers push SOTA; a new survey distills best practices for agent assessment.
OpenHands Seeks Forward-Deployed Engineers
Enterprise on-premise deployments of OpenHands coding agent platform; custom integrations and agent use cases.
China AI Investment Frenzy: P/E Ratios 50-300
Chinese capital shows extreme risk appetite in AI, with valuations far exceeding U.S. benchmarks.