OpenAI Model Finds Counterexample to 80-Year-Old Erdős Conjecture
An OpenAI model discovered a counterexample to Erdős's eighty-year-old conjecture, a breakthrough shared on the OpenAI Podcast by researchers Alex Wei, Hongxun Wu, and Wojciech Zaremba, showcasing a new paradigm for mathematician-AI collaboration.
ChatGPT Memory System Receives Major Upgrade, Persistent Across Conversations
OpenAI introduced a more capable ChatGPT memory system that carries context across conversations. ChatGPT can now retain relevant information over time, making each interaction more personalized and efficient without requiring users to re-establish context.
OpenAI API Adds Inline Moderation Scores
Developers can now receive content moderation signals for both input and output in the same Responses API and Completions API request flow using the omni-moderation-latest model. The feature supports text and images, is free of charge, and lets applications decide how to use scores for logging, routing, review, or blocking.
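How an application acts on those scores is left to the developer. The sketch below shows one possible routing policy. It assumes the scores arrive as a mapping from category name to a value between 0 and 1; the threshold values and the function name route_by_score are hypothetical and are not part of any OpenAI API.

```python
from typing import Dict

# Hypothetical thresholds (assumptions); tune them per application and category.
BLOCK_AT = 0.90
REVIEW_AT = 0.50
LOG_AT = 0.10


def route_by_score(category_scores: Dict[str, float]) -> str:
    """Map the highest moderation score to one of four actions.

    `category_scores` is assumed to map category names to floats in [0, 1].
    """
    top = max(category_scores.values(), default=0.0)
    if top >= BLOCK_AT:
        return "block"   # refuse to show or send the content
    if top >= REVIEW_AT:
        return "review"  # queue for a human moderator
    if top >= LOG_AT:
        return "log"     # allow, but keep a record
    return "allow"
```

A single policy function like this keeps the thresholds in one place, so the same rule can be applied to both the input and the output scores of a request.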
Codex Introduces Personal Profile Showing Activity and Usage Data
OpenAI Codex now features a personal profile page displaying activity graphs, coding streaks, lifetime token counts, peak daily tokens, and top-used features. Profiles are private by default with an option to share a card publicly.
Codex Launches Build iOS Apps Plugin with Live Preview
The new Build iOS Apps plugin lets Codex view and test iOS applications in the in-app browser, open SwiftUI previews, and hot reload edits without leaving the Codex environment.
Jensen Huang: Agents Become a New Layer of Enterprise Software
NVIDIA CEO Jensen Huang explained that companies including Cadence, CrowdStrike, SAP, ServiceNow, Siemens, and Synopsys are building agents on NVIDIA. He emphasized that the opportunity for software partners is only beginning as agentic AI becomes enterprise infrastructure.
NVIDIA Releases Physical AI Agent Skills at CVPR 2026
At CVPR 2026, NVIDIA announced composable workflows that automate data generation, simulation, and policy training for autonomous vehicles, robots, and vision AI, aiming to speed development for teams that cannot collect sufficient real-world data on their own.
Sakana AI Plans to Build Japan's First 1T Parameter Model
Sakana AI's founder revealed plans to build Japan's first 1-trillion-parameter agent-native model through Japan's METI GENIAC initiative. The model will be specifically optimized for long-horizon deep research and autonomous reasoning.
LM Studio Releases Mobile App, Local Models on the Go
LM Studio launched a mobile application that lets users run their local models directly on their phones, bringing offline AI inference to a pocket-sized form factor.
Safety by narrow control has shown to fail many times. Need more transparency on the absolute frontier, and openness close behind.
Nathan Lambert, AI Safety Researcher
Perplexity and SBA Launch Main Street AI Accelerator
Perplexity partnered with the U.S. Small Business Administration, committing $25 million in compute credits — $250 each for up to 100,000 companies — to accelerate AI adoption among American small businesses.
Perplexity Computer Will Integrate All Business Connectors
Perplexity's CEO announced that the Computer platform will introduce all the connectors needed to start and run a business, allowing anyone with an idea and a small team to build growing companies faster than ever.
NVIDIA DGX Spark Update Boosts Inference Speed 2.6×
NVIDIA DGX Spark updates simplify local agent workflows and accelerate inference up to 2.6× via NVIDIA NemoClaw, announced at GTC Taipei during COMPUTEX.
Replit Partners with Shopify: Build an Online Store in Minutes
Replit Agent now integrates with Shopify. Describe what you want to sell and the agent builds a custom storefront, creates your Shopify store, and helps add products — go live in minutes.
Cursor Adds Canvas Sharing Feature
Cursor now supports publishing canvases — dashboards, reports, and internal tools — as shareable URLs, enabling team collaboration without leaving the editor.
Step 3.7 Flash Deployed on Fireworks AI at 400 Tokens/s
StepFun's Step 3.7 Flash model is now available on Fireworks AI, with MTP-assisted decoding reaching 400 tokens per second. It is designed for capable agents in production.
LlamaIndex Unveils ParseBench at CVPR 2026
ParseBench is the first document parsing benchmark designed specifically for AI agents, treating document understanding as an AGI-complete problem.
Nemotron Parakeet ASR Achieves 97.7% Accuracy on Indonesian
Rafiqspace.ai fine-tuned Nemotron Parakeet ASR to 97.7% accuracy (2.3% WER) for Bahasa Indonesia, cutting transcription costs by up to 90%.
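The two figures are consistent if accuracy is read as 100% minus the word error rate (100 − 2.3 = 97.7). As a rough illustration of how WER is typically computed, the sketch below divides the word-level edit distance (substitutions, deletions, and insertions) by the number of reference words. The function name and the sample sentences are illustrative assumptions, not taken from the Rafiqspace.ai evaluation.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # matching i reference words against an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Illustrative sentences only: one substituted word out of four.
wer = word_error_rate("saya makan nasi goreng", "saya makan nasi putih")
accuracy = 1.0 - wer
```

Because insertions count as errors, WER can exceed 100% on very poor transcripts, so "accuracy" defined this way is only meaningful when the error rate is low.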
Cohere Wins NATO Cognitive Warfare AI Challenge
Cohere took first place in NATO's Agentic AI for Cognitive Warfare Innovation Challenge, ahead of OpenMinds, Ipsos, and Thoughtworks.
Runway Launches Aleph 2.0 Precise Editing
Runway's Aleph 2.0 provides finer video editing control, changing only user-specified parts while keeping the rest of the frame untouched.
Pika Launches First In-App Group Chat AI Agent
Pika introduced an in-app group chat where AI agents can help with phone updates, create memes, and collaborate on creative projects.
Ollama Supports Gemma 4 12B Across All Platforms
Ollama now runs Gemma 4 12B, launchable inside Claude Code, Hermes Agent, OpenClaw, and Codex via the ollama launch command.
Jeff Dean Recommends Gemma 4 12B for Laptops
Google's chief scientist called Gemma 4 12B a super capable open weights model that runs directly on a laptop.
Ideogram 4 Goes Open Weights, Ranks as Best Open Image Model
Ideogram 4 is Ideogram's first open-weight text-to-image model, trained from scratch with 9.3B parameters, structured JSON prompts, multilingual text rendering, and native 2K output.
Free vLLM Community Course Covers Full Deployment Optimization
Red Hat and DeepLearning.AI jointly released a free vLLM course with three hands-on labs covering quantization, deployment, and benchmarking on live vLLM servers.
Andrew Ng Launches New Course on Serving LLMs Efficiently
DeepLearning.AI teamed up with Red Hat to teach how to serve models to many concurrent users at low latency and reasonable cost, covering efficient memory management for large parameter models.
Runway Sees 50% Token Growth in 6 Weeks
Runway's CEO shared that token consumption grew 50% in six weeks, the number of power users rose 140%, and enterprise net dollar retention (NDR) hit 300% as the platform becomes more embedded in daily workflows.
Nathan Lambert: US Open Model Labs Reverse the Decline
Since last June, releases from NVIDIA, Ai2, and Arcee, along with Gemma and GPT-OSS, have put the US back on the map for open models, a dramatic reversal from being "totally owned" a year ago.