June 25, 2026 · Thursday

OpenAI Launches First In-House AI Chip Jalapeño

Designed from the ground up and produced with Broadcom, Jalapeño is purpose-built for the LLM workloads powering ChatGPT, Codex, the API, and future agentic products.

OpenAI's first proprietary AI chip, purpose-built for LLM inference workloads powering ChatGPT and the API at scale.

OpenAI has crossed a critical threshold in its vertical integration strategy with the announcement of Jalapeño, its first proprietary AI chip. Built in partnership with Broadcom, the chip is purpose-designed for the large language model workloads that underpin ChatGPT, Codex, the OpenAI API, and upcoming agentic products. The move signals OpenAI's intent to reduce reliance on third-party silicon as it scales inference to hundreds of millions of users. Industry observers note that custom silicon has become table stakes for frontier AI labs: Google has its TPUs, Amazon has Trainium, and Microsoft is rumored to be developing its own accelerators. OpenAI's entry into the chip race closes one of the last remaining gaps in its full-stack ambition, spanning from model research all the way down to deployment hardware.

Products & Models June 25
Creative AI · Video & Image June 25

Qualcomm Partners with Hugging Face

At Qualcomm's Investor Day, CEO Cristiano Amon and Hugging Face CEO Clement Delangue announced a partnership as "one more thing," though specific collaboration details remain undisclosed. The move signals growing hardware-software alignment in the AI ecosystem.

Anthropic Negotiates Fable 5 Unlock with Trump Administration

WIRED reports Anthropic co-founder Tom Brown has replaced Dario Amodei as lead negotiator with the Trump administration over lifting restrictions on the Fable 5 model. One source said Brown communicates more directly than Amodei.

Huawei Claims 950 SuperPOD Demo at Shanghai Expo in Mid-July

Huawei plans to showcase an 8192-NPU, 160-cabinet SuperPOD at the Shanghai World Expo by mid-July, signalling mass production of the 950DT chip and China's entry into its domestic Hopper+ era.

MiniMax M3 Becomes Default Builder Model for Kimchi Coding

MiniMax's M3 model — with open weights, 1M context window, and strong coding capability — was selected as the default builder model in Kimchi Coding by Cast AI.

DeepMind Explores the Rise of Agent Economies

A new podcast examines what happens when millions of AI agents begin negotiating, transacting, and delegating — and how to diversify decision-making to avoid AI groupthink.

Sakana AI Partners with OpenRouter for Resilient Architecture

Sakana AI's Hardmaru announced a partnership with OpenRouter, noting products like OpenRouter Fusion and Sakana Fugu spark important conversations about dependency and resilience in AI.

Seedance Video Cost: 40K Tokens per Second of 1080p

Official documents reveal 1 second of Seedance video at 1080p uses 40,000 tokens. At Doubao's 180T tokens per day, that translates to roughly 150 million people generating 30 seconds each — a sobering scaling constraint.

Cola Launches Seed 2.1 Pro with ColaOS

Cola released Seed 2.1 Pro, a natively multimodal model with enhanced coding and agent capabilities over 2.0. ColaOS, described as an operating system with soul, features a persistent agent that remembers users and grows over time.

NVIDIA Full-Stack AI Powers Autonomous Brand Operations

NVIDIA highlighted its full-stack AI capabilities in causal marketing analytics, trustworthy agentic workflows, and real-time hyper-efficient auction bidding for global brands.

Agent Defined: LLM + Instructions + Tools + Environment

A simple agent definition: an LLM backbone running in an agentic loop, with four components — the model, instructions, tools, and the environment.

Zhipu AI Goes from HKD 120 IPO to Beating DeepSeek

Zhipu AI IPO'd at HKD 120 per share in January. GLM has since surpassed DeepSeek as a leading open model, and the company is returning to San Francisco.

Claude Code Web Hit by GitHub Egress Policy Block

Simon Willison reported Claude Code for Web displays "GitHub is blocked by egress policy," severely disrupting workflows that involve cloning repositories for reference documentation.

Governments Should Build Model Evaluations Like Civics Exams

A researcher proposed each government create environments and evaluations for desired model capabilities, similar to various civics examinations.

AI Commercialization Is Fundamentally a 2Boss Model

A commentary argued China's AI monetization follows a 2Boss pattern: bosses pay for programmers to use Claude and Codex, and for creators to use Seedance.

Tools & Infrastructure AI Engineering

FAV0 · AI Daily