June 2, 2026 · Tuesday

OpenAI Frontier Models Now on AWS Bedrock for Enterprise

OpenAI's frontier models and Codex are now available for enterprises via Amazon Bedrock, supporting secure and compliant workflows.

OpenAI announced that its frontier models and Codex are now generally available on AWS, giving enterprises a new way to build on Amazon Bedrock with OpenAI through the security, compliance, and governance workflows they already use. This marks the beginning of a broader expansion of OpenAI's enterprise footprint. Developers can now build AI applications and software engineering workflows with OpenAI models using the AWS environments and controls their teams already trust, bridging the gap between cutting-edge AI capabilities and enterprise-grade infrastructure.

NVIDIA RTX Spark: a 1-petaflop superchip with full CUDA ecosystem.

NVIDIA Unveils RTX Spark: 1 Petaflop Personal AI Superchip

NVIDIA RTX Spark is a 1 petaflop superchip with full CUDA and RTX ecosystem, enabling native Windows AI agents, marking a new era for PCs.

NVIDIA announced the RTX Spark, a one-petaflop superchip that brings the full CUDA and RTX ecosystem to personal computers. The chip enables Windows-native AI agents, representing what the company calls a new beginning for personal computing. With its massive compute capability in a consumer form factor, RTX Spark could reshape how developers and creators run AI workloads locally, challenging the established boundaries between workstation and desktop AI.

One of the new, buzzy jobs in Silicon Valley is the AI Forward Deployed Engineer — an engineer embedded within a client organization to customize solutions, building and tuning agentic workflows for the client's particular needs.

Andrew Ng on the rise of AI FDE roles in Silicon Valley

Luma Launches Open Physical AI Lab for Generalization

Luma established an open-science physical AI lab to solve the generalization problem in physical AI. The lab addresses what stands between current AI systems and a future where AI can meaningfully interact with and improve the physical world through robotic and embodied systems.

Perplexity Launches Search as Code Architecture

Perplexity Agent API introduces Search as Code: agents write Python to call the search stack directly, replacing iterative function calls. Now default in Computer mode, this architecture dramatically streamlines how AI agents interact with search infrastructure, reducing latency and improving reasoning precision.

Tencent Hunyuan Releases Agent Memory Plugin Hy-Memory

Hy-Memory is designed for long-term collaborative agents, based on a 6-layer memory framework and System1/System2 dual-processing system, giving agents a true "second brain." More than a retrieval tool, it enables persistent memory across extended agent tasks and collaborative workflows.

MiniMax M3 Now on Vercel AI Gateway

MiniMax M3 with 1M context and multimodal input is now available on Vercel's AI Gateway, with a 50% discount for the first week. Developers can immediately start building with frontier coding capabilities through the Vercel developer platform.

Replit: Build a Complete Business from a Single Prompt

Replit now generates websites, mobile apps, slide decks, and launch videos from one prompt. The feature includes partner perks from Stripe Atlas, QuickBooks, Mercury, and Doola for launching real businesses.

Runway Aleph 2.0 Introduces Fast Masking

Aleph 2.0 creates compositing mattes in seconds, isolating subjects from backgrounds for compositing, coloring, or applying effects. Users upload video, prompt for a white silhouette on black, review the preview, and export the matte.

Industry Briefs06.02
Research & Partnerships06.02

© 2026 FAV0 · AI Daily