May 21, 2026 · 03:20 PM UTC
Release: llm-gemini 0.32 New model gemini-3.5-flash for Gemini 3.5 Flash . See also my notes on Gemini 3.5 Flash , and <a href="https://simonwillison.net/2026/May/19/gemin
Also: Simon Willison, Hacker NewsSources: OpenAI is preparing to file confidentially for an IPO as early as May 22 and has a goal to be ready to go public as early as September — The artificial-intelligence giant is working with bankers at Goldman Sachs and Morgan Stanley — ChatGPT-maker OpenAI has been working …
Also: Hacker NewsGitHub links the breach of 3,800 internal repositories to the TanStack npm supply-chain attack, saying hackers used a malicious Nx Console VS Code extension — GitHub says the hackers who breached 3,800 internal repositories gained access via a malicious version of the Nx Console VS Code extension …
Also: TechmemeOpenAI says an internal general-purpose reasoning model has disproved the Erdős unit distance conjecture, a central problem in discrete geometry posed in 1946 — For nearly 80 years, mathematicians have studied a deceptively simple question: if you place nnn points in the plane …
Also: OpenAI BlogDeepseek is building a new team in Beijing to develop its own AI code agent, working title "Deepseek Code," a direct competitor to Claude Code, Codex, and Cursor. Applicants should know agent loops, MCP, and context engineering and be heavy users of existing coding tools.</p
Also: The Decoder, GeekWire, RedditGoogle on Tuesday launched a new $ 100-per-month AI Ultra plan that sits neatly between its existing $200 and $20 The post Google launches $100 AI Ultra plan and
Also: Google AI Blog, Wired AI, MarkTechPost, The New StackNew development in llama.cpp: feat: Add WAV MIME type variants and improve audio format detection (#23396)
Also: llama.cpp, RedditAnthropic has expanded its Claude Managed Agents platform with two enterprise-focused capabilities: self-hosted sandboxes and MCP tunnels. The release aims to address a recurring challenge in enterprise AI deployments, where organizations want to use autonomous agents but cannot allow execution...
Also: InfoQ AI, RedditPre-release for Ollama. This version of Ollama will change the architecture to directly support llama.cpp instead of building on top of GGML, and allows for compatibility with GGUF file format. MLX is used to accelerate mode
Also: RedditInvestor disclosures: Anthropic says it expects to generate $10.9B in revenue in Q2, up 127% from $4.8B in Q1, and turn a $559M operating profit, its first ever — The startup expects a 130% revenue surge to $10.9 billion in the June quarter and its first operating profit, defying skeptics of the AI...
Also: RedditHas any Big Tech company done as many AI-related reorgs in the past five years than Meta?
Gemini Omni is quite good at instruction following: "sea otter in a pilot's uniform explains why Spirit Airlines went bankrupt to a river otter who is distracted by their laptop while they are in a hot air balloon over NYC. in the next balloon over, william shakespeare fights a robot made of pizza"
On-policy distillation is on track to be a lasting method in post-training. The list of areas would be: Instruction tuning (SFT/IFT) RLHF Direct Preference Optimization (DPO et al) RLVR On-policy Distillation (OPD) New classes of methods are rare! Excited to play.
Text-to-speech model. 34,965 downloads. 518 likes on HuggingFace
The Nuxt MCP Toolkit now supports MCP apps . Your agent tools can return interactive HTML responses that MCP clients like Claude and ChatGPT render inline, rather than plain-text responses. Declare a tool with the defineMcpApp macro, then read pre-hydrated data, trigger follow-up prompts, or call...
ByteDance's Intelligent Creation Lab has released Lance, an open-source native unified multimodal model that handles image and video understanding, generation, and editing — all within a single framework, using only 3B activated parameters. The post One Model, Three Modalities: ByteDance Releases...
1B parameter vision-language model. 196,105 downloads. 845 likes on HuggingFace
The video clipping startup raised a $7 million seed round led by Slow Ventures.
Modern LLMs typically require multistage training pipelines to achieve strong downstream performance, with post-training serving as the main interface for adapting open-weight models. We introduce torchtune, a PyTorch-native library designed to streamline the post-training lifecycle of LLMs...
9B parameter video generation model. 1,198,471 downloads. 1,218 likes on HuggingFace
X402 vs UCP vs ACP vs AP2, And Why the Answer Isn't Picking a Winner I've spent the last year integrating every major agentic payment protocol into a single SDK. Not studying them from the outside actually writing the adapter code, handling the edge cases, debugging the interop failures. Here's...
New development in Vercel AI SDK: Version Packages (canary) (#15491)
Also: Vercel AI SDKOpenAI-compatible proxy that aggregates free-tier keys from ~14 AI providers with automatic failover. For personal experimentation only. (3,250 stars)
Here are five key things investors need to know to start the trading day.
New development in LangChain: chore: bump langsmith from 0.8.3 to 0.8.5 in /libs/partners/anthropic (#37564)
Also: LangChainGrok Build 0.1 is now available on Vercel AI Gateway . This is a beta coding model trained for agentic coding, currently in early access, and powers the Grok Build CLI app. Reasoning effort is not configurable, and there is no non-reasoning mode. To use Grok Build 0.1, set model to...
Also: Vercel BlogThe open-source managed agents platform. Turn coding agents into real teammates — assign tasks, track progress, compound skills. (30,476 stars)
Core integrations of the Datadog Agent
Chat SDK now ships a built-in AI SDK toolset through the new chat/ai subpath. One createChatTools(chat) call wires Chat SDK's read and write actions into your agent. Approval by default: write tools are gated by a requireApproval option. Presets: reader , messenger , and moderator scope the...