AI News

May 21, 2026 · 03:20 PM UTC

  1. Gemini 3.5 Flashsimonwillison.net

    Release: llm-gemini 0.32 New model gemini-3.5-flash for Gemini 3.5 Flash . See also my notes on Gemini 3.5 Flash , and <a href="https://simonwillison.net/2026/May/19/gemin

    Also: Simon Willison, Hacker News
  2. OpenAI Is Preparing to File for an IPO Soonwsj.com

    Sources: OpenAI is preparing to file confidentially for an IPO as early as May 22 and has a goal to be ready to go public as early as September — The artificial-intelligence giant is working with bankers at Goldman Sachs and Morgan Stanley — ChatGPT-maker OpenAI has been working …

    Also: Hacker News
  3. GitHub links the breach of 3,800 internal repositories to the TanStack npm supply-chain attack, saying hackers used a malicious Nx Console VS Code extensionbleepingcomputer.com

    GitHub links the breach of 3,800 internal repositories to the TanStack npm supply-chain attack, saying hackers used a malicious Nx Console VS Code extension — GitHub says the hackers who breached 3,800 internal repositories gained access via a malicious version of the Nx Console VS Code extension …

    Also: Techmeme
  4. An OpenAI model has disproved a central conjecture in discrete geometry trendingopenai.com

    OpenAI says an internal general-purpose reasoning model has disproved the Erdős unit distance conjecture, a central problem in discrete geometry posed in 1946 — For nearly 80 years, mathematicians have studied a deceptively simple question: if you place nnn points in the plane …

    Also: OpenAI Blog
  5. Amazon Leo’s leaders provide an inside look at the satellite broadband network’s past and futurethe-decoder.com

    Deepseek is building a new team in Beijing to develop its own AI code agent, working title "Deepseek Code," a direct competitor to Claude Code, Codex, and Cursor. Applicants should know agent loops, MCP, and context engineering and be heavy users of existing coding tools.</p

    Also: The Decoder, GeekWire, Reddit
  6. Everything new in our Google AI subscriptions, fresh from I/O 2026thenewstack.io

    Google on Tuesday launched a new $ 100-per-month AI Ultra plan that sits neatly between its existing $200 and $20 The post Google launches $100 AI Ultra plan and

    Also: Google AI Blog, Wired AI, MarkTechPost, The New Stack
  7. 110 tok/s with 12GB VRAM on Qwen3.6 35B A3B and ik_llama.cppgithub.com

    New development in llama.cpp: feat: Add WAV MIME type variants and improve audio format detection (#23396)

    Also: llama.cpp, Reddit
  8. Anthropic Introduces MCP Tunnels for Private Agent Access to Internal Systemsinfoq.com

    Anthropic has expanded its Claude Managed Agents platform with two enterprise-focused capabilities: self-hosted sandboxes and MCP tunnels. The release aims to address a recurring challenge in enterprise AI deployments, where organizations want to use autonomous agents but cannot allow execution...

    Also: InfoQ AI, Reddit
  9. Horizon — multi-provider Flutter chat client. Ollama (local + Cloud), Claude, OpenAI, Gemini. Android / macOS / Windows / .deb / tar.gzgithub.com

    Pre-release for Ollama. This version of Ollama will change the architecture to directly support llama.cpp instead of building on top of GGML, and allows for compatibility with GGUF file format. MLX is used to accelerate mode

    Also: Reddit
  10. Anthropic is officially set to be profitable as of Q2 2026wsj.com

    Investor disclosures: Anthropic says it expects to generate $10.9B in revenue in Q2, up 127% from $4.8B in Q1, and turn a $559M operating profit, its first ever — The startup expects a 130% revenue surge to $10.9 billion in the June quarter and its first operating profit, defying skeptics of the AI...

    Also: Reddit
  11. Qwen3.7-Max: The Agent Frontierqwen.ai
  12. Has any Big Tech company done as many AI-related reorgs in the past five years than Meta?bsky.app

    Has any Big Tech company done as many AI-related reorgs in the past five years than Meta?

  13. MCP AI integration without creating a security mess?reddit.com
  14. Gemini Omni is quite good at instruction following: "sea otter in a pilot's uniform explains why Spirit Airlines went ba...bsky.app

    Gemini Omni is quite good at instruction following: "sea otter in a pilot's uniform explains why Spirit Airlines went bankrupt to a river otter who is distracted by their laptop while they are in a hot air balloon over NYC. in the next balloon over, william shakespeare fights a robot made of pizza"

  15. On-policy distillation is on track to be a lasting method in post-training. The list of areas would be: Instruction tun...bsky.app

    On-policy distillation is on track to be a lasting method in post-training. The list of areas would be: Instruction tuning (SFT/IFT) RLHF Direct Preference Optimization (DPO et al) RLVR On-policy Distillation (OPD) New classes of methods are rare! Excited to play.

  16. I stopped using LangChain for my retrieval pipeline — here's what the benchmark numbers actually look likereddit.com
  17. Supertone/supertonic-3 — text-to-speech modelhuggingface.co

    Text-to-speech model. 34,965 downloads. 518 likes on HuggingFace

  18. Formal Verification Gates for AI Coding Loopsreubenbrooks.dev
  19. Nuxt MCP Toolkit now supports MCP appsvercel.com

    The Nuxt MCP Toolkit now supports MCP apps . Your agent tools can return interactive HTML responses that MCP clients like Claude and ChatGPT render inline, rather than plain-text responses. Declare a tool with the defineMcpApp macro, then read pre-hydrated data, trigger follow-up prompts, or call...

  20. One Model, Three Modalities: ByteDance Releases Lance for Image and Video Understanding, Generation, and Editingmarktechpost.com

    ByteDance's Intelligent Creation Lab has released Lance, an open-source native unified multimodal model that handles image and video understanding, generation, and editing — all within a single framework, using only 3B activated parameters. The post One Model, Three Modalities: ByteDance Releases...

  21. openbmb/MiniCPM-V-4.6 — vision-language modelhuggingface.co

    1B parameter vision-language model. 196,105 downloads. 845 likes on HuggingFace

  22. Clouted wants to take the guesswork out of making short videos go viraltechcrunch.com

    The video clipping startup raised a $7 million seed round led by Slow Ventures.

  23. torchtune: PyTorch native post-training libraryarxiv.org

    Modern LLMs typically require multistage training pipelines to achieve strong downstream performance, with post-training serving as the main interface for adapting open-weight models. We introduce torchtune, a PyTorch-native library designed to streamline the post-training lifecycle of LLMs...

  24. SulphurAI/Sulphur-2-base — video generation modelhuggingface.co

    9B parameter video generation model. 1,198,471 downloads. 1,218 likes on HuggingFace

  25. FullFlow: Upgrading Text-to-Image Flow Matching Models for Bidirectional Vision--Language Generationarxiv.org
  26. 'Am I OpenAI compatible' - a tool and documentation for unified api signatures in open source AI.github.com
  27. The Agentic Payment Protocol Warsdev.to

    X402 vs UCP vs ACP vs AP2, And Why the Answer Isn't Picking a Winner I've spent the last year integrating every major agentic payment protocol into a single SDK. Not studying them from the outside actually writing the adapter code, handling the edge cases, debugging the interop failures. Here's...

  28. Vercel AI SDK: fix(google): read serviceTier from x-gemini-service-tier response header (#14937)github.com

    New development in Vercel AI SDK: Version Packages (canary) (#15491)

    Also: Vercel AI SDK
  29. tashfeenahmed/freellmapigithub.com

    OpenAI-compatible proxy that aggregates free-tier keys from ~14 AI providers with automatic failover. For personal experimentation only. (3,250 stars)

  30. SpaceX's IPO, Nvidia earnings, Bezos on the AI bubble and more in Morning Squawkcnbc.com

    Here are five key things investors need to know to start the trading day.

  31. Big model loading time speedup. Update guys!github.com
  32. Is There Any Official CVPR 2026 Mobile App Yet?reddit.com
  33. LangChain: chore: bump langsmith from 0.8.3 to 0.8.5 in /libs/partners/anthropic (#37564)github.com

    New development in LangChain: chore: bump langsmith from 0.8.3 to 0.8.5 in /libs/partners/anthropic (#37564)

    Also: LangChain
  34. Things I learned building an end-to-end ML pipeline on Kubernetes: from data validation to live predictionsyoutube.comAlso: Reddit
  35. Grok Build 0.1 now available on Vercel AI Gatewayvercel.com

    Grok Build 0.1 is now available on Vercel AI Gateway . This is a beta coding model trained for agentic coding, currently in early access, and powers the Grok Build CLI app. Reasoning effort is not configurable, and there is no non-reasoning mode. To use Grok Build 0.1, set model to...

    Also: Vercel Blog
  36. multica-ai/multicagithub.com

    The open-source managed agents platform. Turn coding agents into real teammates — assign tasks, track progress, compound skills. (30,476 stars)

  37. DataDog/integrations-coregithub.com

    Core integrations of the Datadog Agent

  38. LangGraph 1.0 has been out for 7 months now. What are you shipping with it?reddit.com
  39. FIKA-Bench: From Fine-grained Recognition to Fine-Grained Knowledge Acquisitionarxiv.org
  40. Chat SDK now includes AI SDK toolsvercel.com

    Chat SDK now ships a built-in AI SDK toolset through the new chat/ai subpath. One createChatTools(chat) call wires Chat SDK's read and write actions into your agent. Approval by default: write tools are gated by a requireApproval option. Presets: reader , messenger , and moderator scope the...