Qwen3-Next achieves state-of-the-art performance with reduced training cost, researchers develop VaultGemma, a 1B-parameter differentially private LLM, and a new tool called Llm-optimizer enables benchmarking and optimization of LLM inference across frameworks.
The Center for the Alignment of AI Alignment Centers launches to tackle AI risks, async AI programming revolutionizes developer workflows, and researchers develop Refrag, a framework to reduce system latency in large language models.
A hacker integrated a live LLM into Animal Crossing on a GameCube, researchers developed R-Zero, a self-evolving reasoning LLM framework, and a new platform called ROS-MCP-Server enables connecting large language models with ROS robots using MCP.
A hacker used a "memory mailbox" technique to add a live LLM to Animal Crossing, Anthropic's $1.5B AI copyright settlement was rejected by a federal judge, and researchers introduced R-Zero, a self-evolving LLM framework that generates its own training data.
Researchers suggest AI may follow a normal technological revolution path, a malicious code compromise was discovered in popular NPM packages including debug and chalk, and a new Spec-kit tool by GitHub aims to revolutionize AI coding with Spec-Driven Development.
Taco Bell's AI drive-thru faces chaos, researchers trick LLM-based NPCs into revealing secrets, and Semantic Cache offers high-performance semantic caching for Go applications.
The US is urged to ban AI surveillance due to significant privacy risks, a portfolio website mimicking Windows XP has been created, and researchers have developed Contemplative AI, which incorporates principles from mindfulness and non-duality for more resilient AI systems.
Anthropic agrees to pay $1.5B to settle a lawsuit with book authors, GLM-4.5 is released with advanced AI models for agent-oriented applications, and researchers find that language models process suspense differently than humans and cannot accurately estimate its progression in stories.
Le Chat introduces custom MCP connectors and a "Memories" feature, researchers redesign data systems to be agent-first for Large Language Model agents, and LLMberjack offers a simple open-source Go interface for multiple LLM providers.
MIT study reveals AI use can reprogram the brain, leading to cognitive decline, a new framework called SchedCP enables Large Language Model agents to optimize Linux schedulers, and Amazon releases Amazonq.nvim, an official AWS AI assistant plugin for Neovim.
AI web crawlers are overwhelming websites, researchers are reviving "world models" for more intelligent AI systems, and a new tool called DeepDoc performs deep research on local files to generate markdown reports.
Cloudflare Radar reveals AI traffic trends, cybercriminals leverage Claude for sophisticated attacks, and researchers propose SparseLoCo for communication-efficient LLM training.
A third of senior developers say over half their code is AI-generated, researchers propose a background-independent algebra in quantum gravity, and Pitaya is introduced as an AI coding agent orchestrator that runs multiple agents in parallel.
The US Special Operations Command seeks to develop AI-powered propaganda tools, researchers have constructed a non-Rupert polyhedron, and a new library called Keeptalking provides a simple interface to interact with OpenAI-compatible LLM servers.
Taco Bell rethinks AI drive-throughs after viral mistakes, researchers identify collaboration and trust as key to AI coding evolution, and a new framework called Type-Compliant Adaptation Cascades enables reliable composition of Large Language Models for complex workflows.
Researchers develop CCPS to improve LLM confidence estimation, a custom CLI coding agent is built with Pydantic-AI to enhance coding workflows, and SwiftAI is introduced as an open-source library to easily build LLM features on iOS/macOS.
A self-proclaimed AI hater argues the technology is fundamentally flawed, a new framework called Active Reading improves large language models' knowledge absorption, and Vectorless RAG PageIndex achieves 98.7% accuracy on the FinanceBench benchmark without using vectors.
Google introduces Gemini 2.5 Flash Image for state-of-the-art image generation, researchers develop Jet-Nemotron for a breakthrough in LLM speed, and Sideko launches a hybrid deterministic/LLM generator for automating API work.
Researchers find Agentic AI browsers vulnerable to scams, a study proposes the open-source AnalogSeeker language model for analog circuit design, and developers release Agent-C, an ultra-lightweight 4KB AI agent written in C.
Comet AI browser is vulnerable to prompt injection attacks, a new evaluation finds open models often outperform closed models for personal use cases, and researchers introduce DeepConf, a method to scale LLM reasoning with confidence scores.
Google reduces AI query energy cost by 33 times, developers share tips for working with LLM coding agents like Claude Code, and researchers develop a complex number extension of standard continued fractions with unique representations.
Google reduces AI query energy cost by 33 times, expert programmers leverage LLM "vibe coding" for workflow efficiency, and researchers find LLMs exhibit human biases when generating random sequences.
AWS CEO Matt Garman calls replacing junior staff with AI "the dumbest thing I've ever heard", researchers introduce FormalGrad, a method integrating formal methods with gradient-based LLM refinement, and developers create DiffMem, a git-based memory backend for AI agents using Markdown files and Git for temporal evolution tracking.
Tidewave Web launches an in-browser coding agent for Rails and Phoenix, researchers find that AI-generated code can create a "bus factor of zero" posing significant maintenance risks, and Luminal introduces an open-source search-based GPU compiler for deep learning models.
DeepSeek-V3.1-Base model boasts 685B parameters, researchers identify six challenges to AI-assisted codebase generation, and OpenAI's Reflect project introduces a physical AI assistant that illuminates users' lives through sound, light, and color.
Read