Ilya Sutskever warns a trillion dollars may be wasted on scaling LLMs, a new tool reorganizes git commits with an LLM, and Nvidia's 8B orchestrator model outperforms GPT-5 on agentic tasks.
The White House is accused of bailing out the AI industry, DeepSeek releases a self-verifying math model and humans beat LLMs in a competitive coding tournament.
US Census data reveals a decline in enterprise AI adoption, a new protocol fixes complex coding by destroying the AI instance after each task and image diffusion models are repurposed for zero-shot video object tracking.
A mathematical ceiling may limit generative AI to amateur creativity, a new tool hands off web bugs to AI agents, and a Brain-Computer Interface protects human teams from misleading AI.
LLMs are stunned by thousands of invisible Unicode characters, the experimental Pulse-Field O(N) architecture is 12x faster than Transformers and RL now controls biohybrid robots with living muscle.
An AI trained on bacterial genomes creates never-before-seen proteins, LLM Council has models rank each other's work, and a new attack jailbreaks LLMs using game-theory scenarios.
Google targets a 1000x compute increase in five years, a project reverse jailbreaks a psychopathic AI via identity injection, and GPT-5 helps discover new mathematical results.
AI2 open-sources the entire model flow for its Olmo 3 LLMs, a new browser extension yoinks design systems for AI coding assistants and an AI rediscovers Newtonian physics from raw data.
Students protest an AI-taught coding course, Docker introduces a `docker model` CLI and adversarial poetry functions as a universal LLM jailbreak.
Europe scales back GDPR and AI laws, ChunkBack emulates LLM APIs for cost-free testing and CudaForge uses LLM agents to optimize CUDA kernels.
Google's new Gemini 3 generates UIs directly in search, a cognitive architecture lets agents swap LLMs mid-conversation, and a new system solves a million-step task with zero errors.
Windows 11 adds a background AI agent with access to personal folders, adaptive attacks bypass 12 recent LLM jailbreak defenses, and a new cognitive architecture gives agents a persistent identity.
Peter Thiel sells off all Nvidia stock over bubble fears, an open-source project claims 9.68x GPU amplification using quantum concepts and a new system solves a million-step LLM task with zero errors.
A tech ideology frames humanity as a "biological bootloader" for AGI, Microsoft releases a free "AI for Beginners" curriculum, and a new system solves a million-step LLM task with zero errors.
Nvidia plans to sell entire AI servers instead of just GPUs, Agent Playbook offers a Storybook-like playground for AI agents, and a hybrid diffusion-autoregressive model promises a 5x speedup.
An agentic LLM orchestrates a cyber-espionage campaign, a Claude Code agent calls external LLMs like Grok and Gemini, and a new side-channel attack infers prompt topics from encrypted traffic.
Yann LeCun departs Meta to launch a "world models" startup, a new project shares LLM attention caches across GPUs like memcached, and research finds smaller models can be more consistent than 120B ones.
An AI agent provides conversational documentation for any GitHub repo, a new tool uses LLMs to prevent architectural drift, and a review of 445 LLM benchmarks finds many lack construct validity.
A $1T tech stock sell-off reflects AI skepticism, a new tool serves hundreds of LLMs on a single GPU, and an evolutionary agent rediscovers mathematical formulas.
Nvidia's CEO warns China will possess more AI compute than the rest of the world by 2027, an open-source NBA game predictor reaches 70% accuracy, and an evolutionary coding agent discovers improved mathematical solutions.
OpenAI seeks U.S. loan guarantees for a $1T AI expansion, Cascadeflow cuts API costs with speculative model cascading, and research analogizes Transformers to General Relativity.
OpenAI seeks U.S. loan guarantees for a $1T expansion, an LLM agent reverse-engineers web apps into automations, and an AI scientist automates six months of human research in a single run.
Amazon demands Perplexity stop its AI agent from making purchases, a new platform uses multi-model consensus to read MRIs and Cache-to-Cache enables direct semantic communication between LLMs.
An analysis of 180M jobs shows creative roles declining 30%, a developer builds a Raspberry Pi dog cam using Claude, and a new attention architecture outperforms full attention.
LLMs may overuse em-dashes due to 19th-century training data, a RAG pipeline runs on a 2011 Raspberry Pi in pure PHP and a model maps vocal prosody to typography.
Read