Monday — March 10, 2025

The US State Department plans to use AI to identify "pro-Hamas" students, RocketKV compresses the KV cache to speed up long-context LLM inference, and Sidekick brings a local LLM app with offline capabilities to macOS.

News

With AI you need to think bigger

Roving Dynamics Ltd designs and develops custom electronics, software, and other technological solutions for businesses and industries. Based in Blenheim, New Zealand, the company also engages in research and development. Its founder is a writer who has published techno-thriller novels, recently released a new book, and will be available for new projects in 2025.

State Department Will Use AI to Search for 'Pro-Hamas' Students to Deport

The US State Department is launching a program called "Catch and Revoke" that will use AI to scan the social media accounts of, and news reports about, students in the US on a visa to identify those with pro-Palestinian and pro-Hamas sympathies, with the goal of deporting them. The program is part of the Trump administration's broader efforts to police speech and punish anti-Israel protesters, and has raised concerns about the use of unproven AI systems and the potential for biased and inaccurate results.

A bear case: My predictions regarding AI progress

The author expects that current avenues of AI capability advancement, such as pretraining and test-time compute/RL, will not be sufficient to achieve Artificial General Intelligence (AGI), and that progress will decouple from intuitive measures of intelligence, facing steep diminishing returns. The author predicts that future models, such as GPT-5 and GPT-6, will bring subtle quality-of-life improvements but no paradigm-shifting advancements, and that Large Language Models (LLMs) are currently good at solving eisegesis-friendly problems and in-distribution problems, but struggle with more complex tasks.

US Ends Support For Ukrainian F-16s

The US has ended its support for Ukrainian F-16 fighter jets, specifically cutting off vital support for their jamming capabilities, but France is set to provide an alternative solution with its Mirage 2000 fighters, which are equipped with their own powerful jammers. The French jets could become a crucial asset for Ukraine's air defense, potentially replacing the US-designed F-16s and providing a new layer of protection against Russian radar systems.

Doge Plan to Push AI Across the US Federal Government Is Wildly Dangerous

Elon Musk's team has proposed an "AI-first strategy" to transform the US federal government, aiming to cut the agency's budget by half by deploying AI to analyze contracts, root out fraud, and automate work. However, this approach is likely to be disastrous: similar AI-driven initiatives in the past, such as Michigan's unemployment insurance algorithm, have led to incorrect accusations of fraud and violated human rights, highlighting the need for a more nuanced understanding of government processes and data.

Research

Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression

RocketKV is a training-free compression strategy that reduces the memory bandwidth and capacity demands of the KV cache in Transformer-based Large Language Models during the decode phase. By combining coarse-grain KV cache eviction and fine-grain top-k sparse attention, RocketKV achieves significant bandwidth and storage savings while maintaining comparable accuracy, resulting in end-to-end speedup and peak memory reduction on various long-context tasks.
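
The two stages named above can be illustrated with a toy sketch. This is not RocketKV's actual algorithm: the importance scores, the `budget` and `k` parameters, and both function names are assumptions made up for illustration. It only shows the general shape of coarse eviction followed by top-k sparse attention on a single query.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def evict_coarse(keys, values, importance, budget):
    """Stage 1 (coarse): permanently keep only the `budget` most important tokens.

    `importance` is a hypothetical per-token score; how it is computed is
    outside the scope of this sketch.
    """
    keep = sorted(range(len(keys)), key=lambda i: importance[i], reverse=True)[:budget]
    keep.sort()  # preserve the original token order
    return [keys[i] for i in keep], [values[i] for i in keep]

def topk_sparse_attention(query, keys, values, k):
    """Stage 2 (fine): softmax attention over only the k best-scoring keys."""
    scale = 1.0 / math.sqrt(len(query))
    scores = [dot(query, key) * scale for key in keys]
    top = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:k]
    peak = max(scores[i] for i in top)  # subtract the max for numerical stability
    weights = [math.exp(scores[i] - peak) for i in top]
    total = sum(weights)
    out = [0.0] * len(values[0])
    for w, i in zip(weights, top):
        for d in range(len(out)):
            out[d] += (w / total) * values[i][d]
    return out, sorted(top)
```

In this sketch the first stage shrinks what has to be stored, and the second shrinks what has to be read per decode step, which is where the bandwidth saving would come from.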

A GS-Cache Inference Framework for Large-Scale Gaussian Splatting Models

The 3D Gaussian Splatting (3DGS) model faces challenges in achieving real-time performance on consumer-grade devices, hindering its potential in applications like virtual reality (VR). The proposed GS-Cache framework addresses these challenges, integrating 3DGS with an optimized rendering system to achieve significant performance improvements, including up to 5.35x faster rendering and 42% lower GPU memory usage.

A programmable environment for shape optimization and shapeshifting problems

Materials that change shape are a growing area of study, but determining their structure is a complex problem that involves minimizing an energy functional with respect to shape and other factors. To address the lack of suitable simulation tools, Morpho, an open-source programmable environment, has been developed to solve shape optimization problems and has been applied to various areas of soft matter, including hydrogels, complex fluids, and soap films.

The Semantics of Rank Polymorphism

Remora is a core, functional language that captures the rank-polymorphic mechanism of lifting functions to operate on arrays of higher rank, with a formal dynamic semantics and a static, rank-polymorphic type system that prevents "array shape" errors at runtime. The type system uses dependent types and permits static type checking, while also allowing for dynamic computation of array shape or rank, and its properties are useful for the construction of efficient compilers for rank-polymorphic languages.
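
As a rough illustration of what "lifting" means here, the following sketch applies a function written for cells of a given rank to nested lists of any higher rank. It is a dynamic Python analogy, not Remora syntax or semantics: the `rank` and `lift` helpers are hypothetical, arrays are assumed to be regular and non-empty, and nothing is checked statically, which is precisely the part Remora's type system handles.

```python
def rank(a):
    """Rank of a regular, non-empty nested list: 0 for scalars, 1 for vectors, ..."""
    r = 0
    while isinstance(a, list):
        r += 1
        a = a[0]
    return r

def lift(fn, cell_rank):
    """Lift `fn`, which expects arguments of rank `cell_rank`, to any higher rank."""
    def lifted(a):
        if rank(a) <= cell_rank:
            return fn(a)  # the argument is a single cell: apply fn directly
        return [lifted(item) for item in a]  # otherwise map over the outer frame
    return lifted

square = lift(lambda x: x * x, 0)  # scalar function, lifted elementwise
row_sum = lift(sum, 1)             # vector function, lifted over rows
```

With these definitions, `square([[1, 2], [3, 4]])` gives `[[1, 4], [9, 16]]` and `row_sum([[1, 2], [3, 4]])` gives `[3, 7]`.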

TableTalk: Scaffolding Spreadsheet Development with a Language Agent

TableTalk is a language agent that assists programmers in building spreadsheets through conversational interaction, generating step-by-step plans and suggesting next steps to simplify the process. A user study found that TableTalk produces preferred spreadsheets 2.3 times more often than a baseline agent, while reducing cognitive load and time spent on reasoning by 12.6%, demonstrating its effectiveness in human-agent collaboration for spreadsheet programming.

Code

Show HN: Evolving Agents Framework

The Evolving Agents Framework is a production-grade framework for creating, managing, and evolving AI agents with intelligent agent-to-agent communication. It supports building collaborative agent ecosystems that semantically understand requirements and evolve based on past experiences, and its semantic search helps discover semantically similar components in the library.

Anus: An open-source AI agent framework created by Manus AI itself

Anus is an open-source AI agent framework that enables users to create autonomous agents for task automation, offering features such as natural language processing, multi-agent collaboration, and integration with various AI models. The framework provides a flexible and extensible architecture, allowing users to build custom agents and tools, and supports multiple installation methods, including pip installation, installation from source, and containerization using Docker.

Sidekick: Local-first native macOS LLM app

Sidekick is a local large language model (LLM) application for macOS that allows users to chat with a model that can access and respond with information from their files, folders, and websites without requiring an internet connection. The application features a range of tools and capabilities, including image generation, code interpretation, and data visualization, and is designed to be easy to use and accessible to users who may not have prior experience with LLMs or programming.

Show HN: Fresh Data for AI (Open Source)

CocoIndex is an open-source engine that supports custom transformation logic and incremental updates for data indexing, allowing users to declare transformations and maintain indexes with minimal computation. To get started with CocoIndex, users can install the Python library, set up a Postgres database, and follow the quick start guide to define their first indexing flow.
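
The idea of maintaining an index with minimal recomputation can be sketched generically. The code below does not use or imitate the CocoIndex API: the `IncrementalIndex` class, its `update` method, and the content-hash strategy are all assumptions chosen for illustration, and an in-memory dict stands in for a real database.

```python
import hashlib

class IncrementalIndex:
    """Hypothetical index that re-runs a transformation only on changed documents."""

    def __init__(self, transform):
        self.transform = transform  # user-supplied transformation logic
        self.hashes = {}            # doc_id -> content hash seen at last update
        self.index = {}             # doc_id -> transformed output
        self.calls = 0              # how many times `transform` actually ran

    def update(self, documents):
        # Drop entries for documents that no longer exist in the source.
        for doc_id in list(self.index):
            if doc_id not in documents:
                del self.index[doc_id]
                del self.hashes[doc_id]
        # Recompute only new or modified documents.
        for doc_id, text in documents.items():
            digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
            if self.hashes.get(doc_id) == digest:
                continue  # unchanged since the last update: skip the work
            self.index[doc_id] = self.transform(text)
            self.hashes[doc_id] = digest
            self.calls += 1
```

Calling `update` twice with the same documents runs the transformation only on the first call; changing one document afterwards triggers exactly one more transformation.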

Show HN: Minimalist framework for building POC AI agents

Explicit Agent is a minimalist framework for building AI agents that provides complete transparency, full control, and minimal infrastructure, allowing users to define exactly how their agent behaves. The framework uses simple, understandable components to build complex behaviors, with key concepts including state management, tool types, and tool return values that work together to enable autonomous task solving.