Tuesday — April 15, 2025

OpenAI's GPT-4.1 models offer greater performance and lower costs, ActorCore's real-time framework supports various platforms, and NoProp presents a novel learning method without traditional back-propagation.

News

A hackable AI assistant using a single SQLite table and a handful of cron jobs

The author built a simple AI assistant called Stevens, which sends daily briefs with calendar schedules, weather forecasts, and reminders, using a basic architecture of a single SQLite table and cron jobs. The system is easily extensible and demonstrates that useful personal tools with AI can be built without fancy techniques or libraries, and can be customized to have a unique personality and vibe.

GPT-4.1 in the API

OpenAI has launched three new models in its API: GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, which outperform previous models in coding, instruction following, and long context comprehension. These models offer improved performance at a lower cost and latency, making them suitable for a range of applications, including coding, classification, and autocompletion, and will be available via the API, with GPT-4.1 nano being the fastest and cheapest option.

DolphinGemma: How Google AI is helping decode dolphin communication

Google's AI model, DolphinGemma, is being used to help scientists study and decipher dolphin communication, with the goal of understanding the patterns and structure of their vocalizations and potentially generating realistic responses. Researchers are using DolphinGemma and Google Pixel phones to record and analyze dolphin sounds, pushing the boundaries of AI and interspecies communication.

AudioX: Diffusion Transformer for Anything-to-Audio Generation

AudioX is a unified Diffusion Transformer model that can generate high-quality audio and music, offering flexible natural language control and seamless processing of various modalities. The model achieves state-of-the-art performance in audio and music generation tasks, and its versatility allows it to handle diverse input modalities and generation tasks within a unified architecture.

The Cost of Being Crawled: LLM Bots and Vercel Image API Pricing

A misconfiguration in a Next.js web app hosted on Vercel could have cost the company $7,000 due to a spike in traffic from LLM bots, which scraped thousands of images using Vercel's Image Optimization API. The issue was mitigated by blocking the bots, disabling image optimization, and configuring firewall rules to prevent similar incidents in the future.

Research

NoProp: Training neural networks without back-propagation or forward-propagation

The traditional deep learning approach involves back-propagating error signals to learn hierarchical representations, but a new method called NoProp instead uses a diffusion-based approach where each layer learns to denoise a noisy target independently. NoProp achieves superior accuracy and efficiency on image classification benchmarks, offering a viable alternative to traditional gradient-based learning methods and potentially enabling more efficient distributed learning.

MooseAgent: A LLM Based Multi-Agent Framework for Automating Moose Simulation

MooseAgent is an automated solution framework for the multi-physics simulation framework MOOSE, utilizing large-scale pre-trained language models and a multi-agent system to understand user requirements and generate input files. The framework has shown promise in simplifying finite element simulation processes, particularly for single-physics problems, and its open-sourced code is available for further development and use.

Scraping the Shadows: Deep Learning Breakthroughs in Dark Web Intelligence

Darknet markets (DNMs) enable the global trade of illegal goods, and gathering data on them is crucial for law enforcement to combat crime. Researchers developed a framework to automate data extraction from DNMs, using state-of-the-art Named Entity Recognition models, which achieved high performance with 91% precision, 96% recall, and an F1 score of 94% after fine-tuning.

All-in-Memory Stochastic Computing Using ReRAM

Stochastic computing (SC) offers a promising alternative to traditional computing methods for handling complex tasks in embedded and edge devices by approximating arithmetic operations using simple bitwise operations on random bit-streams. This work leverages ReRAM devices to implement the entire SC flow, resulting in significant improvements in throughput and energy consumption, with minimal impact on image quality, compared to state-of-the-art CMOS- and ReRAM-based solutions.

Why it is (nearly) impossible that we live in a simulation

The simulation hypothesis, which suggests that our universe is a simulation, is evaluated based on physical constraints and astrophysical observations, revealing that the required energy or power is incompatible with physics or astronomically large. The results indicate that it is impossible for our universe to be simulated by a universe with the same properties, regardless of future technological advancements.

Code

The path to open-sourcing the DeepSeek inference engine

The DeepSeek team is open-sourcing five repositories, one each day, as part of their Open-Source Week, starting from February 24, 2025, to share their progress in AGI exploration with full transparency. The open-sourced repositories include various projects such as FlashMLA, DeepEP, DeepGEMM, and 3FS, which are designed to optimize and accelerate deep learning tasks, including inference and training, on modern hardware.

Meilisearch – search engine API bringing AI-powered hybrid search

Meilisearch is a lightning-fast search engine that provides a delightful search experience with features like hybrid search, typo tolerance, filtering, and faceted search, making it easy to integrate into apps, websites, and workflows. It offers a range of tools and resources, including a RESTful API, SDKs, and documentation, to help users get started and customize their search experience.

Show HN: ActorCore – Stateful serverless framework that runs anywhere

ActorCore is a stateful, scalable, and real-time backend framework that allows developers to build multiplayer, real-time, or AI agent backends with features like persistent in-memory state, ultra-fast state updates, and integrated support for state, actions, events, and scheduling. It supports various platforms including Rivet, Cloudflare Workers, Bun, and Node.js, and provides a solid foundation for building modern apps with a range of features and tools.

Show HN: A library to convert+deploy existing agent projects as MCP servers

Automcp is a tool that allows users to convert existing agent frameworks into MCP servers, which can be accessed through standardized interfaces. The tool supports deployment of agents, tools, and orchestrators from frameworks such as CrewAI, LangGraph, and OpenAI Agents SDK, and provides a simple installation and setup process using the command line interface.

Show HN: I made a machine learning model to predict 66.45% of NBA games

DeepShot is an advanced NBA game predictor that uses historical data, rolling statistics, and machine learning to make predictions, with features including data-driven predictions, real-time interface, and weighted stats engine. The model is built with NiceGUI for seamless interaction and can be run on Windows, macOS, and Linux, with instructions provided for cloning and running the repository.