Wednesday — October 30, 2024

GitHub partners with Google and Anthropic for AI models, Mantis defends against LLM cyberattacks, and Integuru simplifies API integration through reverse-engineering with AI.

News

GitHub cuts AI deals with Google, Anthropic

Here is a summary of the text in a couple of sentences:

Microsoft's GitHub unit has agreed to integrate artificial intelligence models from Google and Anthropic into its coding assistant, GitHub Copilot, allowing users to switch from OpenAI models. The new AI models, Google's Gemini and Anthropic's Claude 3.5 Sonnet, will initially be available as an option for users, with plans to eventually incorporate them into the main part of the Copilot assistant.

OpenAI builds first chip with Broadcom and TSMC, scales back foundry ambition

I don't see any text provided. Please share the text you'd like me to summarize, and I'll be happy to assist you.

Show HN: Kasama – an IntelliJ plugin to keep track of your coding practices

This text appears to be a footer from the JetBrains plugin marketplace, providing links to feedback, terms of use, and legal information. It also includes a JetBrains logo and copyright information.

Google CEO says more than a quarter of the company's new code is created by AI

More than a quarter of new code at Google is created by AI, with the generated code then reviewed by employees to ensure accuracy. This increased use of AI for coding is part of Google's efforts to boost productivity and efficiency within the company.

Meta's AI Abundance

Stratechery, a publication, has historically benefited from a "cheat code" by writing articles that contradict investor pessimism, only to be proven correct when Meta's stock price recovers. The author now believes Meta is well-positioned to take advantage of generative AI, particularly in digital advertising, and may become the most valuable company in the world.

Research

Hacking Back the AI-Hacker: Prompt Injection as a Defense for LLM-Attackers

Researchers propose a new defense strategy, Mantis, to counter cyberattacks driven by large language models (LLMs). Mantis exploits LLMs' susceptibility to adversarial inputs to disrupt or compromise the attacker's operations, achieving over 95% effectiveness in experiments.

Natural Language Outlines for Code: Literate Programming in the LLM Era

Researchers propose using natural language outlines as a novel interface for AI-assisted software development, allowing developers to interact with code through concise prose summaries. This modality enables bidirectional syncing between code and natural language, and has various use cases, including accelerating code understanding and maintenance.

GPT-4o System Card [pdf]

GPT-4o is an advanced AI model that can process and generate text, audio, and images, trained end-to-end across multiple modalities, and can respond to audio inputs in under 320 milliseconds. It outperforms existing models in vision and audio understanding, and is also faster and cheaper than its predecessor GPT-4 Turbo, with significant improvements in non-English languages.

LLM Code Generation with Formal Specifications and Reactive Program Synthesis

Large Language Models (LLMs) have improved for code generation tasks but still struggle with accuracy, particularly for complex systems with unusual logic. A proposed solution combines LLMs with formal methods-based program synthesis to improve code generation accuracy and tackle previously intractable problems.

Acer: Automatic Language Model Context Extension via Retrieval

Researchers found that current generalist long-context language models are lacking in practical tasks, requiring task-specific data that can be costly. To address this, they developed an automatic data synthesis pipeline using short-context language models, which can generate task-specific long-context capabilities and outperform existing models in tasks like long-context retrieval augmented generation.

Code

Launch HN: Integuru (YC W24) – Reverse-engineer internal APIs using LLMs

Integuru is an AI agent that generates integration code by reverse-engineering platforms' internal APIs. It works by analyzing browser network requests, identifying dependencies, and generating runnable Python code to perform desired actions.

Show HN: GPT powered Discord bot that summarizes mental health research daily

The Elata Newsbot is a Discord bot that aggregates and summarizes news related to computational neuroscience, precision psychiatry, and other emerging mental health fields. To install and run the bot, users must clone the repository, run a setup script, and obtain API keys for Discord, NewsAPI, and OpenAI.

Show HN: Durable Swarm – A Framework for Building Reliable AI Agents

Here is a summary of the text in a couple of sentences:

Durable Swarm is a drop-in replacement for OpenAI's Swarm that adds durable execution to make multi-agent systems reliable and resilient to failures. It uses DBOS to persist the execution state of workflows in a Postgres database, allowing agents to automatically resume from their last completed steps if interrupted or restarted.

pgai: Develop RAG, semantic search, and other AI apps with PostgreSQL

pgai is an extension for PostgreSQL that simplifies the process of building AI applications, including search, Retrieval Augmented Generation (RAG), and other AI applications. It allows developers to work with embeddings generated from their data, leverage large language models (LLMs) for data processing tasks, and integrate AI from various providers.

Show HN: Add RAG to any app in minutes – Dabarqus

Dabarqus is a standalone application that implements a complete RAG (Relevance, Accuracy, and Granularity) solution, designed to be easy to use and integrate with existing applications. It includes a REST API, command-line interface, and admin dashboard, and is available on multiple platforms, including macOS, Linux, and Windows.