Simon Willison tests OpenAI's newly released ChatGPT Images 2.0 model using a complex Where's Waldo style prompt involving a raccoon holding a ham radio. By comparing results against previous versions and competitors like Google's Nano Banana, the article evaluates the model's ability to handle high-detail illustrations and specific text elements.
This article details how to use Ollama to run large language models locally, protecting sensitive data by keeping it on your machine. It covers installation, usage with Python, LangChain, and LangGraph, and provides a practical example with FinanceGPT, while also discussing the tradeoffs of using local LLMs.
Anthropic is clashing with the Pentagon over the military's use of its AI systems, particularly regarding autonomous weaponry and mass surveillance. A key point of contention arose when the Pentagon asked if Claude could be used to help intercept a nuclear missile, a request Anthropic resisted, raising concerns about unrestricted AI use and potential risks. OpenAI is also signaling it would take a similar stance.
LLM Council works together to answer your hardest questions. A local web app that uses OpenRouter to send queries to multiple LLMs, have them review/rank each other's work, and finally a Chairman LLM produces the final response.
This repository contains the source code for the summarize-and-chat project. This project provides a unified document summarization and chat framework with LLMs, aiming to address the challenges of building a scalable solution for document summarization while facilitating natural language interactions through chat interfaces.
Sam Altman discusses the imminent arrival of digital superintelligence, its potential impacts on society, and the future of technological progress. He highlights the rapid advancements in AI, the economic and scientific benefits, and the challenges of ensuring safety and equitable access.
PaperCoder is a multi-agent LLM system that transforms scientific papers into code repositories through a three-stage pipeline: planning, analysis, and code generation. It aims to create faithful, high-quality implementations.
Notte is an open-source browser using an agent, designed to improve speed, cost, and reliability in web agent tasks through a perception layer that structures webpages for LLM consumption. It offers a full stack framework with customizable browser infrastructure, web scripting, and scraping endpoints.
A terminal-based platform to experiment with the AI Software Engineer. It allows users to specify software in natural language, watch as an AI writes and executes the code, and implement improvements. Supports various models and customization options.
The article discusses the implications of Sam Altman's proposal to modify the social contract in light of advancements in AI, emphasizing the potential risks to marginalized communities and democratic values. It critiques the exclusionary nature of traditional social contract theories and questions the role of tech leaders in shaping societal norms.