A summary of a workshop presented at PyCon US on building software with LLMs, covering setup, prompting, building tools (text-to-SQL, structured data extraction, semantic search/RAG), tool usage, and security considerations such as prompt injection. It also discusses the current LLM landscape, including models from OpenAI, Google (Gemini), Anthropic, and open-weight alternatives.
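As a rough sketch of the structured data extraction pattern the workshop covers, the LLM library's Python API accepts a schema alongside a prompt; the model ID, schema fields, and example text below are placeholders rather than workshop material.

```python
import llm
from pydantic import BaseModel


class Talk(BaseModel):
    """Fields to pull out of a free-form announcement (illustrative schema)."""
    title: str
    speaker: str
    start_time: str


# Assumes an API key is already configured for the llm library;
# the model ID here is just an example.
model = llm.get_model("gpt-4.1-mini")
response = model.prompt(
    "Extract the talk details: 'Join Jane Doe for Building software on top "
    "of LLMs, starting at 9am in Room 21.'",
    schema=Talk,
)
print(response.text())  # JSON conforming to the Talk schema
```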
This article introduces a new plugin, llm-video-frames, which lets users feed video files into long-context vision LLMs (such as GPT-4.1) by converting them into a sequence of JPEG frames. It shows how to install and use the plugin, works through examples using a video of Cleo, and discusses the cost and technical details of the process. It also covers how the plugin was developed with an LLM and highlights other features in LLM 0.25.
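The underlying idea, independent of the plugin's internals, is to split a video into JPEG frames and hand them to a vision model as image attachments. A minimal sketch of that approach using ffmpeg and the LLM Python API follows; the frame rate, file paths, and model ID are illustrative assumptions, not the plugin's actual implementation.

```python
import subprocess
import tempfile
from pathlib import Path

import llm

video = "cleo.mp4"  # placeholder path
frames_dir = Path(tempfile.mkdtemp())

# Extract one JPEG per second of video with ffmpeg (assumed to be on PATH).
subprocess.run(
    ["ffmpeg", "-i", video, "-vf", "fps=1", str(frames_dir / "frame_%04d.jpg")],
    check=True,
)

# Send the frames to a long-context vision model as image attachments.
model = llm.get_model("gpt-4.1-mini")  # example model ID
response = model.prompt(
    "Describe what happens in this video, frame by frame.",
    attachments=[
        llm.Attachment(path=str(p)) for p in sorted(frames_dir.glob("*.jpg"))
    ],
)
print(response.text())
```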
An analysis of the recent paper 'The Leaderboard Illusion', which critiques Chatbot Arena's LLM evaluation methodology, focusing on issues with private testing, unfair sampling rates, and potential gaming of the leaderboard. It also explores OpenRouter as a potential alternative ranking system.
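For readers unfamiliar with how Arena-style leaderboards are produced, the sketch below runs a simplified Elo-style update over pairwise battle results. It is purely illustrative of the general ranking mechanism (and of why sampling rates matter); it is not the Arena's or the paper's actual methodology, which fits a Bradley-Terry model.

```python
from collections import defaultdict

# Each battle: (model_a, model_b, winner) where winner is "a" or "b".
battles = [
    ("model-x", "model-y", "a"),
    ("model-y", "model-z", "a"),
    ("model-x", "model-z", "a"),
]

K = 32  # update step size
ratings = defaultdict(lambda: 1000.0)

for a, b, winner in battles:
    # Expected score for model a given the current rating gap.
    expected_a = 1 / (1 + 10 ** ((ratings[b] - ratings[a]) / 400))
    score_a = 1.0 if winner == "a" else 0.0
    ratings[a] += K * (score_a - expected_a)
    ratings[b] += K * ((1 - score_a) - (1 - expected_a))

for model, rating in sorted(ratings.items(), key=lambda kv: -kv[1]):
    print(f"{model}: {rating:.1f}")
```

Models that appear in more battles get more rating updates, which is one reason the paper's sampling critique matters.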
Alibaba's Qwen team released the Qwen 3 model family, offering a range of sizes and capabilities. The article discusses the models' features and performance, the well-coordinated release across the LLM ecosystem, and the broader trend of increasingly capable models running on the same hardware.
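A quick sketch of trying one of the smaller Qwen 3 models locally, assuming Ollama and the llm-ollama plugin are installed and the model has been pulled; the `qwen3:8b` tag is an assumption about Ollama's naming, so substitute whatever size fits your hardware.

```python
import llm

# Assumes `llm install llm-ollama` and `ollama pull qwen3:8b` have been run;
# the tag name is an assumption.
model = llm.get_model("qwen3:8b")
response = model.prompt("Explain the difference between a thread and a process.")
print(response.text())
```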
Google's Gemini 2.5 Flash is a new, faster, and more cost-effective model with an adjustable 'thinking' budget. The article details how to use it with llm-gemini, explores how its pricing differs from Gemini 2.0 Flash, and shares example SVG outputs.
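A minimal sketch of calling the model through llm-gemini's Python integration; the preview model ID and the `thinking_budget` option name match what was current when the article appeared, but treat both as assumptions that may have changed since.

```python
import llm

# Assumes `llm install llm-gemini` and a key configured via `llm keys set gemini`.
model = llm.get_model("gemini-2.5-flash-preview-04-17")  # preview ID at the time

# thinking_budget=0 disables the "thinking" step; larger values allow more
# reasoning tokens (option name is an assumption about llm-gemini).
response = model.prompt(
    "Generate an SVG of a pelican riding a bicycle",
    thinking_budget=0,
)
print(response.text())
```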
LLM 0.24 introduces fragments and template plugins to make better use of long-context models, improving storage efficiency and enabling new workflows such as querying logs by fragment and feeding entire documentation sets into a prompt. It also details improvements to template handling and model support.
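To illustrate the fragments idea, here is a sketch that passes a long document as reusable context; the `fragments=` parameter assumes the Python API mirrors the CLI's `-f` flag, so check the 0.24 release notes for the exact interface. The file path and model ID are placeholders.

```python
from pathlib import Path

import llm

# A long document to reuse as context across several prompts.
docs = Path("docs/usage.md").read_text()  # placeholder path

model = llm.get_model("gpt-4.1-mini")  # example model ID
# fragments= is an assumption about the Python API; LLM stores each fragment
# once, de-duplicated, and logs prompts by reference to it.
response = model.prompt(
    "What command-line options does this tool support?",
    fragments=[docs],
)
print(response.text())
```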
A review of Qwen2.5-VL-32B, a vision-language model, noting its performance and capabilities and how it runs on a 64GB Mac. Includes a demonstration with a map image and performance statistics.
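To give a flavour of the kind of prompt in that demonstration, here is a sketch of sending an image to a vision model through the LLM Python API; the model identifier and image path are placeholders, and the review itself ran the model locally via MLX rather than through this exact route.

```python
import llm

# Placeholder model identifier: substitute whatever vision-capable model
# (local or hosted) is available in your llm installation.
model = llm.get_model("qwen2.5vl:32b")

response = model.prompt(
    "Describe this map in detail. What area does it cover and what are the "
    "most prominent labelled features?",
    attachments=[llm.Attachment(path="map.png")],  # placeholder image path
)
print(response.text())
```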
Simon Willison discusses his experience using Large Language Models (LLMs) for coding, providing detailed advice on using LLMs effectively to augment coding ability, setting reasonable expectations, managing context, and more.
A guide on using large language models (LLMs) for programming tasks, including examples, strategies, and useful tips for effectively using AI assistants like ChatGPT and Claude.
Simon Willison discusses the release of llm-anthropic 0.14, which adds support for Claude 3.7 Sonnet's new capabilities: extended thinking mode, a much larger output token limit, and improved support for long-running tasks. The article also covers the plugin's implementation details and limitations.
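As a sketch of turning on extended thinking through the plugin's Python route: the `claude-3.7-sonnet` alias and the `thinking` option name are assumptions based on the plugin's release notes, so verify them against the current documentation.

```python
import llm

# Assumes `llm install llm-anthropic` and an Anthropic API key configured.
model = llm.get_model("claude-3.7-sonnet")  # alias assumed to be registered by the plugin

# thinking=True is assumed to enable extended thinking mode, mirroring the
# CLI's -o thinking 1 option.
response = model.prompt(
    "Write a detailed plan for migrating a large Django app to async views.",
    thinking=True,
)
print(response.text())
```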