SemanticScuttle - klotz.me » Tags: simon willison+llm+video

Tags: simon willison* + llm* + video*

0 bookmark(s) - Sort by: Date ↓ / Title /

You can now run prompts against images, audio and video in your terminal using LLM

LLM 0.17 release enables multi-modal input, allowing users to send images, audio, and video files to Large Language Models like GPT-4o, Llama, and Gemini, with a Python API and cost-effective pricing.

2024-10-29 Tags: llm, simon willison, image, audio, video, gpt-4o, gemini, python, cli by klotz
Video scraping: extracting JSON data from a 35 second screen capture for less than 1/10th of a cent

The author records a screen capture of their Gmail account and uses Google Gemini to extract numeric values from the video.

2024-10-17 Tags: video, scraping, json, google gemini, llm, simon willison by klotz

First / Previous / Next / Last / Page 1 of 0