SemanticScuttle - klotz.me » klotz: image+gemini

klotz: image* + gemini*

Search/ReSearch: Asking questions of images with AI?

An analysis of how well different AI systems perform in describing images and answering questions about them. The article compares ChatGPT, Gemini, Llama, and Claude using four images: a hand, a bottle of wine, a piece of pastry, and a flower.

2025-03-01 Tags: vlm, image description, chatgpt, gemini, llama, claude, image, dan russell by klotz
You can now run prompts against images, audio and video in your terminal using LLM

LLM 0.17 release enables multi-modal input, allowing users to send images, audio, and video files to Large Language Models like GPT-4o, Llama, and Gemini, with a Python API and cost-effective pricing.

2024-10-29 Tags: llm, simon willison, image, audio, video, gpt-4o, gemini, python, cli by klotz

First / Previous / Next / Last / Page 1 of 0