Tags: simon willison* + image* + llm*

0 bookmark(s) - Sort by: Date ↓ / Title /

  1. LLM 0.17 release enables multi-modal input, allowing users to send images, audio, and video files to Large Language Models like GPT-4o, Llama, and Gemini, with a Python API and cost-effective pricing.
    2024-10-29 Tags: , , , , , , , , by klotz
  2. . The author experiments with the model, asking it to add a walrus to a prompt, and is surprised to find that the model can maintain consistency between images with a slightly altered prompt using a "seed" number. The author also delves into the underlying prompt engineering of DALL-E 3, revealing policies and guidelines that govern the model's image generation, including diversity and inclusivity guidelines.
    2024-10-29 Tags: , , , , , , by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: tagged with "simon willison+image+llm"

About - Propulsed by SemanticScuttle