A summary of a workshop presented at PyCon US on building software with LLMs, covering setup, prompting, building tools (text-to-SQL, structured data extraction, semantic search/RAG), tool usage, and security considerations like prompt injection. It also discusses the current LLM landscape, including models from OpenAI, Gemini, Anthropic, and open-weight alternatives.
Google's Gemini 2.5 Flash model is a new, faster, and more cost-effective model with adjustable 'thinking' capabilities. The article details how to use it with llm-gemini, explores pricing differences compared to Gemini 2.0 Flash, and shares example SVG outputs.
LLM 0.17 release enables multi-modal input, allowing users to send images, audio, and video files to Large Language Models like GPT-4o, Llama, and Gemini, with a Python API and cost-effective pricing.