This tutorial explores implementing the LLM Arena-as-a-Judge approach to evaluate large language model outputs using head-to-head comparisons. It demonstrates the approach by pitting OpenAI's GPT-4.1 against Google's Gemini 2.5 Pro in a customer support scenario, with GPT-5 acting as the judge.
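A minimal sketch of the pairwise-judging step, assuming both candidate replies have already been generated and that the judge is reachable through the OpenAI Python client; the model identifier and prompt wording are illustrative assumptions, not the tutorial's exact code.

```python
from openai import OpenAI  # assumes the official openai package is installed

client = OpenAI()  # reads OPENAI_API_KEY from the environment

JUDGE_PROMPT = """You are judging a customer-support reply contest.
Customer question: {question}

Response A:
{answer_a}

Response B:
{answer_b}

Which response better resolves the customer's issue?
Answer with exactly "A" or "B", then one sentence of justification."""

def judge_pair(question: str, answer_a: str, answer_b: str,
               judge_model: str = "gpt-5") -> str:
    """Ask the judge model to pick the stronger of two candidate replies."""
    result = client.chat.completions.create(
        model=judge_model,  # judge model id is an assumption
        messages=[{"role": "user", "content": JUDGE_PROMPT.format(
            question=question, answer_a=answer_a, answer_b=answer_b)}],
    )
    return result.choices[0].message.content

# Usage: answers from GPT-4.1 and Gemini 2.5 Pro are generated separately and
# passed in; running a second call with A/B swapped helps control position bias.
```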
**Experiment Goal:** Determine whether LLMs can autonomously perform root cause analysis (RCA) on live application anomalies.
Four LLMs were given access to OpenTelemetry data from a demo application:
* They were prompted with a naive instruction: "Identify the issue, root cause, and suggest solutions."
* Four distinct anomalies were used, each with a known root cause established through manual investigation.
* Performance was measured by: accuracy, guidance required, token usage, and investigation time.
* Models: Claude Sonnet 4, OpenAI o3, OpenAI GPT-4.1, Gemini 2.5 Pro
**Key Findings:**
* **Autonomous RCA is not yet reliable.** The LLMs generally fell short of replacing SREs; even GPT-5, which was not explicitly tested but is invoked as a benchmark, would be unlikely to outperform the others.
* **LLMs are useful as assistants.** They can help summarize findings, draft updates, and suggest next steps.
* **A fast, searchable observability stack (like ClickStack) is crucial.** LLMs need access to good data to be effective.
* **Models varied in performance:**
* Claude Sonnet 4 and OpenAI o3 were the most successful, often identifying the root cause with minimal guidance.
* GPT-4.1 and Gemini 2.5 Pro required more prompting and struggled to query data independently.
* **Models can get stuck in reasoning loops.** They may focus on one aspect of the problem and miss other important clues.
* **Token usage and cost varied significantly.**
**Specific Anomaly Results (briefly):**
* **Anomaly 1 (Payment Failure):** Claude Sonnet 4 and OpenAI o3 solved it on the first prompt. GPT-4.1 and Gemini 2.5 Pro needed guidance.
* **Anomaly 2 (Recommendation Cache Leak):** Claude Sonnet 4 identified the service restart issue but missed the cache problem initially. OpenAI o3 identified the memory leak. GPT-4.1 and Gemini 2.5 Pro struggled.
The article discusses how integrating Google's Gemini AI could significantly improve Google Keep's functionality, turning it into a more powerful note-taking and productivity tool. It details potential features like AI-powered summaries, improved note creation with typo correction, audio note enhancements with speaker detection, smart Q&A from tagged notes, and seamless integration with Google Calendar.
Google has introduced LangExtract, an open-source Python library designed to help developers extract structured information from unstructured text using large language models such as the Gemini models. The library simplifies the process of converting free-form text into structured data, offering features like controlled generation, text chunking, parallel processing, and integration with various LLMs.
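A short sketch of the extraction flow, closely following the pattern in LangExtract's published quick-start; the prompt, few-shot example, and model id here are illustrative, and exact parameter names should be checked against the library's documentation.

```python
import langextract as lx

# Describe what should be pulled out of the free-form text.
prompt = "Extract medication names and their dosages from the text."

# One worked example steers the model (few-shot, per the library's quick-start pattern).
examples = [
    lx.data.ExampleData(
        text="The patient was given 250 mg of amoxicillin twice daily.",
        extractions=[
            lx.data.Extraction(
                extraction_class="medication",
                extraction_text="amoxicillin",
                attributes={"dosage": "250 mg"},
            )
        ],
    )
]

result = lx.extract(
    text_or_documents="She takes 10 mg of lisinopril every morning.",
    prompt_description=prompt,
    examples=examples,
    model_id="gemini-2.5-flash",  # any supported Gemini model id; requires an API key
)

for extraction in result.extractions:
    print(extraction.extraction_class, extraction.extraction_text, extraction.attributes)
```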
Google Sheets now allows users to generate text, summarize information, and categorize data using Gemini AI directly in cells. The feature supports text generation, summarization, categorization, and sentiment analysis with optional data ranges.
This post explores how developers can leverage Gemini 2.5 to build sophisticated robotics applications, focusing on semantic scene understanding, spatial reasoning with code generation, and interactive robotics applications using the Live API. It also highlights safety measures and current applications by trusted testers.
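As an illustration of the spatial-reasoning idea, here is a minimal sketch that asks a Gemini model to return bounding boxes for objects in a scene image via the google-genai SDK; the model id, prompt, and image file are assumptions rather than the post's own code.

```python
from google import genai
from PIL import Image

client = genai.Client()  # reads GEMINI_API_KEY from the environment

scene = Image.open("workbench.jpg")  # hypothetical robot camera frame

prompt = (
    "List the graspable objects in this image. "
    'Return JSON: [{"label": str, "box_2d": [ymin, xmin, ymax, xmax]}], '
    "with coordinates normalized to 0-1000."
)

response = client.models.generate_content(
    model="gemini-2.5-flash",  # model id is an assumption
    contents=[scene, prompt],
)

# The reply is JSON text with labels and normalized boxes, which a robotics
# stack could map back to pixel or world coordinates for grasp planning.
print(response.text)
```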
A summary of a workshop presented at PyCon US on building software with LLMs, covering setup, prompting, building tools (text-to-SQL, structured data extraction, semantic search/RAG), tool usage, and security considerations like prompt injection. It also discusses the current LLM landscape, including models from OpenAI, Gemini, Anthropic, and open-weight alternatives.
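To make the semantic-search/RAG piece concrete, here is a small retrieval sketch using OpenAI embeddings and cosine similarity; it is a generic illustration of the technique covered in the workshop, not the workshop's own code, and the embedding model id is an assumption.

```python
import numpy as np
from openai import OpenAI

client = OpenAI()

docs = [
    "Invoices are emailed on the first business day of each month.",
    "Password resets are handled through the self-service portal.",
    "Refunds take five to seven business days to appear.",
]

def embed(texts: list[str]) -> np.ndarray:
    """Embed a batch of texts and L2-normalize so dot product = cosine similarity."""
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    vecs = np.array([d.embedding for d in resp.data])
    return vecs / np.linalg.norm(vecs, axis=1, keepdims=True)

doc_vecs = embed(docs)

def retrieve(query: str, k: int = 2) -> list[str]:
    """Return the k documents most similar to the query."""
    q = embed([query])[0]
    scores = doc_vecs @ q
    return [docs[i] for i in np.argsort(scores)[::-1][:k]]

# The retrieved passages would then be pasted into the prompt for the
# answering model, which is the "generation" half of RAG.
print(retrieve("How long do refunds take?"))
```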
Google's AI function brings Gemini-powered language models right into your spreadsheet cells without any add-ons. With it, you can generate fresh text, summarize blocks of data, categorize entries, or even gauge sentiment, all by typing a simple formula.
The article provides examples such as:
- *sentiment analysis* `=AI("Is this customer feedback positive, negative, or neutral?", A2)`
- *data categorization* `=AI("Classify this expense as Travel, Office, or Other", D3)`
- *simple calculations* `=AI("Add the numbers in these cells", A1:A5)`
This tutorial demonstrates how to integrate Google’s Gemini 2.0 with an in-process Model Context Protocol (MCP) server using FastMCP, creating tools for weather information and integrating them into Gemini's function calling workflow.
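A compact sketch of the server side, assuming the standalone fastmcp (v2) package; the tool name, its canned return value, and the in-memory client call are illustrative, not the tutorial's exact code.

```python
import asyncio
from fastmcp import FastMCP, Client  # assumes the standalone fastmcp (v2) package

mcp = FastMCP("weather")

@mcp.tool()
def get_weather(city: str) -> str:
    """Return a (canned) weather report for a city."""
    return f"It is 21°C and sunny in {city}."  # stand-in for a real weather API call

async def main() -> None:
    # FastMCP's Client can talk to the server instance in-process,
    # with no subprocess or network transport involved.
    async with Client(mcp) as client:
        tools = await client.list_tools()  # these schemas are what get exposed to Gemini
        result = await client.call_tool("get_weather", {"city": "Berlin"})
        print([t.name for t in tools], result)

asyncio.run(main())
```

Per the summary above, the tutorial then maps these MCP tool schemas into Gemini's function-calling workflow so the model can request `get_weather` during a conversation.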
Google's Gemini 2.5 Flash model is a new, faster, and more cost-effective model with adjustable 'thinking' capabilities. The article details how to use it with llm-gemini, explores pricing differences compared to Gemini 2.0 Flash, and shares example SVG outputs.
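For context, a minimal sketch of calling the model through llm's Python API once the llm-gemini plugin is installed; the model id string and the thinking-budget option name are assumptions that may differ from the plugin's current release.

```python
import llm  # pip install llm llm-gemini; set the key with `llm keys set gemini`

# Model id as exposed by the llm-gemini plugin around the time of the post (assumption).
model = llm.get_model("gemini-2.5-flash-preview-04-17")

response = model.prompt("Generate an SVG of a pelican riding a bicycle")
print(response.text())

# The plugin surfaces the adjustable reasoning budget as a model option; the exact
# option name (e.g. thinking_budget) should be checked with `llm models --options`.
# response = model.prompt("Explain quantum entanglement briefly", thinking_budget=0)
```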