SemanticScuttle - klotz.me

Tags: hallux*


>The Playwright MCP Chrome Extension allows you to connect to pages in your existing browser and leverage the state of your default user profile. This means the AI assistant can interact with websites where you're already logged in, using your existing cookies, sessions, and browser state, providing a seamless experience without requiring separate authentication or setup.

2025-12-04 Tags: playwright, mcp, github, extension, llm, browser, hallux by klotz

SemTools: Are Coding Agents all you Need?

The article explores whether combining a command-line agent (like Claude Code or Gemini CLI) with Unix-like file system tools and SemTools is sufficient for complex tasks, particularly document search. It details a benchmark testing the limits of coding agents with and without SemTools, focusing on search, cross-referencing, and temporal analysis. The conclusion is that CLI access is powerful and SemTools enhances agent capabilities for document search and RAG.

2025-09-06 Tags: semtools, coding, agents, llamaindex, rag, document search, cli, semantic search, llm, llamaparse, hallux by klotz

Server

Interact with opencode server over HTTP. The `opencode serve` command runs a headless HTTP server that exposes an OpenAPI endpoint that an opencode client can use.

2025-08-25 Tags: opencode, server, http, openapi, api, sdk, cli, tui, configuration, agents, models, llm, hallux by klotz

I set up an email triage system using Home Assistant and a local LLM, here's how you can too

This article details how to set up an email triage system using Home Assistant and a local Large Language Model (LLM) to summarize and categorize incoming emails, reducing inbox clutter and improving email management. It covers the setup of a REST command to interface with Ollama, the automation process, and the benefits of using a local LLM for privacy.
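A rough idea of what the REST call to Ollama could look like, sketched in Python rather than as a Home Assistant `rest_command`. The endpoint is Ollama's default local `/api/generate` route; the model name, the category list, and the helper names `build_triage_payload` and `triage_email` are assumptions made for illustration and are not taken from the article.

```python
import json
import urllib.request

# Ollama's default local endpoint; change the host if Ollama runs elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_triage_payload(subject: str, body: str, model: str = "llama3.2") -> dict:
    """Build a non-streaming generate request (model name is an assumption)."""
    prompt = (
        "Summarize the following email in one sentence, then assign one "
        "category from: urgent, personal, newsletter, other.\n\n"
        f"Subject: {subject}\n\n{body}"
    )
    # "stream": False asks Ollama for a single JSON object instead of chunks.
    return {"model": model, "prompt": prompt, "stream": False}


def triage_email(subject: str, body: str, url: str = OLLAMA_URL) -> str:
    """Send the email to a local Ollama instance and return the model's text."""
    payload = json.dumps(build_triage_payload(subject, body)).encode("utf-8")
    request = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=120) as response:
        return json.loads(response.read())["response"]
```

Calling `triage_email("Invoice overdue", "Please pay by Friday.")` needs a running Ollama server with the chosen model already pulled; nothing leaves the local machine, which is the privacy benefit the article points to.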

2025-08-25 Tags: home assistant, llm, ollama, email triage, automation, smart home, email, summarization, hallux, solon by klotz

Developer Walk-Through of Auggie CLI, an Agentic Terminal App

We test out the latest product from Augment Code, a terminal app called Auggie CLI. How does it compare to other AI command-line interfaces?

- Workspace Indexing: Auggie automatically indexes the project directory, which is beneficial for context but raises security considerations (addressed via .augmentignore files).
- Interactive vs. Non-Interactive Mode: The author tests both modes, highlighting the benefits of a one-shot, non-interactive command for quick tasks.
- Code Modification: A key test involves using Auggie to add Bootstrap classes to a Rails view file. Auggie successfully analyzed the existing code, generated a correct diff, and applied the changes.

2025-08-23 Tags: llm, agentic cli, auggie cli, augment code, terminal, developer tools, hallux by klotz

jmap-mcp

A Model Context Protocol (MCP) server that provides tools for interacting with JMAP (JSON Meta Application Protocol) email servers. Built with Deno and using the jmap-jam client library.
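For context, this is roughly what a JMAP request body looks like on the wire, following the JMAP core and mail specifications. It is a sketch of the protocol itself, not of jmap-mcp's tools or the jmap-jam API; the function name `build_email_search` and the choice of properties are assumptions made for illustration.

```python
import json


def build_email_search(account_id: str, text: str, limit: int = 10) -> dict:
    """Build a JMAP request that searches mail and fetches the matching headers."""
    return {
        # Capabilities the request relies on.
        "using": ["urn:ietf:params:jmap:core", "urn:ietf:params:jmap:mail"],
        "methodCalls": [
            # Each call is [method name, arguments, call id].
            [
                "Email/query",
                {
                    "accountId": account_id,
                    "filter": {"text": text},
                    "sort": [{"property": "receivedAt", "isAscending": False}],
                    "limit": limit,
                },
                "q0",
            ],
            # "#ids" is a result reference: reuse the ids returned by call "q0".
            [
                "Email/get",
                {
                    "accountId": account_id,
                    "#ids": {"resultOf": "q0", "name": "Email/query", "path": "/ids"},
                    "properties": ["subject", "from", "receivedAt"],
                },
                "g0",
            ],
        ],
    }


if __name__ == "__main__":
    print(json.dumps(build_email_search("example-account", "invoice"), indent=2))
```

Batching the query and the fetch into one request is what lets a client, or an MCP tool wrapping one, search and read mail in a single round trip.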

2025-08-16 Tags: jmap, mcp, email, deno, jmap-jam, github, hallux, solon, wyattjoh by klotz

Can LLMs replace on call SREs today?

**Experiment Goal:** Determine if LLMs can autonomously perform root cause analysis (RCA) on a live application.

Five LLMs were given access to OpenTelemetry data from a demo application:
* They were prompted with a naive instruction: "Identify the issue, root cause, and suggest solutions."
* Four distinct anomalies were used, each with a known root cause established through manual investigation.
* Performance was measured by: accuracy, guidance required, token usage, and investigation time.
* Models: Claude Sonnet 4, OpenAI o3, OpenAI GPT-4.1, Gemini 2.5 Pro

* **Autonomous RCA is not yet reliable.** The LLMs generally fell short of replacing SREs. Even GPT-5 (not explicitly tested, but implied as a benchmark) wouldn't outperform the others.
* **LLMs are useful as assistants.** They can help summarize findings, draft updates, and suggest next steps.
* **A fast, searchable observability stack (like ClickStack) is crucial.** LLMs need access to good data to be effective.
* **Models varied in performance:**
  * Claude Sonnet 4 and OpenAI o3 were the most successful, often identifying the root cause with minimal guidance.
  * GPT-4.1 and Gemini 2.5 Pro required more prompting and struggled to query data independently.
* **Models can get stuck in reasoning loops.** They may focus on one aspect of the problem and miss other important clues.
* **Token usage and cost varied significantly.**

**Specific Anomaly Results (briefly):**

* **Anomaly 1 (Payment Failure):** Claude Sonnet 4 and OpenAI o3 solved it on the first prompt. GPT-4.1 and Gemini 2.5 Pro needed guidance.
* **Anomaly 2 (Recommendation Cache Leak):** Claude Sonnet 4 identified the service restart issue but missed the cache problem initially. OpenAI o3 identified the memory leak. GPT-4.1 and Gemini 2.5 Pro struggled.

2025-08-16 Tags: hallux, click house, observability, llm, openai, claude, gemini, sre, automation, production engineering, lionel palacin, al brown by klotz

Perplexity Says Cloudflare Is Blocking Legitimate AI Assistants

Perplexity defends its AI assistants against Cloudflare’s claims, arguing that they are not web crawlers but user-triggered agents.

2025-08-05 Tags: perplexity, cloudflare, llm, assistant, crawler, robots.txt, hallux by klotz

An AI tool I find useful

This blog post details a personal code review tool built around `llm` and `git diff`. It describes installation, how it works, how the author uses it, and its advantages over GitHub's Copilot review tool.
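The core idea, piping `git diff` into the `llm` CLI with a system prompt, can be sketched in a few lines. This is an illustration in Python, not the author's script; the prompt text, the default base branch, and the function names `build_commands` and `review` are assumptions. It relies only on `git diff` and on `llm` reading its input from stdin with `-s` for the system prompt.

```python
import subprocess

# Hypothetical review prompt; the blog post's actual prompt is not reproduced here.
REVIEW_PROMPT = "Review this diff. Point out bugs, risky changes, and unclear code."


def build_commands(base: str = "main") -> tuple[list[str], list[str]]:
    """Return the git and llm command lines used for a review against `base`."""
    diff_cmd = ["git", "diff", base]
    llm_cmd = ["llm", "-s", REVIEW_PROMPT]  # -s sets the system prompt
    return diff_cmd, llm_cmd


def review(base: str = "main") -> str:
    """Collect the diff against `base` and ask the llm CLI to review it."""
    diff_cmd, llm_cmd = build_commands(base)
    diff = subprocess.run(diff_cmd, capture_output=True, text=True, check=True).stdout
    if not diff.strip():
        return "No changes to review."
    # The diff is passed on stdin, the same as `git diff main | llm -s "..."`.
    result = subprocess.run(llm_cmd, input=diff, capture_output=True, text=True, check=True)
    return result.stdout
```

Running `print(review("main"))` inside a repository requires both `git` and a configured `llm` installation on the PATH.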

2025-08-03 Tags: bash, hallux, code review, llm, git, automation, tooling, productivity by klotz

charmbracelet/crush

The glamourous AI coding agent for your favourite terminal

2025-08-03 Tags: coding, terminal, llm, lsp, code assistant, github, crush, charmbracelet, bash, hallux by klotz
