Tags: llm*


  1. Small, inexpensive single-board computers like the Raspberry Pi 5 are becoming viable platforms for running large language models (LLMs) locally. Using quantization to reduce model size and memory requirements, users can run compressed versions of popular models such as Llama 3, Mistral, and Qwen. While processing speeds remain limited compared to high-end GPUs, these devices offer a private, low-cost way to run AI workloads for specific tasks.

    - Quantization allows large models to fit into the Pi's limited RAM by reducing numerical precision.
    - Tiny models (1B-3B parameters) run comfortably, while 7B-parameter models are usable on 8GB boards if expectations are managed.
    - Performance sits in the low single digits of tokens per second, making these devices suitable for non-real-time tasks.
    - Hardware upgrades like the Raspberry Pi AI HAT+ or external eGPUs can significantly boost neural processing capabilities.
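    The memory claims in the bullets above can be sanity-checked with simple arithmetic. A rough sketch (the 1.1 overhead factor is an assumption, not a measured figure) of how quantized model size relates to the Pi's RAM:

```python
def quantized_model_size_gb(n_params_b: float, bits_per_weight: float,
                            overhead_factor: float = 1.1) -> float:
    """Rough file-size / RAM estimate for a quantized model.

    n_params_b is the parameter count in billions; overhead_factor is a
    loose allowance for embeddings and runtime buffers. Real usage also
    grows with context length (KV cache).
    """
    return n_params_b * (bits_per_weight / 8) * overhead_factor

print(round(quantized_model_size_gb(7, 4), 2))  # ~3.85 GB: tight on an 8GB Pi 5
print(round(quantized_model_size_gb(3, 4), 2))  # ~1.65 GB: comfortable
```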
  2. Researchers have identified a significant security flaw in Anthropic's Model Context Protocol, which is designed to connect Large Language Models with external tools. The protocol's architecture allows for remote command execution because the parameters used to create server instances can contain arbitrary commands that are executed in a server-side shell without proper input sanitization. This vulnerability has been demonstrated on platforms like LettaAI, LangFlow, Flowise, and Windsurf. When researchers brought these findings to Anthropic, the company responded that there was no design flaw and stated it is the developer's responsibility to implement sanitization.
    Key points:
    - MCP architecture facilitates remote command execution (RCE) via StdioServerParameters.
    - Lack of input sanitization allows arbitrary commands and arguments in server-side shells.
    - Exploitation has been successful against LettaAI, LangFlow, Flowise, and Windsurf.
    - Anthropic maintains the protocol works as designed, placing responsibility on developers for security implementation.
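    The class of bug is easiest to see next to its fix. A minimal sketch (the allowlist and argument filter are illustrative, not MCP's actual API): validate caller-supplied server parameters and build an argv list for shell-free execution, rather than interpolating them into a shell string.

```python
ALLOWED_COMMANDS = {"python", "node", "uvx"}  # hypothetical allowlist

def launch_args(command: str, args: list[str]) -> list[str]:
    """Validate untrusted server parameters, returning an argv list
    suitable for subprocess.run(argv, shell=False) -- never a shell string."""
    if command not in ALLOWED_COMMANDS:
        raise ValueError(f"command not allowed: {command!r}")
    for a in args:
        if any(ch in a for ch in ";|&$`\n"):
            raise ValueError(f"suspicious argument: {a!r}")
    return [command, *args]

print(launch_args("python", ["server.py"]))         # ['python', 'server.py']
try:
    launch_args("python", ["server.py; rm -rf /"])  # injection attempt
except ValueError as e:
    print(e)                                        # rejected, never reaches a shell
```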
  3. A practical pipeline for classifying messy free-text data into meaningful categories using a locally hosted LLM, no labeled training data required.
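    A minimal sketch of how such a pipeline can work (the category names and JSON-only prompt are illustrative, and `call_llm` stands in for whatever local inference endpoint is used):

```python
import json

CATEGORIES = ["billing", "bug report", "feature request", "other"]

PROMPT = (
    "Classify the message into exactly one of these categories: "
    + ", ".join(CATEGORIES)
    + '. Reply with JSON only, e.g. {"category": "billing"}.\n\nMessage: '
)

def classify(message: str, call_llm) -> str:
    """Zero-shot classification: no labeled training data, just a prompt.

    call_llm takes a prompt string and returns the raw completion from
    a local model (llama.cpp, Ollama, etc.).
    """
    raw = call_llm(PROMPT + message)
    category = json.loads(raw)["category"]
    # Constrain the model's answer to the known label set.
    return category if category in CATEGORIES else "other"

# Stubbed model for illustration:
fake_llm = lambda prompt: '{"category": "billing"}'
print(classify("Why was I charged twice?", fake_llm))  # billing
```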
  4. STCLab's SRE team shares their experience building an AI-driven investigation pipeline to automate the triage of Kubernetes alerts. Using HolmesGPT, they implemented a ReAct pattern that lets LLMs autonomously select tools like Prometheus, Loki, and kubectl based on the alert's context. The core finding was that high-quality markdown runbooks containing exclusion rules mattered more for successful investigations than the underlying AI model itself.
    Key points:
    * Implementation of HolmesGPT using the ReAct agent pattern for autonomous troubleshooting.
    * Integration with Robusta to manage Slack routing, deduplication, and thread matching.
    * The vital role of runbooks in narrowing search spaces and reducing wasted tool calls.
    * Comparison between self-hosted models via KubeAI and managed API approaches.
    * Significant reduction in manual triage time from 20 minutes to under two minutes per investigation.
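    The ReAct pattern from the first bullet reduces to a loop in which the model alternates between choosing a tool and reading its observation. A sketch (the tool implementations and scripted chooser are stand-ins for HolmesGPT's actual Prometheus/Loki/kubectl integrations):

```python
# Hypothetical tool registry standing in for Prometheus/Loki/kubectl wrappers.
TOOLS = {
    "get_pod_logs": lambda alert: f"logs for {alert['pod']}: OOMKilled",
    "query_metrics": lambda alert: f"memory usage for {alert['pod']}: 98%",
}

def react_investigate(alert, choose_action, max_steps=5):
    """Minimal ReAct loop: choose_action (the LLM) reads the transcript
    and either picks a tool to call or emits a final answer."""
    transcript = [f"alert: {alert['summary']}"]
    for _ in range(max_steps):
        action, arg = choose_action(transcript)
        if action == "final_answer":
            return arg
        observation = TOOLS[action](alert)
        transcript.append(f"{action} -> {observation}")
    return "investigation inconclusive"

def scripted_chooser(transcript):
    """Canned decisions for illustration; a real agent prompts the model."""
    if len(transcript) == 1:
        return ("get_pod_logs", None)
    return ("final_answer", "pod OOMKilled; raise memory limit")

alert = {"summary": "CrashLoopBackOff", "pod": "api-7f"}
print(react_investigate(alert, scripted_chooser))
```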
  5. This quickstart guide provides a step-by-step walkthrough for building, testing, and deploying AI agents using the Amazon Bedrock AgentCore CLI.

    - Code-based agents for full orchestration control using frameworks like LangGraph or OpenAI Agents.
    - A managed harness preview for rapid configuration-based deployment.
  6. A social network designed for AI scientists where autonomous agents share, debate, and discuss research papers. In this ecosystem, humans configure the agents and observe their interactions, but only the AI agents are permitted to post content. The platform features Flamebird, an autonomous agent runtime, to facilitate these scientific discussions.
  7. Espressif Systems has introduced the ESP-Claw framework, designed to enable ESP32 devices to function as local AI agents. The framework allows hardware to interact with Large Language Models (LLMs) to make decisions and execute actions locally without requiring constant cloud connectivity. It supports natural language conversation for defining device behavior through chat coding and utilizes Lua scripts for deterministic execution.
    Key features include:
    - Local event bus driving millisecond-latency responses via Lua rules.
    - MCP Server and Client capabilities for hardware exposure and external service calling.
    - On-chip private memory for long-term context retention without data leaving the device.
    - Support for various messaging platforms including Telegram, WeChat, and Feishu.
    - Compatibility with LLM providers and models such as OpenAI (ChatGPT) and Qwen.
    - Current support for ESP32-S3 with upcoming support for ESP32-P4.
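    The local event bus in the first feature bullet follows a common pattern: deterministic rules fire on device events without a cloud round trip. A generic sketch of that pattern in Python (not ESP-Claw's actual API, which expresses rules in Lua on-device):

```python
class EventBus:
    """Tiny local pub/sub bus: handlers run synchronously on emit,
    so latency is bounded by the handlers themselves, not the network."""

    def __init__(self):
        self.rules = {}

    def on(self, event, handler):
        self.rules.setdefault(event, []).append(handler)

    def emit(self, event, payload):
        return [handler(payload) for handler in self.rules.get(event, [])]

bus = EventBus()
# Deterministic rule: toggle an LED on odd/even press counts.
bus.on("button_press", lambda p: f"led {'on' if p['count'] % 2 else 'off'}")
print(bus.emit("button_press", {"count": 1}))  # ['led on']
```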
    2026-04-23, by klotz
  8. A comprehensive curated collection of Large Language Model (LLM) architecture figures and technical fact sheets. This gallery provides a visual and data-driven overview of modern model designs, ranging from classic dense architectures like GPT-2 to advanced sparse Mixture-of-Experts (MoE) systems and hybrid attention models. Users can explore detailed specifications including parameter scales, context windows, attention mechanisms, and intelligence indices for various prominent models.
    Key features include:
    * Detailed architecture fact sheets for a wide array of models such as Llama, DeepSeek, Qwen, Gemma, and Mistral.
    * An architecture diff tool to compare two different model designs side-by-side.
    * Comparative analysis across dense, MoE, MLA, and hybrid decoder families.
    * Links to original source articles and technical reports for deeper research.
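    One number such fact sheets surface is the gap between a sparse MoE model's total and active parameter counts. A sketch of that arithmetic (the example sizes are made up for illustration; real fact sheets break this down per layer):

```python
def moe_param_counts(n_experts, experts_per_token, expert_params_b,
                     shared_params_b):
    """Total vs. active parameter count for a sparse MoE model, in billions.

    All experts contribute to the total (and to memory), but only the
    routed experts_per_token contribute to compute per token.
    """
    total = shared_params_b + n_experts * expert_params_b
    active = shared_params_b + experts_per_token * expert_params_b
    return total, active

# e.g. 64 experts of 2B each, routed 2-per-token, over a 10B shared trunk:
total, active = moe_param_counts(64, 2, 2, 10)
print(total, active)  # 138 14
```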
  9. An exploration of the new Qwen3.6-27B open-weight model, which claims flagship-level agentic coding performance that surpasses previous, larger MoE models while being significantly smaller. The author tests a quantized version using llama-server and demonstrates its impressive ability to generate complex SVG graphics locally.
    Key points:
    - Qwen3.6-27B outperforms the older Qwen3.5-397B-A17B on coding benchmarks.
    - Dramatic reduction in model size from 807GB to approximately 55.6GB for the base version.
    - Successful local execution using a 16.8GB quantized GGUF version via llama.cpp.
    - High-quality SVG generation capabilities for complex prompts like a pelican riding a bicycle.
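    The quoted file sizes let you back out the average quantization width. A quick check (assuming the file is essentially all weights):

```python
def effective_bits_per_weight(file_size_gb: float, n_params_b: float) -> float:
    """Average bits per weight implied by a quantized model file size."""
    return file_size_gb * 8 / n_params_b

# 16.8 GB over 27B weights works out to ~5 bits/weight, i.e. a mid-range quant.
print(round(effective_bits_per_weight(16.8, 27), 2))
```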
  10. The author explains how using GPT-4 for a nightly data extraction pipeline caused constant failures due to its non-deterministic nature. Even with strict prompting and temperature settings, the model would occasionally change key names or formatting, breaking the automated workflow. To solve this, the team switched to running smaller local models like Qwen2.5 via Ollama. By using seeded inference on their own hardware, they achieved the consistency needed for a reliable pipeline, finding that while small models lack GPT-4's reasoning depth, they are much better at performing repetitive, structured tasks identically every time.
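    A sketch of the kind of pinned request this implies, following Ollama's /api/generate request shape (`seed` and `temperature` are documented Ollama options; the model name and prompt here are placeholders, and determinism still assumes the same model file and hardware):

```python
import json

def deterministic_request(model: str, prompt: str, seed: int = 42) -> dict:
    """Request body for Ollama's /api/generate endpoint, pinned for
    repeatability: fixed seed, temperature 0, non-streaming JSON output."""
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "format": "json",  # ask for structured output to stabilize key names
        "options": {"seed": seed, "temperature": 0},
    }

body = deterministic_request("qwen2.5", "Extract the fields as JSON: ...")
print(json.dumps(body, indent=2))
```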

SemanticScuttle - klotz.me: tagged with "llm"
