Summarizer Service is a bare-bones sample Flask web application using Flask, using subprocesses and custom scripts. The application provides text web page summarization for bookmarking services, accepting both GET and POST requests with or without a custom prompt.
Simon Willison recently delivered a talk during the Mastering LLMs: A Conference For Developers & Data Scientists, which was a six-week long online event. The talk centered around Simon's LLM Python command-line utility and its plugins, emphasizing how they can be utilized to explore Large Language Models (LLMs) and perform various tasks. Last week, he discussed accessing LLMs from the command-line, sharing valuable insights and techniques with the audience.
This is a hands-on guide with Python example code that walks through the deployment of an ML-based search API using a simple 3-step approach. The article provides a deployment strategy applicable to most machine learning solutions, and the example code is available on GitHub.
A light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models. It is based on LoRA, a training paradigm where most weights are frozen and only 1-2% additional weights in the form of low-rank matrix perturbations are trained.
Learn how to build an open LLM app using Hermes 2 Pro, a powerful LLM based on Meta's Llama 3 architecture. This tutorial explains how to deploy Hermes 2 Pro locally, create a function to track flight status using FlightAware API, and integrate it with the LLM.
This article describes the process of implementing function-calling in an AI system, specifically using the Mistral AI platform. The example showcases the development of an assistant that can manage a home automation system through natural language interactions with the user, including the use of available functions, function logic, and the integration of these functions into the AI system.
Leverage validation functions to prevent your LLM outputs from falling off a cliff. This article discusses how to use Python Guardrails to improve the reliability of LLM outputs by validating them using custom functions.
This article introduces Google's top AI applications, providing a guide on how to start using them, including Google Gemini, Google Cloud, TensorFlow, Experiments with Google, and AI Hub.
"The paper introduces a technique called LoReFT (Low-rank Linear Subspace ReFT). Similar to LoRA (Low Rank Adaptation), it uses low-rank approximations to intervene on hidden representations. It shows that linear subspaces contain rich semantics that can be manipulated to steer model behaviors."
This article guides you through the process of building a simple agent in LangChain using Tools and Toolkits. It explains the basics of Agents, their components, and how to build a Mathematics Agent that can perform simple mathematical operations.