SemanticScuttle - klotz.me » klotz: llm+nlp

klotz: llm* + nlp*

Bookmarks on this page are managed by an admin user.

A Complete Guide to BERT with Code: History, Architecture, Pre-training, and Fine-tuning This bookmark is certified by an admin user.

In this article, we will explore various aspects of BERT, including the landscape at the time of its creation, a detailed breakdown of the model architecture, and writing a task-agnostic fine-tuning pipeline, which we demonstrated using sentiment analysis. Despite being one of the earliest LLMs, BERT has remained relevant even today, and continues to find applications in both research and industry.

2024-05-28 Tags: bert, llm, embedding, google, nlp, encoder-only, transformer by klotz

Combining the Best of Both Worlds: Retrieval-Augmented Generation for Knowledge-Intensive Natural Language Processing This bookmark is certified by an admin user.

This article discusses Retrieval-Augmented Generation (RAG) models, a new approach that addresses the limitations of traditional models in knowledge-intensive Natural Language Processing (NLP) tasks. RAG models combine parametric memory from pre-trained seq2seq models with non-parametric memory from a dense vector index of Wikipedia, enabling dynamic knowledge access and integration.

2024-05-28 Tags: retrieval-augmented generation, nlp, llm, parametric memory by klotz

Training and Finetuning Embedding Models with Sentence Transformers v3 This bookmark is certified by an admin user.

This article explains how to use the Sentence Transformers library to finetune and train embedding models for a variety of applications, such as retrieval augmented generation, semantic search, and semantic textual similarity. It covers the training components, dataset format, loss function, training arguments, evaluators, and trainer.

2024-05-28 Tags: sentence transformers, finetune, embedding, models, similarity, llm, huggingface by klotz

A Step-by-Step Guide to Representation Finetuning LLAMA3 This bookmark is certified by an admin user.

"The paper introduces a technique called LoReFT (Low-rank Linear Subspace ReFT). Similar to LoRA (Low Rank Adaptation), it uses low-rank approximations to intervene on hidden representations. It shows that linear subspaces contain rich semantics that can be manipulated to steer model behaviors."

2024-05-26 Tags: linear subspace, lora, representation, fine tuning, reft, stanford, nlp, python, llm by klotz

A Comprehensive Guide to Function Calling in LLMs This bookmark is certified by an admin user.

Learn about function calling in Large Language Models (LLMs) and the list of commercial and open source LLMs suitable for function calling.

2024-05-21 Tags: function calling, llm, nlp by klotz

Researchers test AI systems' ability to solve the New York Times' connections puzzle This bookmark is certified by an admin user.

Researchers from NYU Tandon School of Engineering investigated whether modern natural language processing systems could solve the daily Connections puzzles from The New York Times. The results showed that while all the AI systems could solve some of the puzzles, they struggled overall.

2024-05-15 Tags: connections, puzzle, nyu, nlp, llm, gpt-3.5, gpt-4, bert, roberta, mpnet, minilm, ieee, games by klotz

A Beginner-Friendly Introduction to LLMs This bookmark is certified by an admin user.

This article provides a beginner-friendly introduction to Large Language Models (LLMs) and explains the key concepts in a clear and organized way.

2024-05-10 Tags: llm, introduction, bert, palm, gpt, llama by klotz

OpenAI Question Answering using Embedding and LLM This bookmark is certified by an admin user.

LangChain has many advanced retrieval methods to help address these challenges. (1) Multi representation indexing: Create a document representation (like a summary) that is well-suited for retrieval (read about this using the Multi Vector Retriever in a blog post from last week). (2) Query transformation: in this post, we'll review a few approaches to transform humans questions in order to improve retrieval. (3) Query construction: convert human question into a particular query syntax or language, which will be covered in a future post

2024-05-06 Tags: openai, llm, embedding, search by klotz

Clone the Abilities of Powerful LLMs into Small Local Models Using Knowledge Distillation This bookmark is certified by an admin user.

This article explores how to boost the performance of small language models by using supervision from larger ones through knowledge distillation. The article provides a step-by-step guide on how to distill knowledge from a teacher model (LLama 2–70B) to a student model (Tiny-LLama) using unlabeled in-domain data and targeted prompting.

2024-04-06 Tags: nlp, llm, knowledge distillation, tiny-llama, llama 2, machine learning by klotz

Using XML Schema in AI System Prompts: A Comprehensive Guide This bookmark is certified by an admin user.

This article explores the application of XML Schema in AI systems and prompts. XML Schema provides a structured way to describe and validate data, making it an essential tool for AI systems that deal with data. The author discusses how XML Schema can be used to create and manage data in AI applications, such as speech recognition and natural language processing. The article also covers the benefits of using XML Schema in AI systems, including improved data consistency, interoperability, and security. Lastly, the author provides some examples of XML Schema usage in AI systems and discusses the future of XML Schema in AI technology.

2024-04-04 Tags: xml schema, prompt engineering, nlp, llm, xml by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: llm* + nlp*

Linked Tags

Related Tags