klotz: machine learning*


"Machine learning is a subset of artificial intelligence in the field of computer science that often uses statistical techniques to give computers the ability to "learn" (i.e., progressively improve performance on a specific task) with data, without being explicitly programmed.

https://en.wikipedia.org/wiki/Machine_learning


  1. This is a hands-on guide with Python example code that walks through the deployment of an ML-based search API using a simple 3-step approach. The article provides a deployment strategy applicable to most machine learning solutions, and the example code is available on GitHub.
  2. The highlighted articles cover a variety of topics, including algorithmic thinking for data scientists, outlier detection in time-series data, route optimization for visiting NFL teams, a solution to the minimum vertex coloring problem, high-cardinality features, multilingual RAG (Retrieval-Augmented Generation) system development, fine-tuning smaller transformer models, long-form visual understanding, multimodal image-text models, the theoretical underpinnings of learning, data science stress management, and reinforcement learning.
  3. This tutorial covers fine-tuning BERT for sentiment analysis using Hugging Face Transformers. Learn to prepare data, set up environment, train and evaluate the model, and make predictions.
  4. A lightweight codebase that enables memory-efficient and performant fine-tuning of Mistral's models. It is based on LoRA, a training paradigm in which most weights are frozen and only 1-2% additional weights, in the form of low-rank matrix perturbations, are trained.
  5. An article discussing the importance of explainability in machine learning and the challenges posed by neural networks. It highlights the difficulties in understanding the decision-making process of complex models and the need for more transparency in AI development.
  6. The Raspberry Pi AI Kit, developed in collaboration with Hailo, allows you to integrate local, high-performance, power-efficient inferencing into a wide variety of applications. It's available now from Raspberry Pi Approved Resellers for $70.
  7. Discusses the trends in Large Language Models (LLMs) architecture, including the rise of more GPU, more weights, more tokens, energy-efficient implementations, the role of LLM routers, and the need for better evaluation metrics, faster fine-tuning, and self-tuning.
  8. An article discussing a simple and free way to automate data workflows using Python and GitHub Actions, written by Shaw Talebi.
  9. This article discusses a method for automatically curating high-quality datasets for self-supervised pre-training of machine learning systems. The method involves successive and hierarchical applications of k-means on a large and diverse data repository to obtain clusters that distribute uniformly among data concepts, followed by a hierarchical, balanced sampling step from these clusters. The experiments on three different data domains show that features trained on the automatically curated datasets outperform those trained on uncurated data while being on par or better than ones trained on manually curated data.
  10. This article is part of a series titled ‘LLMs from Scratch’, a complete guide to understanding and building Large Language Models (LLMs). In this article, we discuss the self-attention mechanism and how it is used by transformers to create rich and context-aware transformer embeddings.

    The Self-Attention mechanism is used to add context to learned embeddings, which are vectors representing each word in the input sequence. The process involves the following steps:

    1. Learned Embeddings: These are the initial vector representations of words, learned during the training phase. The weight matrix holding the learned embeddings resides in the first linear layer of the Transformer architecture.

    2. Positional Encoding: This step adds positional information to the learned embeddings. Positional information helps the model understand the order of the words in the input sequence, as transformers process all words in parallel, and without this information, they would lose the order of the words.

    3. Self-Attention: The core of the Self-Attention mechanism is to update the learned embeddings with context from the surrounding words in the input sequence. This mechanism determines which words provide context to other words, and this contextual information is used to produce the final contextualized embeddings.
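    The LoRA scheme summarized in bookmark 4 can be sketched in a few lines of NumPy (a toy illustration, not Mistral's actual codebase; the dimensions and rank are illustrative assumptions):

    ```python
    import numpy as np

    rng = np.random.default_rng(0)
    d_in, d_out, r = 512, 512, 4            # rank r << d gives the low-rank bottleneck

    W = rng.normal(size=(d_out, d_in))      # pretrained weight, frozen during fine-tuning
    A = rng.normal(size=(r, d_in)) * 0.01   # trainable low-rank factor
    B = np.zeros((d_out, r))                # B starts at zero, so the perturbation is initially a no-op

    def lora_forward(x):
        # frozen path W @ x plus the low-rank update B @ (A @ x); only A and B are trained
        return W @ x + B @ (A @ x)

    x = rng.normal(size=d_in)
    assert np.allclose(lora_forward(x), W @ x)  # B = 0, so output matches the pretrained model

    trainable = A.size + B.size
    total = W.size + trainable
    print(f"trainable fraction: {trainable / total:.1%}")  # → trainable fraction: 1.5%
    ```

    With rank 4 on a 512×512 layer, the trained parameters are about 1.5% of the total, in line with the 1-2% figure quoted in the summary.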
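    The three steps above can be sketched end-to-end with NumPy (a toy illustration using random weights; the sinusoidal positional encoding follows the original Transformer paper, and all shapes are illustrative assumptions):

    ```python
    import numpy as np

    def softmax(x, axis=-1):
        e = np.exp(x - x.max(axis=axis, keepdims=True))
        return e / e.sum(axis=axis, keepdims=True)

    def self_attention(X, Wq, Wk, Wv):
        """Scaled dot-product self-attention over a sequence of embeddings X."""
        Q, K, V = X @ Wq, X @ Wk, X @ Wv
        d_k = Q.shape[-1]
        scores = Q @ K.T / np.sqrt(d_k)     # how strongly each word attends to every other word
        weights = softmax(scores, axis=-1)  # each row sums to 1
        return weights @ V                  # contextualized embeddings

    rng = np.random.default_rng(0)
    seq_len, d_model = 4, 8

    # Step 1: learned embeddings (random stand-ins for the trained weight matrix)
    X = rng.normal(size=(seq_len, d_model))

    # Step 2: add sinusoidal positional information
    pos = np.arange(seq_len)[:, None]
    dims = np.arange(d_model)[None, :]
    angle = pos / 10_000 ** (2 * (dims // 2) / d_model)
    pe = np.where(dims % 2 == 0, np.sin(angle), np.cos(angle))
    X = X + pe

    # Step 3: self-attention produces context-aware embeddings, one per input word
    Wq, Wk, Wv = (rng.normal(size=(d_model, d_model)) for _ in range(3))
    out = self_attention(X, Wq, Wk, Wv)
    print(out.shape)  # → (4, 8)
    ```

    Each output row has the same dimensionality as its input embedding, but is now a weighted mixture over the whole sequence, which is what makes the embeddings context-aware.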


