The article discusses using Large Language Model (LLM) embeddings as features in traditional machine learning models built with scikit-learn. It covers generating embeddings from text data with models such as Sentence Transformers and combining those embeddings with existing features to improve model performance. It walks through the practical steps: loading data, creating embeddings, and integrating them into a scikit-learn pipeline for tasks such as classification.
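As a minimal sketch of that workflow, the snippet below embeds a few short texts with the sentence-transformers package, concatenates the embeddings with a numeric feature, and trains a scikit-learn pipeline on the result. The model name all-MiniLM-L6-v2, the example texts, labels, and the numeric column are all illustrative assumptions, not details taken from the article.

```python
# Sketch: LLM embeddings as features in a scikit-learn pipeline.
# Assumes the sentence-transformers and scikit-learn packages are installed;
# all data below is hypothetical.
import numpy as np
from sentence_transformers import SentenceTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

texts = [
    "great product, works exactly as advertised",
    "terrible quality, broke after two days",
    "fast shipping and friendly support",
    "refund took weeks and nobody answered emails",
]
labels = [1, 0, 1, 0]  # hypothetical binary sentiment labels

# Generate dense embeddings from the raw text.
model = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = model.encode(texts)  # ndarray, shape (n_samples, 384)

# Concatenate the embeddings with an existing numeric feature
# (here a hypothetical star rating) to form the full feature matrix.
ratings = np.array([[4.8], [1.2], [4.5], [1.9]])
X = np.hstack([embeddings, ratings])

# Train a standard scikit-learn pipeline on the combined features.
clf = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
clf.fit(X, labels)

# Score a new, unseen example the same way.
new = np.hstack([model.encode(["arrived damaged and late"]), [[2.0]]])
print(clf.predict(new))
```

Standardizing before the linear classifier is a common choice here, since dense embeddings and raw tabular columns often live on very different scales.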
Variable importance measures for random forests have been receiving increased attention as a means of variable selection in many classification tasks in bioinformatics and related scientific fields, for instance, to select a subset of genetic markers relevant for the prediction of a certain disease. We show that random forest variable importance measures are a sensible means for variable selection in many applications, but are not reliable in situations where potential predictor variables vary in their scale of measurement or their number of categories. This is particularly important in genomics and computational biology, where predictors often include variables of different types, for example when predictors include both sequence data and continuous variables such as folding energy, or when amino acid sequence data have different numbers of categories.
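A small simulation makes the described unreliability concrete. In the sketch below (a hypothetical illustration, not code from the paper), every predictor is generated independently of a random binary response, so an unbiased importance measure would rank all four roughly equally; with scikit-learn's impurity-based (Gini) importances, the continuous and high-cardinality categorical predictors typically score highest simply because they offer more candidate split points.

```python
# Sketch: cardinality bias in impurity-based random forest importances.
# All predictors are independent of the response, so any systematic
# ranking among them reflects bias rather than signal.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
n = 1000

X = np.column_stack([
    rng.integers(0, 2, n),    # categorical predictor, 2 categories
    rng.integers(0, 4, n),    # categorical predictor, 4 categories
    rng.integers(0, 20, n),   # categorical predictor, 20 categories
    rng.normal(size=n),       # continuous predictor
])
y = rng.integers(0, 2, n)     # response drawn independently of X

forest = RandomForestClassifier(n_estimators=500, random_state=0).fit(X, y)
for name, imp in zip(["2 cats", "4 cats", "20 cats", "continuous"],
                     forest.feature_importances_):
    print(f"{name:>10}: {imp:.3f}")
```

Where this bias matters, permutation-based importances (e.g., sklearn.inspection.permutation_importance) are a commonly used alternative that is less sensitive to the number of candidate split points.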