Explores the role of conditional probability in understanding events and Bayes' theorem, with examples from regression analysis and everyday scenarios, demonstrating how the human brain itself runs on probabilistic machinery.
An article discussing recent updates and improvements in several foundation time-series models, including TimeGPT, TimesFM, MOIRAI, Tiny Time Mixers (TTM), and MOMENT. These models, initially released with significant impact, have since seen updates in benchmarks and model variants.
Exploring and exploiting the seemingly innocent theorem behind Double Machine Learning. The theorem, rooted in econometrics (the Frisch-Waugh-Lovell theorem), states that in a linear model predicting an outcome from multiple features, the coefficient on a specific feature can be recovered in two steps: regress both the outcome and that feature on the remaining features, then regress the outcome residuals on the feature residuals. When the remaining features capture all confounding, this residual-on-residual coefficient serves as the estimate of the feature's causal effect.
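The partialling-out idea that Double Machine Learning builds on can be checked numerically. The following is a minimal sketch, assuming the theorem in question is the Frisch-Waugh-Lovell theorem. The data are synthetic, and the names `ols`, `controls`, `treatment`, and `outcome` are illustrative choices rather than anything taken from the article. It compares the treatment coefficient from one full regression with the coefficient from a residual-on-residual regression.

```python
import numpy as np


def ols(X, y):
    """Return least-squares coefficients of y on the columns of X."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef


# Synthetic data (an assumption for illustration): true effect of treatment is 2.0.
rng = np.random.default_rng(0)
n = 500
controls = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])  # intercept + 2 controls
treatment = controls @ np.array([0.5, 1.0, -1.0]) + rng.normal(size=n)
outcome = 2.0 * treatment + controls @ np.array([1.0, 0.3, 0.7]) + rng.normal(size=n)

# Approach 1: a single regression of the outcome on treatment and controls.
full_coef = ols(np.column_stack([treatment, controls]), outcome)[0]

# Approach 2: partial the controls out of both variables, then regress
# the outcome residuals on the treatment residuals.
treatment_resid = treatment - controls @ ols(controls, treatment)
outcome_resid = outcome - controls @ ols(controls, outcome)
fwl_coef = ols(treatment_resid[:, None], outcome_resid)[0]

print(f"full regression:      {full_coef:.6f}")
print(f"residual-on-residual: {fwl_coef:.6f}")
```

The two printed coefficients should agree up to floating-point error. Double Machine Learning keeps the same two-step structure but replaces the linear first-stage regressions with flexible machine learning models.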
Discusses reasons why clustering in data science might not produce desired results and how to address these issues.
This article features a curated list of the top data science articles published in July, covering topics such as LLM apps, ChatGPT, data visualization, multi-agent AI systems, and essential data science skills for 2024.
An article discussing the current state, recent approaches, and future directions of prompt engineering in data and machine learning. It includes several links to relevant articles and tutorials on the topic.
An overview of the LIDA library, including how to get started, examples, and considerations going forward, with a focus on large language models (LLMs) and image generation models (IGMs) in data visualization and business intelligence.
This article discusses the importance of understanding and memorizing classification metrics in machine learning. The author shares their own experience and strategies for memorizing metrics such as accuracy, precision, recall, F1 score, and ROC AUC.
This article explains the PCA algorithm and its implementation in Python. It covers key concepts such as dimensionality reduction, eigenvectors, and eigenvalues. The tutorial aims to provide a solid understanding of the algorithm's inner workings and its application to high-dimensional data and the curse of dimensionality.
This tutorial covers fine-tuning BERT for sentiment analysis using Hugging Face Transformers. Learn to prepare the data, set up the environment, train and evaluate the model, and make predictions.