klotz: eda*

0 bookmark(s) - Sort by: Date ↓ / Title / - Bookmarks from other users for this tag

  1. This article explains Pair Plots (Scatter Matrices) in Python for exploratory data analysis, showing pairwise relationships between numerical variables using scatter plots and distribution plots.

    The article provides the following Python code using `seaborn` and `matplotlib` to create a pair plot:

    ```python
    import seaborn as sns
    import matplotlib.pyplot as plt
    import pandas as pd
    import numpy as np

    # Create some random data
    data = np.random.rand(100, 4)
    df = pd.DataFrame(data, columns= 'A', 'B', 'C', 'D' » )

    # Create the pair plot
    sns.pairplot(df)

    # Show the plot
    plt.show()
    ```
  2. Hex introduces Advanced Compute Profiles for demanding workflows, offering more CPU, RAM, and GPUs. It also features Explore, a fast, flexible no-code data analysis tool. Hex emphasizes collaboration, AI integration, and a wide range of use cases including data science, operational reporting, and self-serve data tools.
  3. An article detailing how to build a flexible, explainable, and algorithm-agnostic ML pipeline with MLflow, focusing on preprocessing, model training, and SHAP-based explanations.
  4. ASCVIT V1 aims to make data analysis easier by automating statistical calculations, visualizations, and interpretations.

    Includes descriptive statistics, hypothesis tests, regression, time series analysis, clustering, and LLM-powered data interpretation.

    - Accepts CSV or Excel files. Provides a data overview including summary statistics, variable types, and data points.
    - Histograms, boxplots, pairplots, correlation matrices.
    - t-tests, ANOVA, chi-square test.
    - Linear, logistic, and multivariate regression.
    - Time series analysis.
    - k-means, hierarchical clustering, DBSCAN.

    Integrates with an LLM (large language model) via Ollama for automated interpretation of statistical results.
  5. This article demonstrates how to use Pandas plotting capabilities for common data visualization tasks, suggesting that Pandas can be sufficient for routine EDA without relying on libraries like Matplotlib.
  6. Exploratory data analysis (EDA) is a powerful technique to understand the structure of word embeddings, the basis of large language models. In this article, we'll apply EDA to GloVe word embeddings and find some interesting insights.
  7. 2020-10-06 Tags: , , , , , , by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: Tags: eda

About - Propulsed by SemanticScuttle