klotz: pandas* + python*


  1. The author describes building a personal, open-source computational engine from the Python libraries SymPy, NumPy, pandas, SciPy, statsmodels, Pingouin, Matplotlib, and Seaborn, effectively replicating the functionality of Wolfram Mathematica at no cost.
  2. This tutorial compares Polars and pandas, covering syntax, performance, LazyFrames, conversions, and plotting to help you choose the right library for your data analysis needs.
  3. Learn how to combine Streamlit, Plotly, DuckDB, and pandas into a simple yet intuitive dashboard that visualizes data from a JSON file.
  4. This video course introduces DuckDB, an open-source database for data analytics in Python. It covers creating databases from files (Parquet, CSV, JSON), querying with SQL and the Python API, concurrent access, and integration with pandas and Polars.
  5. Local Large Language Models can convert massive DataFrames to presentable Markdown reports — here's how.
    2025-06-03, by klotz
  6. Pandas 3.0 is expected to improve performance by relying more heavily on PyArrow, most visibly through a PyArrow-backed default string dtype, enabling faster loading and reading of columnar data. NumPy remains the backend for most other dtypes.
  7. These one-liners provide quick and effective ways to assess the quality and consistency of the data within a Pandas DataFrame.

    | Code Snippet | Explanation |
    | --- | --- |
    | `df.isnull().sum()` | Counts the number of missing values per column. |
    | `df.duplicated().sum()` | Counts the number of duplicate rows in the DataFrame. |
    | `df.describe()` | Provides basic descriptive statistics of numerical columns. |
    | `df.info()` | Displays a concise summary of the DataFrame including data types and presence of null values. |
    | `df.nunique()` | Counts the number of unique values per column. |
    | `df.apply(lambda x: x.nunique() / x.count() * 100)` | Computes, for each column, the percentage of unique values among its non-null values. |
    | `df.isin([value]).sum()` | Counts, per column, how many cells equal a specific value. |
    | `df.applymap(lambda x: isinstance(x, type_to_check)).sum()` | Counts the number of values of a specific type (e.g., int, str) per column. `applymap` is deprecated since pandas 2.1 in favor of `df.map`. |
    | `df.dtypes` | Lists the data type for each column in the DataFrame. |
    | `df.sample(n)` | Returns a random sample of n rows from the DataFrame. |
  8. Covers lesser-known pandas functions for data transformation and analysis that can sharpen the data manipulation skills of data scientists using Python.
  9. Reset a pandas DataFrame index
    2024-11-07, by klotz
  10. This article demonstrates how to use pandas' built-in plotting methods for common data visualization tasks, suggesting that they can be sufficient for routine EDA without calling Matplotlib directly (pandas still uses Matplotlib as its default plotting backend).
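
The data-quality one-liners tabulated under item 7 can be tried on a small frame. The following is a minimal sketch, not taken from the linked article: the toy `df`, its column names, and the result variable names are all made up for illustration.

```python
import pandas as pd

# Hypothetical toy data: one missing value and one fully duplicated row.
df = pd.DataFrame(
    {
        "city": ["Oslo", "Lima", "Oslo", None],
        "temp": [3.5, 21.0, 3.5, 18.2],
    }
)

# Missing values per column.
missing_per_column = df.isnull().sum()

# Number of rows that repeat an earlier row exactly.
duplicate_rows = int(df.duplicated().sum())

# Distinct non-null values per column.
unique_per_column = df.nunique()

# Share of distinct values among the non-null values, in percent.
pct_unique = df.apply(lambda col: col.nunique() / col.count() * 100)

# Per-column count of cells equal to a given value; isin expects a list-like.
oslo_hits = df.isin(["Oslo"]).sum()

print(missing_per_column, duplicate_rows, unique_per_column, sep="\n")
print(pct_unique.round(1), oslo_hits, sep="\n")
```

Note that each `.sum()` here aggregates per column and returns a Series; chaining a second `.sum()` would give a single total for the whole frame.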

