Tags: data engineering*

0 bookmark(s) - Sort by: Date ↓ / Title /

  1. A guide to tracking in MLOps, covering code, data, and machine learning model tracking
  2. Airbyte is an open-source data integration engine that helps you consolidate your data in your data warehouses, lakes and databases.
  3. This article provides Python tricks and techniques for data ingestion, validation, processing, and testing in data engineering projects. It offers practical solutions for streamlining the code, including tips for data validation, handling errors, and testing.
    2024-06-13 Tags: , by klotz
  4. An exploration of the benefits of switching from the popular Python library Pandas to the newer Polars for data manipulation tasks, highlighting improvements in performance, concurrency, and ease of use.
  5. An in-process analytics database, DuckDB can work with surprisingly large data sets without having to maintain a distributed multiserver system. Best of all? You can analyze data directly from your Python app.
  6. An article discussing a simple and free way to automate data workflows using Python and GitHub Actions, written by Shaw Talebi.
  7. Learn data engineering through free courses, tutorials, books, tools, guides, roadmaps, practice exercises, projects, and other resources.
  8. This article describes how to use GNU Emacs for quick data visualization in combination with Gnuplot. It provides a command that can be used to visualize the correlation of data without needing any setup or specific files. The article also includes an example of a command for generating a graph using a data range selected with a rectangle command copy-rectangle.
  9. - standardization, governance, simplified troubleshooting, and reusability in ML application development.
    - integrations with vector databases and LLM providers to support new applications -
    provides tutorials on integrating
  10. Notebooks are not enough for ML at scale

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: tagged with "data engineering"

About - Propulsed by SemanticScuttle