This video course introduces DuckDB, an open-source database for data analytics in Python. It covers creating databases from files (Parquet, CSV, JSON), querying with SQL and the Python API, concurrent access, and integration with pandas and Polars.
A guide to building a front-end data application using Taipy, comparing it to Streamlit and Gradio, and providing a step-by-step implementation of a sales performance dashboard.
LlamaExtract is a powerful, easy-to-use tool that allows users to extract structured data from unstructured documents with minimal effort, available through LlamaCloud’s web UI and Python SDK.
This article introduces Streamlit, a Python library for building data dashboards, as a solution for Python programmers to create graphical front-ends without needing to delve into CSS, HTML, or JavaScript. The author, a seasoned data engineer, explains how Streamlit and similar tools enable the creation of attractive dashboards, marking a shift from traditional tools like Tableau or Quicksight. This piece serves as the first in a series focusing on Streamlit, with future articles planned on Gradio and Taipy. The author aims to replicate similar layouts and functionalities across dashboards using consistent data.
This article introduces Path-Swarm and Super-Swarm, new techniques for creating swarm charts using circle arrangements for data visualization. The author, Nick Gerend, discusses two primary swarm techniques and some extensions for rapid visual exploration of data. Written for Towards Data Science.
A simple and fast data pipeline foundation with sophisticated functionality.