klotz: gcp*


  1. LLM EvalKit is a streamlined framework that helps developers design, test, and refine prompt‑engineering pipelines for Large Language Models (LLMs). It encompasses prompt management, dataset handling, evaluation, and automated optimization, all wrapped in a Streamlit web UI.

    Key capabilities:

    | Stage | What it does | Typical workflow |
    |-------|-------------|------------------|
    | **Prompt Management** | Create, edit, version, and test prompts (name, text, model, system instructions). | Define a prompt, load/edit existing ones, run quick generation tests, and maintain version history. |
    | **Dataset Creation** | Organize data for evaluation; load CSV, JSON, or JSONL files into GCS buckets. | Create dataset folders, upload files, preview items. |
    | **Evaluation** | Run model‑based or human‑in‑the‑loop metrics; compare outcomes across prompt versions. | Choose prompt + dataset, generate responses, score with metrics like “question‑answering‑quality”, save baseline results to a leaderboard. |
    | **Optimization** | Leverage Vertex AI’s prompt‑optimization job to automatically search for better prompts. | Configure the job (model, dataset, prompt), launch it, and monitor training in the Vertex AI console. |
    | **Results & Records** | Visualize optimization outcomes, compare versions, and maintain a record of performance over time. | View leaderboard, select best optimized prompt, paste new instructions, re‑evaluate, and track progress. |

    **Getting Started**

    1. Clone the repo, set up a virtual environment, install dependencies, and run `streamlit run index.py`.
    2. Configure `src/.env` with `BUCKET_NAME` and `PROJECT_ID`.
    3. Use the UI to create/edit prompts, datasets, and launch evaluations/optimizations as described in the tutorial steps.
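
    A minimal `src/.env` for step 2 might look like this (both values are placeholders for your own GCS bucket and GCP project):

    ```
    # src/.env -- consumed by the Streamlit app
    BUCKET_NAME=your-gcs-bucket
    PROJECT_ID=your-gcp-project
    ```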

    **Token Use‑Case**

    - **Prompt**: “Problem: {{query}}\nImage: {{image}} @@@image/jpeg\nAnswer: {{target}}”
    - **Example input JSON**: query, choices, image URL, target answer.
    - **Model**: `gemini-2.0-flash-001`.
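
    A plausible JSONL dataset record, assuming the field names mirror the template variables above (the exact schema and image URI format may differ):

    ```json
    {"query": "Which shape has exactly three sides?", "choices": ["square", "triangle", "circle"], "image": "gs://your-gcs-bucket/shapes/item-001.jpg", "target": "triangle"}
    ```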

    **License** – Apache 2.0.
  2. SRE.ai, a Y Combinator-backed startup, has raised $7.2 million to develop AI agents that automate complex enterprise DevOps workflows, offering chat-like experiences across multiple platforms.
  3. Spotify, humanity's digital jukebox, has been a data-driven company since day one, using data for everything from payments to experimentation. Managing that volume of data demanded a more streamlined approach, which led to the development of its internal data platform.

    **Event Delivery System:**
    - **On-Premises Setup:** Initially, Spotify used on-premises solutions like Kafka and HDFS. Event data from clients was captured, timestamped, and routed to a central Hadoop cluster.
    - **Google Cloud Transition:** In 2015, Spotify moved to Google Cloud Platform (GCP) for better scalability and reliability. Key components include File Tailer, Event Delivery Service, Reliable Persistent Queue, and ETL jobs using Dataflow and BigQuery.
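
    Spotify's production pipeline is proprietary, but the ETL stage can be sketched with Apache Beam's Python SDK, assuming events arrive on a Pub/Sub subscription and land in a BigQuery table (all resource names and the schema here are hypothetical):

    ```python
    import json

    import apache_beam as beam
    from apache_beam.options.pipeline_options import PipelineOptions

    def run():
        # Streaming job: read raw events, decode JSON, append rows to BigQuery.
        options = PipelineOptions(streaming=True, project="your-gcp-project",
                                  runner="DataflowRunner")
        with beam.Pipeline(options=options) as pipeline:
            (
                pipeline
                | "ReadEvents" >> beam.io.ReadFromPubSub(
                    subscription="projects/your-gcp-project/subscriptions/client-events")
                | "Decode" >> beam.Map(lambda raw: json.loads(raw.decode("utf-8")))
                | "WriteToBigQuery" >> beam.io.WriteToBigQuery(
                    "your-gcp-project:analytics.events",
                    schema="user_id:STRING,event_type:STRING,received_at:TIMESTAMP",
                    write_disposition=beam.io.BigQueryDisposition.WRITE_APPEND)
            )

    if __name__ == "__main__":
        run()
    ```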
  4. This is a hands-on guide with Python example code that walks through the deployment of an ML-based search API using a simple 3-step approach. The article provides a deployment strategy applicable to most machine learning solutions, and the example code is available on GitHub.
  5. In this article, we explore how to deploy and manage machine learning models using Google Kubernetes Engine (GKE), Google AI Platform, and TensorFlow Serving. We will cover the steps to create a machine learning model and deploy it on a Kubernetes cluster for inference.
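
    Once the model is behind TensorFlow Serving, clients call its REST predict endpoint. A minimal sketch, assuming the GKE Service exposes TF Serving's default REST port 8501 (the host, model name, and feature shape are placeholders):

    ```python
    import requests

    # TF Serving's REST predict endpoint; 8501 is the default REST port.
    SERVING_URL = "http://<gke-service-ip>:8501/v1/models/my_model:predict"

    def predict(instances):
        # The API takes {"instances": [...]} and returns {"predictions": [...]}.
        response = requests.post(SERVING_URL, json={"instances": instances})
        response.raise_for_status()
        return response.json()["predictions"]

    # Example call; the feature vector shape depends on the deployed model.
    print(predict([[1.0, 2.0, 5.0]]))
    ```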
  6. Notebooks are not enough for ML at scale. (2024-05-06, by klotz)
  7. Launched in 2007, Chess.com is a premium platform for online chess and one of the largest of its kind. A Cloud SQL for MySQL shop, it transitioned to Cloud SQL Enterprise Plus edition, improving the user experience, cutting costs, and significantly reducing response times, with p99 latency dropping from 14 ms to 4 ms.
  8. llm-tool provides a command-line utility for running large language models locally. It includes scripts for pulling models from the internet, starting them, and managing them with commands such as 'run', 'ps', 'kill', 'rm', and 'pull'. It also offers a Python script, 'querylocal.py', for querying these models.
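
    The repository's own `querylocal.py` is not reproduced here; as a rough sketch, querying a locally served model often looks like the following, assuming an Ollama-style `/api/generate` endpoint on its default port (the actual script may differ):

    ```python
    import requests

    # Hypothetical query against a local model server; assumes an Ollama-style
    # /api/generate endpoint on port 11434. querylocal.py may work differently.
    response = requests.post(
        "http://localhost:11434/api/generate",
        json={"model": "llama3", "prompt": "Explain GCS bucket naming rules.", "stream": False},
    )
    response.raise_for_status()
    print(response.json()["response"])
    ```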
