SemanticScuttle - klotz.me » Tags: llama.cpp+google

Tags: llama.cpp* + google*

0 bookmark(s) - Sort by: Date ↓ / Title /

AMD Rolls Out Gemma 4 Model Support Across Full Range of GPUs & CPUs

AMD now supports Google’s Gemma 4 models (2B–31B parameters) across its entire hardware lineup, including Instinct GPUs (datacenters), Radeon GPUs (workstations), and Ryzen AI processors (PCs). The integration is compatible with vLLM, SGLang, llama.cpp, Ollama, and Lemonade Server, aiming to optimize AI performance for both cloud and local deployment.

2026-04-05 Tags: amd, google, gemma 4, gpu, cpu, radeon, ryzen, models, vllm, sglang, llama.cpp, machine learning, hardware support, llm, hardware by klotz

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Google releases Gemma 3, a new iteration of their Gemma family of models. It ranges from 1B to 27B parameters, supports up to 128k tokens, accepts images and text, and supports 140+ languages. This article details its technical enhancements (longer context, multimodality, multilinguality) and provides information on inference with Hugging Face transformers, on-device deployment, and evaluation.

2025-04-03 Tags: gemma 3, llm, hugging face, llama.cpp, google by klotz

localllm/llm-tool at main · GoogleCloudPlatform/localllm

llm-tool provides a command-line utility for running large language models locally. It includes scripts for pulling models from the internet, starting them, and managing them using various commands such as 'run', 'ps', 'kill', 'rm', and 'pull'. Additionally, it offers a Python script named 'querylocal.py' for querying these models. The repository also come

2024-02-08 Tags: llm, localllama, self-hosted, google, gcp, foss, llama.cpp, github by klotz

GoogleCloudPlatform/localllm: Run LLMs locally on Cloud Workstations

- create a custom base image for a Cloud Workstation environment using a Dockerfile
. Uses:

Quantized models from

2024-02-08 Tags: llm, google, locallama, github, foss, gguf, huggingface, llama.cpp by klotz

LLM Large Language Model Toolkit: Google

The "LLM" toolkit offers a versatile command-line utility and Python library that allows users to work efficiently with large language models. Users can execute prompts directly from their terminals, store the outcomes in SQLite databases, generate embeddings, and perform various other tasks. In this extensive tutorial, topics covered include setup, usage, OpenAI models, alternative models, embeddings, plugins, model aliases, Python APIs, prompt templates, logging, related tools, CLI references, contributing, and change logs.

2024-02-08 Tags: llm, cli, google, llama.cpp by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: llama.cpp* + google*

Linked Tags

Related Tags