This article explores the architecture enabling AI chatbots to perform web searches, covering retrieval-augmented generation (RAG), vector databases, and the challenges of integrating search with LLMs.
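The RAG pipeline the article describes can be sketched in a few lines: embed the query, retrieve the most similar documents from a store, and prepend them to the prompt sent to the LLM. The snippet below is a minimal, self-contained illustration; the bag-of-words "embedding" and the tiny in-memory corpus are stand-ins for a real embedding model and vector database.

```python
import math
import re
from collections import Counter

# Toy corpus standing in for a vector database (hypothetical documents).
DOCS = [
    "The Eiffel Tower is in Paris and is 330 metres tall.",
    "Python is a programming language created by Guido van Rossum.",
    "Vector databases store embeddings for similarity search.",
]

def embed(text: str) -> Counter:
    """Bag-of-words 'embedding' -- a stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, k: int = 1) -> list[str]:
    """Rank documents by similarity to the query and return the top k."""
    q = embed(query)
    ranked = sorted(DOCS, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

def build_prompt(query: str) -> str:
    """Augment the user query with retrieved context before calling an LLM."""
    context = "\n".join(retrieve(query))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

print(build_prompt("How tall is the Eiffel Tower?"))
```

In a production system the retrieval step would hit a vector index (and possibly a web-search API), but the shape of the loop — retrieve, augment, generate — is the same.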
An open-source, multi-model AI chat playground built with the Next.js App Router. It allows users to switch between providers and models, compare outputs, and use web search and image attachments. It supports Gemini and OpenRouter as providers and can be deployed with Docker.
This repository contains the source code for summarize-and-chat, a unified document summarization and chat framework built on LLMs. It aims to address the challenges of building a scalable document-summarization solution while enabling natural-language interaction through chat interfaces.
A web GUI for Ollama that requires no installation, offering markdown rendering, keyboard shortcuts, a model manager, offline/PWA support, and an optional API for accessing more powerful models.
Pure C++ implementation of several models for real-time chatting on your computer (CPU), based on ggml.
This pull request adds StreamingLLM support for the llamacpp and llamacpp_HF loaders, aiming to improve performance and reliability. The changes allow indefinite chatting with the model without re-evaluating the prompt.
This PR implements the StreamingLLM technique for model loaders, focusing on handling context length and optimizing chat generation speed.
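The core idea behind StreamingLLM is a rolling KV cache with "attention sinks": keep the first few token entries (which the model attends to heavily) plus a recent window, and evict everything in between, so the cache never grows and the prompt never needs re-evaluation. A minimal sketch of that eviction rule, with a plain list standing in for per-token KV entries and the `n_sink`/`window` sizes as assumed knobs:

```python
def trim_cache(cache: list, n_sink: int = 4, window: int = 8) -> list:
    """StreamingLLM-style eviction: keep the first n_sink 'attention sink'
    entries plus the most recent `window` entries; drop the middle.
    `cache` is a stand-in for per-token KV-cache entries."""
    if len(cache) <= n_sink + window:
        return cache
    return cache[:n_sink] + cache[-window:]

# As generation proceeds, the cache is bounded by n_sink + window entries,
# so context length stays constant and old prompts are never re-evaluated.
cache = list(range(20))  # token positions 0..19
print(trim_cache(cache))  # -> [0, 1, 2, 3, 12, 13, 14, 15, 16, 17, 18, 19]
```

Real implementations also have to adjust positional encodings for the evicted span; this sketch only shows the eviction policy itself.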
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
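Memory decay of this kind is often modeled as an exponential falloff: a stored memory's retrieval score combines its semantic relevance with how recently it was accessed. The function below is a hypothetical sketch of that scoring, not this project's API; the `half_life` parameter is an assumed tuning knob.

```python
import time

def decayed_score(relevance: float, last_access: float,
                  now: float, half_life: float = 3600.0) -> float:
    """Combine semantic relevance with exponential recency decay.
    half_life: seconds for a memory's recency weight to halve (assumed knob)."""
    age = now - last_access
    recency = 0.5 ** (age / half_life)
    return relevance * recency

now = time.time()
fresh = decayed_score(0.9, now, now)         # just accessed: full relevance
stale = decayed_score(0.9, now - 7200, now)  # two half-lives old: quartered
print(round(fresh, 3), round(stale, 3))  # -> 0.9 0.225
```

Ranking memories by such a score lets short-term details fade automatically while frequently re-accessed memories stay retrievable, which is the usual rationale for optional decay in a memory layer.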
Improve GitHub Copilot Chat responses by indexing repositories for semantic code search, allowing better context-based answers to questions about code within a repository.
Sage is a tool that allows developers to chat with any codebase using two commands. It provides a functional chat interface for code, supports running locally or on the cloud, and has a modular design for swapping components.