Pure C++ implementation of several models for real-time chatting on your computer (CPU), based on ggml.
This pull request adds StreamingLLM support for llamacpp and llamacpp_HF models, aiming to improve performance and reliability. The changes allow indefinite chatting with the model without re-evaluating the prompt.
This PR implements the StreamingLLM technique for model loaders, focusing on handling context length and optimizing chat generation speed.
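The eviction idea usually associated with StreamingLLM can be sketched as follows. This is a minimal illustration of the general technique, not the code from the pull request: once the cache is full, keep the first few entries (the "attention sinks") plus the most recent window and drop the middle, so the existing cache is reused instead of re-evaluating the whole prompt. The function name `evict_cache` and the parameters `n_sink` and `max_len` are hypothetical, and a plain Python list stands in for the model's key/value cache.

```python
from typing import List, TypeVar

T = TypeVar("T")


def evict_cache(cache: List[T], n_sink: int, max_len: int) -> List[T]:
    """Return a cache of at most max_len entries.

    Keeps the first n_sink entries (attention sinks) and the most recent
    (max_len - n_sink) entries; everything in between is dropped.
    Hypothetical helper: a real loader would operate on the model's KV
    cache and would also have to handle token positions after eviction.
    """
    if n_sink < 0 or max_len <= n_sink:
        raise ValueError("max_len must be greater than n_sink, and n_sink >= 0")
    if len(cache) <= max_len:
        return list(cache)  # nothing to evict yet
    n_recent = max_len - n_sink
    return cache[:n_sink] + cache[-n_recent:]


# Simulate a chat that keeps generating tokens past the cache limit.
cache: List[int] = []
for token_id in range(10):
    cache.append(token_id)
    cache = evict_cache(cache, n_sink=2, max_len=6)

print(cache)  # [0, 1, 6, 7, 8, 9]
```

The cache size stays bounded no matter how long the conversation runs, which is what makes indefinite chatting possible in this scheme.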
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
Improve GitHub Copilot Chat responses by indexing repositories for semantic code search, allowing better context-based answers to questions about code within a repository.
Sage is a tool that allows developers to chat with any codebase using two commands. It provides a functional chat interface for code, supports running locally or on the cloud, and has a modular design for swapping components.
The latest news about Gemini. Chat to start writing, planning, learning and more with Google AI.
This post highlights how the GitHub Copilot Chat VS Code Extension was vulnerable to data exfiltration via prompt injection when analyzing untrusted source code.
Retrochat is a chat application that supports Llama.cpp, Kobold.cpp, and Ollama. The page highlights new features, lists commands for configuration, chat management, and models, and provides a download link for the release.
The article discusses Perplexica, an open-source AI-powered search tool that aims to address the limitations of traditional and proprietary AI-powered search engines. The tool uses large language models (LLMs) like Mixtral and Gemini to understand and process user queries, delivering relevant and insightful results. It allows searches to be conducted locally, ensuring privacy, and employs information retrieval techniques to fetch relevant web pages based on user queries. Perplexica offers focus modes for specific types of questions, including All Mode, Writing Assistant Mode, Academic Search Mode, YouTube Search Mode, and Wolfram Alpha Search Mode.