This pull request adds StreamingLLM support to the llamacpp and llamacpp_HF model loaders, improving performance and reliability. By using the StreamingLLM technique to manage the context length, it allows indefinite chatting with the model without re-evaluating the prompt, which speeds up chat generation.
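The gist of StreamingLLM is that when the context fills up, the cache keeps the first few "attention sink" tokens plus the most recent tokens and evicts the middle, and new requests only re-evaluate the part of the prompt that is not already covered by the cached tokens. Below is a minimal, illustrative Python sketch of that eviction/reuse logic; the function names (`trim_kv_tokens`, `common_prefix_len`), the `n_sink` default, and the token-list representation are assumptions for illustration and do not mirror the PR's actual code.

```python
def trim_kv_tokens(tokens, max_ctx, n_sink=4):
    """Sketch of StreamingLLM-style eviction (hypothetical helper, not the PR's API).

    Keep the first `n_sink` "attention sink" tokens plus as many of the most
    recent tokens as fit within `max_ctx`, dropping tokens from the middle.
    """
    if len(tokens) <= max_ctx:
        return tokens
    n_recent = max_ctx - n_sink
    return tokens[:n_sink] + tokens[-n_recent:]


def common_prefix_len(cached_tokens, prompt_tokens):
    """Length of the shared prefix between the cached tokens and the new prompt.

    Only the tokens after this prefix need to be evaluated, which is what
    avoids re-processing the whole prompt on every chat turn.
    """
    n = 0
    for a, b in zip(cached_tokens, prompt_tokens):
        if a != b:
            break
        n += 1
    return n


# Example usage (toy token IDs):
cached = list(range(100))            # tokens already in the KV cache
prompt = list(range(100)) + [7, 8]   # new turn appends two tokens
reuse = common_prefix_len(cached, prompt)
to_evaluate = prompt[reuse:]         # only [7, 8] need evaluation
new_cache = trim_kv_tokens(prompt, max_ctx=64)  # sinks + recent tokens
```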