Snowflake recently announced the launch of Arctic Embed L 2.0 and Arctic Embed M 2.0, two small and powerful embedding models tailored for multilingual search and retrieval. The models are available in medium and large variants, with the medium model incorporating 305 million parameters and the large variant with 568 million parameters. Both models support context lengths of up to 8,192 tokens. They demonstrate high-quality retrieval across multiple languages and excel in benchmarks like MTEB and CLEF.
The Towards Data Science team highlights recent articles on the rise of open-source LLMs, ethical considerations with chatbots, potential manipulation of LLM recommendations, and techniques for temperature scaling and re-ranking in generative AI.
SciPhi-AI/R2R is a framework for rapid development and deployment of production-ready RAG pipelines. The framework enables the deployment, customization, extension, autoscaling, and optimization of RAG pipeline systems, making it easier for the OSS community to use them. It includes several code examples and client applications that demonstrate application deployment and interaction. The core abstractions come in the form of ingestion, embedding, RAG, and eval pipelines.