etalab-ia / albert-models
Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.
☆37Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for albert-models
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆53Updated 3 weeks ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆55Updated 3 months ago
- ☆37Updated 11 months ago
- ☆53Updated 5 months ago
- Data preparation code for CrystalCoder 7B LLM☆42Updated 6 months ago
- Data preparation code for Amber 7B LLM☆84Updated 6 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆55Updated this week
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆193Updated 2 months ago
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆265Updated 3 weeks ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆83Updated 2 months ago
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆37Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆62Updated 3 weeks ago
- Open Source Text Embedding Models with OpenAI Compatible API☆131Updated 4 months ago
- A pipeline parallel training script for LLMs.☆83Updated this week
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆126Updated 5 months ago
- A pipeline for LLM knowledge distillation☆78Updated 3 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆81Updated this week
- ☆49Updated 2 months ago
- ☆51Updated 4 months ago
- ☆153Updated 2 months ago
- ☆106Updated 2 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆72Updated this week
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆80Updated 2 weeks ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆64Updated 11 months ago
- ☆73Updated 10 months ago
- Simple examples using Argilla tools to build AI☆42Updated this week
- Evaluation of bm42 sparse indexing algorithm☆62Updated 4 months ago
- ☆47Updated this week
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆89Updated this week
- ☆65Updated last month