Retrieval and Retrieval-augmented LLMs
☆11,479Mar 27, 2026Updated last week
Alternatives and similar repositories for FlagEmbedding
Users that are interested in FlagEmbedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆69,375Updated this week
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,871Sep 9, 2025Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆74,805Updated this week
- Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain…☆37,662Nov 10, 2025Updated 4 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,459Jun 2, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,651Oct 24, 2024Updated last year
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆31,832Updated this week
- MTEB: Massive Text Embedding Benchmark☆3,189Updated this week
- unified embedding model☆877Sep 1, 2023Updated 2 years ago
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆20,887Mar 5, 2026Updated 3 weeks ago
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, …☆13,391Mar 27, 2026Updated last week
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆8,287Oct 16, 2024Updated last year
- Question and Answer based on Anything.☆13,917Mar 24, 2025Updated last year
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆20,865Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- LlamaIndex is the leading document agent and OCR platform☆48,180Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models.☆25,041Updated this week
- Supercharge Your LLM Application Evaluations 🚀☆13,195Feb 24, 2026Updated last month
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,738Updated this week
- Fast and memory-efficient exact attention☆23,062Updated this week
- 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)☆18,963Jul 15, 2025Updated 8 months ago
- Train transformer language models with reinforcement learning.☆17,863Updated this week
- State-of-the-Art Text Embeddings☆18,459Mar 25, 2026Updated last week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆15,799Mar 4, 2026Updated 3 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆27,007Jan 9, 2026Updated 2 months ago
- A blazing fast inference solution for text embeddings models☆4,640Updated this week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆76,367Mar 27, 2026Updated last week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆41,925Mar 26, 2026Updated last week
- A series of large language models developed by Baichuan Intelligent Technology☆4,115Nov 8, 2024Updated last year
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆6,811Updated this week
- text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。☆4,957Feb 14, 2026Updated last month
- Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p…☆9,180Updated this week
- The agent engineering platform☆131,360Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆7,174Oct 30, 2025Updated 5 months ago
- ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型☆13,742Jan 13, 2025Updated last year
- ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型☆41,210Jun 27, 2024Updated last year
- Large Language Model Text Generation Inference☆10,815Mar 21, 2026Updated last week
- ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型☆15,631Jun 27, 2024Updated last year
- verl: Volcano Engine Reinforcement Learning for LLMs☆20,286Updated this week
- Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.☆58,639Updated this week