FlagOpen / FlagEmbedding
Retrieval and Retrieval-augmented LLMs
☆8,380Updated this week
Alternatives and similar repositories for FlagEmbedding:
Users that are interested in FlagEmbedding are comparing it to the libraries listed below
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆5,438Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆8,509Updated this week
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆4,585Updated last week
- Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 15…☆5,215Updated this week
- Supercharge Your LLM Application Evaluations 🚀☆8,053Updated this week
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆6,137Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆17,165Updated this week
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆6,734Updated last week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆5,674Updated last week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆39,315Updated this week
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆4,187Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆36,497Updated this week
- MTEB: Massive Text Embedding Benchmark☆2,139Updated this week
- Train transformer language models with reinforcement learning.☆11,140Updated this week
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆4,666Updated 2 weeks ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆22,098Updated this week
- Fast and memory-efficient exact attention☆15,318Updated this week
- Tools for merging pretrained large language models.☆5,202Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,210Updated 7 months ago
- A blazing fast inference solution for text embeddings models☆3,104Updated last week
- A framework for few-shot evaluation of language models.☆7,653Updated this week
- PyTorch native post-training library☆4,789Updated this week
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆5,386Updated 6 months ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆16,111Updated this week
- Large Language Model Text Generation Inference☆9,698Updated this week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆9,980Updated this week
- Go ahead and axolotl questions☆8,484Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,453Updated this week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"☆11,218Updated last month
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,587Updated this week