01-ai / DescartesLinks

☆112

Alternatives and similar repositories for Descartes

Users that are interested in Descartes are comparing it to the libraries listed below

Sorting:

allwefantasy / BYZER-RETRIEVAL
Byzer-retrieval is a distributed retrieval system which designed as a backend for LLM RAG (Retrieval Augmented Generation). The system su…
☆49Updated 7 months ago
OpenCSGs / llm-inference
llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…
☆86Updated last year
allwefantasy / byzer-llm
Easy, fast, and cheap pretrain,finetune, serving for everyone
☆315Updated 3 months ago
shootime2021 / APUS-xDAN-4.0-moe
Its an open source LLM based on MOE Structure.
☆58Updated last year
xverse-ai / XVERSE-65B
XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.
☆140Updated last year
inferflow / inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
☆248Updated last year
xorbitsai / xllamacpp
xllamacpp - a Python wrapper of llama.cpp
☆60Updated last week
dataelement / bisheng-unstructured
bisheng-unstructured library
☆55Updated 5 months ago
zilliztech / akcio
Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses v…
☆258Updated last year
xverse-ai / XVERSE-MoE-A4.2B
XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.
☆39Updated last year
modelscope / dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …
☆266Updated 2 months ago
hpcaitech / SwiftInfer
Efficient AI Inference & Serving
☆478Updated last year
IEIT-Yuan / Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
☆189Updated last year
SomeoneKong / llm_long_context_bench202405
☆29Updated last year
hyperai / vllm-cn
vLLM Documentation in Chinese Simplified / vLLM 中文文档
☆114Updated last week
allwefantasy / byzer-agent
☆32Updated last year
zai-org / GLM-Edge
GLM Series Edge Models
☆149Updated 4 months ago
HFAiLab / hai-platform-studio
配合 HAI Platform 使用的集成化用户界面
☆53Updated 2 years ago
zzlgreat / smart_agent
☆106Updated 2 years ago
OpenBMB / MobileCPM
A Toolkit for Running On-device Large Language Models (LLMs) in APP
☆78Updated last year
infinigence / InfiniWebSearch
A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.
☆39Updated 10 months ago
baidu / puck
Puck is a high-performance ANN search engine
☆364Updated 4 months ago
QwenLM / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆138Updated 10 months ago
WangRongsheng / Aurora
The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
☆264Updated last year
tpoisonooo / ROGRAG
[ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework
☆175Updated 3 weeks ago
wey-gu / grpo-graph-extraction
Qwen GRPO Graph Extraction RL Finetune
☆57Updated 6 months ago
viitrix / vt-transformer
Transformer framework for edge computing based on C++.
☆128Updated 11 months ago
thunlp / Delta-CoMe
Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024
☆57Updated 11 months ago
shell-nlp / gpt_server
gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。
☆213Updated last week
the-seeds / imitater
Imitate OpenAI with Local Models
☆88Updated last year