01-ai / Descartes
☆107Updated 11 months ago
Alternatives and similar repositories for Descartes:
Users that are interested in Descartes are comparing it to the libraries listed below
- Byzer-retrieval is a distributed retrieval system which designed as a backend for LLM RAG (Retrieval Augmented Generation). The system su…☆48Updated 2 weeks ago
- Easy, fast, and cheap pretrain,finetune, serving for everyone☆291Updated this week
- ☆32Updated last year
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆238Updated 2 weeks ago
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆79Updated 10 months ago
- ☆29Updated 6 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆37Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆132Updated 3 months ago
- Imitate OpenAI with Local Models☆88Updated 6 months ago
- 中文原生检索增强生成测评基准☆112Updated 11 months ago
- ☆105Updated last year
- Mixture-of-Experts (MoE) Language Model☆185Updated 6 months ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆238Updated last year
- Puck is a high-performance ANN search engine☆347Updated 4 months ago
- Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses v…☆255Updated last year
- Its an open source LLM based on MOE Structure.☆58Updated 8 months ago
- Qwen GRPO Graph Extraction RL Finetune☆40Updated last month
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆139Updated 11 months ago
- gpt_server是一个用于生产级部署LLMs或Embedding的开源框架。☆160Updated last week
- Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.☆141Updated last week
- Transformer framework for edge computing based on C++.☆125Updated 4 months ago
- GLM Series Edge Models☆132Updated last month
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated last year
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆55Updated 2 months ago
- Efficient AI Inference & Serving☆468Updated last year
- AGI模块库架构图☆75Updated last year
- 旨在对当前主流LLM进行一个直观、具体、标准的评测☆94Updated last year
- ☆113Updated last month
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆67Updated 8 months ago
- bisheng model services backend☆27Updated 8 months ago