01-ai / Descartes
☆105Updated 9 months ago
Alternatives and similar repositories for Descartes:
Users that are interested in Descartes are comparing it to the libraries listed below
- Puck is a high-performance ANN search engine☆345Updated 2 months ago
- Byzer-retrieval is a distributed retrieval system which designed as a backend for LLM RAG (Retrieval Augmented Generation). The system su…☆45Updated 2 weeks ago
- ☆30Updated 10 months ago
- Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses v…☆254Updated last year
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆220Updated this week
- Easy, fast, and cheap pretrain,finetune, serving for everyone☆276Updated this week
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆75Updated 8 months ago
- Knowhere is a vector search engine, integrating FAISS, HNSW, DiskANN.☆203Updated this week
- Mixture-of-Experts (MoE) Language Model☆183Updated 4 months ago
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆236Updated 10 months ago
- Imitate OpenAI with Local Models☆85Updated 5 months ago
- vsag is a vector indexing library used for similarity search.☆212Updated this week
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆36Updated 8 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆35Updated last month
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆59Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆127Updated last month
- ☆28Updated 5 months ago
- 纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,☆45Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆133Updated 9 months ago
- Efficient AI Inference & Serving☆464Updated last year
- ☆105Updated last year
- ☆311Updated last week
- 中文原生检索增强生成测评基准☆107Updated 9 months ago
- AGI模块库架构图☆75Updated last year
- ☆385Updated last year
- bisheng-unstructured library☆41Updated 2 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆607Updated last week
- Implement OpenAI APIs and plugin-enabled ChatGPT with open source LLM and other models.☆121Updated 7 months ago
- A flexible and efficient training framework for large-scale alignment tasks☆281Updated this week
- The Multi-Faceted Optimizer for GenAI Workflows