tensorchord / modelz-ChatGLM
Deploy ChatGLM on Modelz
☆15Updated last year
Alternatives and similar repositories for modelz-ChatGLM:
Users that are interested in modelz-ChatGLM are comparing it to the libraries listed below
- This repository contains statistics about the AI Infrastructure products.☆18Updated last week
- setup the env for vllm users☆16Updated last year
- Byzer-retrieval is a distributed retrieval system which designed as a backend for LLM RAG (Retrieval Augmented Generation). The system su…☆47Updated this week
- The minimal, ad-hoc way of plug and play NebulaGraph with pip install, even inside Colab Notebook!☆16Updated 9 months ago
- ☆31Updated 11 months ago
- 本插件是将faiss集成到greenplum数据库中,以提供向量召回的能力。☆21Updated 2 years ago
- Unit Minions 的各种数据准备、处理脚本,诸如 OpenAI 处理、格式转换等等。☆14Updated last year
- OpenAI compatible API for open source LLMs☆15Updated last year
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated last year
- ☆23Updated last year
- The driver for LMCache core to run in vLLM☆32Updated last month
- ☆35Updated 3 years ago
- Yet another coding assistant powered by LLM.☆15Updated 5 months ago
- A memory efficient DLRM training solution using ColossalAI☆103Updated 2 years ago
- Compare different hardware platforms via the Roofline Model for LLM inference tasks.☆92Updated 11 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆63Updated 11 months ago
- ☆18Updated 11 months ago
- Sentence Embedding as a Service☆15Updated last year
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆117Updated 2 weeks ago
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆40Updated 4 months ago
- fastertransformer for codegeex model☆63Updated last year
- ☆16Updated 2 years ago
- sqlgpt-parser is a Python implementation of an SQL parser that effectively converts SQL statements into Abstract Syntax Trees (AST). By l…☆28Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- Summary of system papers/frameworks/codes/tools on training or serving large model☆56Updated last year
- ☆17Updated last year
- Evaluation for AI apps and agent☆36Updated last year
- Some microbenchmarks and design docs before commencement☆12Updated 4 years ago
- Tutorial for LLM developers about engine design, service deployment, evaluation/benchmark, etc. Provide a C/S style optimized LLM inferen…☆19Updated last year