stepfun-ai / vllmLinks
A high-throughput and memory-efficient inference and serving engine for LLMs
☆17Updated 4 months ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆12Updated last year
- ☆40Updated 3 months ago
- xllamacpp - a Python wrapper of llama.cpp☆44Updated last week
- A game of pong made by MetaGPT and ChatGPT Code Interpreter☆14Updated last year
- mirror of https://huggingface.co/spaces/enzostvs/deepsite☆76Updated 3 months ago
- ☆30Updated last year
- Codai is an AI programming tool that boosts coding efficiency and empowers non-programmers. Its future plans include introducing a local …☆20Updated last week
- The official GitHub Page for MiniMax☆47Updated last week
- a custom comfyui node for fish-speech☆39Updated last year
- ☆82Updated 2 weeks ago
- App-Controller: Allow users to manipulate your App with natural language☆133Updated 7 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆40Updated last month
- Wan 2.1 AI Video Generator Web UI☆40Updated 4 months ago
- A open version Manus.☆59Updated 3 months ago
- AI model that understands text & humanoids.☆114Updated last month
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆24Updated last year
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆26Updated 3 weeks ago
- Wan2.1, quantized and optimized so it fits on your 3090/4090☆33Updated 4 months ago
- Memory Management for the GPU Poor, run the latest open source frontier models on consumer Nvidia GPUs☆124Updated last month
- Project Page of SignLLM: Sign Languages Production Large Language Models.☆42Updated this week
- Diffusers Image Fill v3 -- Inpaint or Remove objects from an image - or Outpaint - or Outpaint Video Zoom: 16GB+ GPU | 32GB+ RAM | 20GB+…☆13Updated 8 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆134Updated this week
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆27Updated this week
- Auto Thinking Mode switch for Qwen3 in Open webui☆66Updated 2 months ago
- ☆164Updated this week
- 基于youtube、bilibili等视频平台、webpage网页等,利用零一万物大模型或ollama本地小模型构建大语言模型高质量训练数据集(计划支持可自定义输出的训练数据格式)☆18Updated last year
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆36Updated 3 months ago
- CursorCore: Assist Programming through Aligning Anything☆127Updated 5 months ago
- GPT+神器,简单实用的一站式AGI架构,内置本地化,LLM模型,agent,矢量数据库,智能链chain☆48Updated 2 years ago
- DeepFloyd IF web UI☆30Updated 2 years ago