vproxy-tools / ktransformersLinks
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
☆40Updated last month
Alternatives and similar repositories for ktransformers
Users that are interested in ktransformers are comparing it to the libraries listed below
Sorting:
- ☆17Updated 2 months ago
- Janus-Series: Unified Multimodal Understanding and Generation Models forked from deepseek-ai/Janus☆17Updated 4 months ago
- run DeepSeek-R1 GGUFs on KTransformers☆234Updated 3 months ago
- Efficient inference of large language models.☆148Updated this week
- An AI agent to control drones☆109Updated this week
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,134Updated this week
- ☆132Updated 3 months ago
- The inference code of RVC-Boss/GPT-SoVITS that can be developer-friendly.☆15Updated 8 months ago
- Transformer framework for edge computing based on C++.☆124Updated 6 months ago
- Compile & run a single CUDA file on the cloud GPUs☆14Updated 8 months ago
- 支持中文场景的的小语言模型 llama2.c-zh☆147Updated last year
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆254Updated last week
- ☆108Updated last year
- LM inference server implementation based on *.cpp.☆203Updated last week
- support BM25+vecetor☆29Updated last week
- a huggingface mirror site.☆289Updated last year
- ai法律团队☆42Updated 5 months ago
- run chatglm3-6b in BM1684X☆39Updated last year
- 🤗 这里提出一种可能的提高逆向成本的方案,希望厂商们让free-api早日终结👋☆74Updated last year
- Its an open source LLM based on MOE Structure.☆58Updated 11 months ago
- CPU inference for the DeepSeek family of large language models in C++☆300Updated this week
- GLM Series Edge Models☆142Updated 3 months ago
- Simple script to quickly implement DDNS based on CloudFlare.☆16Updated 2 months ago
- 视频理解:千问视频多模态模型 & Dify☆58Updated 9 months ago
- Awesome Code Action - DeepWebSearch AgentKit App. Build with 🤗 Hugging Face smolagents framework☆62Updated last week
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆164Updated 6 months ago
- 1MB size, 100% Coverage, Use more expressive YAML / JSON to manage your Config files. --- 1MB大小,100% 测试覆盖,使用更具表现力的YAML / JSON来管理您的配置文件。☆63Updated 3 months ago
- FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs☆49Updated last month
- ☆71Updated 2 months ago
- A Python Package to Access World-Class Generative Models☆128Updated 11 months ago