vproxy-tools / ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
☆31Updated 2 weeks ago
Alternatives and similar repositories for ktransformers:
Users that are interested in ktransformers are comparing it to the libraries listed below
- Efficient inference of large language models.☆146Updated 3 months ago
- ☆13Updated 2 weeks ago
- ai法律团队☆40Updated 3 months ago
- 一个网络安全法律法规、安全政策、国家标准、行业标准知识库。A knowledge base of cybersecurity laws and regulations, security policies, national standards, and industry …☆90Updated 4 months ago
- Janus-Series: Unified Multimodal Understanding and Generation Models forked from deepseek-ai/Janus☆17Updated 2 months ago
- ☆53Updated last year
- ☆124Updated last month
- ☆81Updated last month
- 用于kokoro TTS的webui界面和兼容openai api☆30Updated last month
- 电子鹦鹉 / Toy Language Model☆162Updated last week
- Simple script to quickly implement DDNS based on CloudFlare.☆16Updated last month
- 哈基米 一个分布式蜜网系统 | hachimi A Distributed Honeypot System☆174Updated 2 months ago
- 助你实现Ollama自由,配合FOFA等搜索引擎体验更佳☆237Updated 3 weeks ago
- Moling is a computer-used MCP Server that implements system interaction through operating system APIs. It is a dependency-free local offi…☆118Updated this week
- support BM25+vecetor☆26Updated 4 months ago
- transformer安全相关☆30Updated 5 months ago
- Compile & run a single CUDA file on the cloud GPUs☆14Updated 6 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆10Updated 8 months ago
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,053Updated this week
- A duolingo opensource alternative. RIP Duo🕯️☆62Updated last month
- 🏠 将小爱音箱接入 ChatGPT 和OpenCamera,改造成你的专属语音助手。☆50Updated 9 months ago
- 百度QA100万数据集☆47Updated last year
- run DeepSeek-R1 GGUFs on KTransformers☆212Updated last month
- run chatglm3-6b in BM1684X☆38Updated last year
- Phi3 中文后训练模型仓库☆320Updated 4 months ago
- ChatGLM-6B-Slim:裁减掉20K图片Token的ChatGLM-6B,完全一样的性能,占用更小的显存。☆126Updated last year
- 网络包测试工具☆44Updated 11 months ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆28Updated 10 months ago
- 🤗 这里提出一种可能的提高逆向成本的方案,希望厂商们让free-api早日终结👋☆72Updated last year
- ☆107Updated last year