xorbitsai / xllamacppLinks
xllamacpp - a Python wrapper of llama.cpp
☆49Updated last week
Alternatives and similar repositories for xllamacpp
Users that are interested in xllamacpp are comparing it to the libraries listed below
Sorting:
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆150Updated last month
- ☆91Updated last month
- Auto Thinking Mode switch for Qwen3 in Open webui☆67Updated 3 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 6 months ago
- A third-party component library based on Gradio.☆114Updated last week
- GLM Series Edge Models☆147Updated 2 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆37Updated 4 months ago
- Jina DeepSearch UI☆122Updated last month
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆77Updated last year
- Its an open source LLM based on MOE Structure.☆58Updated last year
- Library for model distillation☆150Updated 6 months ago
- ☆290Updated 2 months ago
- An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs.☆80Updated 2 weeks ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆81Updated 7 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆213Updated 2 months ago
- Deep Reasoning Translation (DRT) Project☆227Updated 2 months ago
- Real time faster whisper gradio☆26Updated this week
- ☆127Updated 4 months ago
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆153Updated 10 months ago
- LM inference server implementation based on *.cpp.☆255Updated this week
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆156Updated this week
- Dify 1.0 Plugin Convert your Dify tools's API to MCP compatible API☆23Updated 3 months ago
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆106Updated 3 weeks ago
- agentcp是一个基于ACP协议的Agent sdk,用于解决Agent间的身份认证及通信问题;用于创建AID、连接入网、构建会话,收发消息等;支持多Agent协作,异步消息处理,支持内网穿透,支持Agent访问的负载均衡☆13Updated last month
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆56Updated 9 months ago
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆112Updated last week
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- ☆104Updated 2 weeks ago
- 我们是第一个完全可商用的角色大模型。☆40Updated last year
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆74Updated 9 months ago