xorbitsai / xllamacppLinks
xllamacpp - a Python wrapper of llama.cpp
☆43Updated this week
Alternatives and similar repositories for xllamacpp
Users that are interested in xllamacpp are comparing it to the libraries listed below
Sorting:
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆124Updated 2 weeks ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆65Updated last month
- GLM Series Edge Models☆142Updated last week
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 4 months ago
- agentcp是一个基于ACP协议的Agent sdk,用于解决Agent间的身份认证及通信问题;用于创建AID、连接入网、构建会话,收发消息等;支持多Agent协作,异步消息处理,支持内网穿透,支持Agent访问的负载均衡☆12Updated last month
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆38Updated 6 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆79Updated 5 months ago
- 我们是第一个完全可商用的角色大模型。☆40Updated 10 months ago
- Its an open source LLM based on MOE Structure.☆58Updated 11 months ago
- A third-party component library based on Gradio.☆107Updated this week
- Awesome Code Action - DeepWebSearch AgentKit App. Build with 🤗 Hugging Face smolagents framework☆72Updated this week
- ☆269Updated 3 weeks ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
- ☆90Updated 3 months ago
- LM inference server implementation based on *.cpp.☆224Updated this week
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆20Updated last month
- Qwen GRPO Graph Extraction RL Finetune☆49Updated 2 months ago
- Jina DeepSearch UI☆114Updated this week
- ☆119Updated 2 months ago
- Library for model distillation☆144Updated 4 months ago
- ☆109Updated last year
- the official repo for E^2GraphRAG.☆47Updated 2 weeks ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated 9 months ago
- bisheng-unstructured library☆51Updated last month
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆57Updated 7 months ago
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆26Updated last week
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆73Updated 11 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆35Updated 2 months ago
- 研究GOT-OCR-项目落地加速,不限语言☆60Updated 7 months ago
- Dify 1.0 Plugin Convert your Dify tools's API to MCP compatible API☆20Updated last month