xorbitsai / xllamacppLinks
xllamacpp - a Python wrapper of llama.cpp
☆60Updated last week
Alternatives and similar repositories for xllamacpp
Users that are interested in xllamacpp are comparing it to the libraries listed below
Sorting:
- A third-party component library based on Gradio. Integrates Ant Design, Ant Design X, and more advanced components to help you build appl…☆125Updated last week
- Auto Thinking Mode switch for Qwen3 in Open webui☆68Updated 5 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆166Updated 3 months ago
- GLM Series Edge Models☆149Updated 4 months ago
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆57Updated 11 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆41Updated 6 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 8 months ago
- Its an open source LLM based on MOE Structure.☆58Updated last year
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆67Updated last year
- ☆93Updated 3 months ago
- ☆180Updated last month
- ☆296Updated 4 months ago
- ☆130Updated 6 months ago
- ☆112Updated last year
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆155Updated last year
- CursorCore: Assist Programming through Aligning Anything☆131Updated 8 months ago
- LM inference server implementation based on *.cpp.☆286Updated 2 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆78Updated last year
- 我们是第一个完全可商用的角色大模型。☆40Updated last year
- Library for model distillation☆152Updated last month
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆105Updated 2 months ago
- Deep Reasoning Translation (DRT) Project☆233Updated last month
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
- Mixture-of-Experts (MoE) Language Model☆189Updated last year
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆80Updated 9 months ago
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆191Updated 2 weeks ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆146Updated 4 months ago
- Jina DeepSearch UI☆126Updated last month
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated last year
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆30Updated last month