xorbitsai / xllamacpp
xllamacpp - a Python wrapper of llama.cpp
☆36Updated last week
Alternatives and similar repositories for xllamacpp
Users who are interested in xllamacpp are comparing it to the libraries listed below.
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆115Updated this week
- Auto Thinking Mode switch for Qwen3 in Open webui☆61Updated 3 weeks ago
- A third-party component library based on Gradio.☆100Updated this week
- An open-source LLM based on an MoE structure.☆58Updated 10 months ago
- GLM Series Edge Models☆139Updated 3 months ago
- Jina DeepSearch UI☆107Updated 2 weeks ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆38Updated 5 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆79Updated 5 months ago
- An open-source agentic workflow framework/showcase that turns chat text into control actions, powered by the Agently AI application development framework.☆28Updated 8 months ago
- Evaluation for AI apps and agents☆41Updated last year
- ☆112Updated last month
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆34Updated last month
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆19Updated 3 weeks ago
- ☆88Updated 2 months ago
- LM inference server implementation based on *.cpp.☆198Updated this week
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆188Updated last month
- ☆142Updated 3 months ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆44Updated 4 months ago
- ☆19Updated 8 months ago
- Qwen GRPO Graph Extraction RL Finetune☆49Updated last month
- ☆51Updated 10 months ago
- We are the first fully commercially usable character (role-play) large model.☆40Updated 9 months ago
- Delta-CoMe can achieve near-lossless 1-bit compression; accepted by NeurIPS 2024.☆57Updated 6 months ago
- bisheng-unstructured library☆48Updated last week
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated last year
- Converts documents (pdf/html/doc/docx/ppt/pptx) to Markdown☆43Updated 10 months ago
- A lightweight script for converting HTML pages to Markdown format, with support for code blocks☆79Updated last year
- GLM-4 series: Open Multilingual Multimodal Chat LMs☆26Updated last month
- A Toolkit for Running On-device Large Language Models (LLMs) in Apps☆72Updated 10 months ago