xorbitsai / xllamacppLinks
xllamacpp - a Python wrapper of llama.cpp
☆66Updated last week
Alternatives and similar repositories for xllamacpp
Users that are interested in xllamacpp are comparing it to the libraries listed below
Sorting:
- GLM Series Edge Models☆156Updated 6 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆70Updated 7 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆41Updated 8 months ago
- Library for model distillation☆158Updated 3 months ago
- ☆188Updated 2 weeks ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆184Updated 2 weeks ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Updated 10 months ago
- A third-party component library based on Gradio. Integrates Ant Design, Ant Design X, Monaco Editor and more advanced components to help…☆132Updated last month
- ☆94Updated 5 months ago
- 我们是第一个完全可商用的角色大模型。☆40Updated last year
- Its an open source LLM based on MOE Structure.☆58Updated last year
- Dify 1.0 Plugin Convert your Dify tools's API to MCP compatible API☆23Updated 7 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated last year
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆128Updated 4 months ago
- ☆113Updated last year
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆147Updated 6 months ago
- bisheng-unstructured library☆56Updated 7 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆79Updated last year
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆82Updated 11 months ago
- ☆374Updated this week
- Enjoy easier conversations with LLM☆47Updated 9 months ago
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆59Updated last year
- Tencent Hunyuan 7B (short as Hunyuan-7B) is one of the large language dense models of Tencent Hunyuan☆69Updated 4 months ago
- ☆19Updated last year
- Mixture-of-Experts (MoE) Language Model☆192Updated last year
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆70Updated last year
- ☆133Updated 8 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆29Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆107Updated 4 months ago