xllamacpp - a Python wrapper of llama.cpp
☆82May 10, 2026Updated 2 weeks ago
Alternatives and similar repositories for xllamacpp
Users that are interested in xllamacpp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆28May 17, 2026Updated last week
- ☆13Mar 10, 2025Updated last year
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆31Dec 11, 2025Updated 5 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆54Mar 11, 2025Updated last year
- 编译扣子空间生成的 jsx 网页,方便部署到自己的服务器☆15Apr 29, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆27Dec 15, 2024Updated last year
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14May 26, 2024Updated 2 years ago
- Chatbot-to-speech using Orpheus TTS model. Interactive console app.☆21May 1, 2025Updated last year
- An Open source controller to convert any servo motor to the best smart servo. Servomotor Hack☆39Jul 19, 2024Updated last year
- ☆16Dec 16, 2024Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- ☆22Jun 19, 2024Updated last year
- Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda☆23May 10, 2026Updated 2 weeks ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Nov 15, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆22Oct 2, 2024Updated last year
- agentcp是一个基于ACP协议的Agent sdk,用于解决Agent间的身份认证及通信问题;用于创建AID、连接入网、构建会话,收发消息等;支持多Agent协作,异步消息处理,支持内网穿透,支持Agent访问的负载均衡☆38Feb 27, 2026Updated 3 months ago
- Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p…☆9,312Updated this week
- ☆47Aug 29, 2024Updated last year
- A tool for testing and comparing the performance of different Large Language Model APIs. 一个用于测试和比较不同大语言模型API性能的工具。☆42Dec 9, 2025Updated 5 months ago
- 基于langchain设计的智能体任务,包含规划会话场景资源,构建子任务,任务执行器包含(MCTS)☆33Nov 10, 2025Updated 6 months ago
- A simple GUI to show shot boundary detection based on TransNet V2.☆30Dec 5, 2020Updated 5 years ago
- CanvasAnvil is an AI multi-canvas creation platform for flowcharts, interior design, presentations, posters, infographics, and product st…☆81May 5, 2026Updated 3 weeks ago
- ☆12Sep 29, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Analyze Reddit posts☆31Feb 27, 2025Updated last year
- An Anime and Manga Search List built with VueJS and TailwindCSS powered by the Anilist API.☆13Oct 7, 2022Updated 3 years ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆37Mar 26, 2024Updated 2 years ago
- ☆10Dec 10, 2024Updated last year
- ☆13Nov 24, 2025Updated 6 months ago
- Weaviate's own language vectorizer, which allows for semantic context-based searches in Weaviate☆17Jan 17, 2024Updated 2 years ago
- ☆23Mar 26, 2026Updated 2 months ago
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆36Aug 20, 2025Updated 9 months ago
- This is a downloader of NetEaseMusic (http://music.163.com)☆16Dec 23, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ArrayViews: creating specific views to array storage objects☆16Feb 6, 2019Updated 7 years ago
- ☆16Apr 1, 2025Updated last year
- Collect orderbook data from crypto exchanges and publish as GRPC☆13Jun 19, 2022Updated 3 years ago
- Automatic Differentiation for Gradient Boosted Decision Trees.☆13May 17, 2022Updated 4 years ago
- LLM inference in C/C++☆119Updated this week
- 哔哩哔哩应援团&私信聊天机器人☆16Aug 14, 2019Updated 6 years ago
- The node has been created with an objective of identity consistency for FLUX.2 klein 9b models in ComfyUI.☆56May 13, 2026Updated 2 weeks ago