xllamacpp - a Python wrapper of llama.cpp
☆78Apr 27, 2026Updated last week
Alternatives and similar repositories for xllamacpp
Users that are interested in xllamacpp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆25Updated this week
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆54Mar 11, 2025Updated last year
- 编译扣子空间生成的 jsx 网页,方便部署到自己的服务器☆15Apr 29, 2025Updated last year
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14May 26, 2024Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Feb 10, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda☆23Apr 21, 2026Updated 2 weeks ago
- Minimal web client for chatting and roleplay with AI characters☆26Aug 21, 2025Updated 8 months ago
- A simple, easy-to-customize pipeline for local RAG evaluation. Starter prompts and metric definitions included.☆25Jan 14, 2026Updated 3 months ago
- Simple node proxy for llama-server that enables MCP use☆18May 10, 2025Updated 11 months ago
- Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs☆29Dec 17, 2024Updated last year
- A production-ready platform for dynamic AI agents — plan, use tools, and complete real work without hardcoded workflows.☆235Updated this week
- ☆33Jul 12, 2018Updated 7 years ago
- agentcp是一个基于ACP协议的Agent sdk,用于解决Agent间的身份认证及通信问题;用于创建AID、连接入网、构建会话,收发消息等;支持多Agent协作,异步消息处理,支持内网穿透,支持Agent访问的负载均衡☆38Feb 27, 2026Updated 2 months ago
- A simple thermal camera for ESP32 TTGO T-Display module using AMG8833 sensor☆10Nov 14, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Model Server Template. Used to expose custom models to the LangSmith Playground☆17Jun 14, 2024Updated last year
- Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p…☆9,281Updated this week
- ☆48Aug 29, 2024Updated last year
- A tool for testing and comparing the performance of different Large Language Model APIs. 一个用于测试和比较不同大语言模型API性能的工具。☆42Dec 9, 2025Updated 4 months ago
- 基于langchain设计的智能体任务,包含规划会话场景资源,构建子任务,任务执行器包含(MCTS)☆33Nov 10, 2025Updated 5 months ago
- CanvasAnvil is an AI multi-canvas creation platform for flowcharts, interior design, presentations, posters, infographics, and product st…☆76Apr 25, 2026Updated last week
- The library for character-driven AI experiences.☆87May 15, 2024Updated last year
- ☆12Sep 29, 2024Updated last year
- A modern, single-page web chat interface for local LLMs (Large Language Models), inspired by the visual style and UX of Anthropic's Claud…☆32May 11, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Drag-and-drop, grouping, sorting bookmarklet plugin☆30May 6, 2025Updated last year
- This is Asynchronous HTTP and WebSocket Server Library for WT32_ETH01 (ESP32 + LAN8720). Now supporting using CString to save heap to sen…☆18Dec 5, 2022Updated 3 years ago
- Analyze Reddit posts☆31Feb 27, 2025Updated last year
- An Anime and Manga Search List built with VueJS and TailwindCSS powered by the Anilist API.☆13Oct 7, 2022Updated 3 years ago
- llama-swap + a minimal ollama compatible api☆57Mar 14, 2026Updated last month
- ☆30Apr 29, 2026Updated last week
- ☆13Nov 24, 2025Updated 5 months ago
- ☆23Mar 26, 2026Updated last month
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆35Aug 20, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Easy to install Text to Speech system for Raspberry Pi 4☆17Mar 4, 2024Updated 2 years ago
- This plugin allows to search between two dates on the issue status changes over the history of the issue.☆10Nov 13, 2024Updated last year
- MegaStyle, 面向一致性与多样性的可扩展风格数据生成框架☆97Apr 23, 2026Updated last week
- This is a downloader of NetEaseMusic (http://music.163.com)☆16Dec 23, 2024Updated last year
- ArrayViews: creating specific views to array storage objects☆16Feb 6, 2019Updated 7 years ago
- ☆11Nov 10, 2024Updated last year
- A simple http server in a container☆11Oct 29, 2019Updated 6 years ago