xorbitsai / xllamacpp
xllamacpp - a Python wrapper of llama.cpp
☆34Updated this week
Alternatives and similar repositories for xllamacpp:
Users that are interested in xllamacpp are comparing it to the libraries listed below
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 2 months ago
- Jina DeepSearch UI☆95Updated this week
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆99Updated this week
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆21Updated this week
- GLM Series Edge Models☆136Updated 2 months ago
- ☆85Updated last month
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆27Updated 7 months ago
- A third-party component library based on Gradio.☆95Updated last week
- Real time faster whisper gradio☆26Updated 6 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆72Updated 9 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆36Updated 7 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆56Updated 4 months ago
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆100Updated this week
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆146Updated 6 months ago
- Its an open source LLM based on MOE Structure.☆58Updated 9 months ago
- ☆59Updated last year
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆43Updated 3 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆30Updated 2 weeks ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated 7 months ago
- ☆58Updated 6 months ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆42Updated 9 months ago
- Receipts for creating AI Applications with APIs from DashScope (and friends)!☆51Updated 6 months ago
- ☆53Updated 10 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆78Updated 3 months ago
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆57Updated 5 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆95Updated 6 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- A Next.js version of Claude Aritfacts , inspired by llamacoder☆22Updated 6 months ago
- 我们是第一个完全可商用的角色大模型。☆39Updated 8 months ago
- ☆51Updated 8 months ago