Verdagon / Anima
33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU
☆11Updated 4 months ago
Related projects: ⓘ
- Local LLM inference & management server with built-in OpenAI API☆30Updated 5 months ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated last year
- Integrates AI tools into Microsoft® Word® (independently developed, not affiliated with Microsoft)☆20Updated this week
- AirLLM 70B inference with single 4GB GPU☆11Updated last month
- Minimal Gemma inference code in Rust☆24Updated 2 weeks ago
- ☆40Updated last year
- Demo of an "always-on" AI assistant.☆23Updated 7 months ago
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of…☆44Updated last month
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆11Updated 2 weeks ago
- ☆29Updated 4 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆19Updated 2 months ago
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported)☆29Updated this week
- Course Project for COMP4471 on RWKV☆16Updated 7 months ago
- ☆55Updated last month
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng…☆63Updated 3 weeks ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆24Updated 2 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆30Updated 3 months ago
- Like system requirements lab but for LLMs☆30Updated last year
- One Line To Build Zero-Data Classifiers in Minutes☆29Updated 3 weeks ago
- A bot that checks your grammar and phrasing using LLM of choice☆27Updated 3 months ago
- llama.cpp clone with additional SOTA quants and improved CPU performance☆57Updated this week
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- ☆28Updated this week
- A QT GUI for large language models☆23Updated 8 months ago
- Flexible Python package for managing and extending LLM based agents☆25Updated last year
- ☆24Updated 3 weeks ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 3 months ago
- ☆15Updated 6 months ago
- Add-on for the Web Search extension that provides the web browsing capabilities without the need for Extras API.☆22Updated 4 months ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆45Updated 10 months ago