Codys12 / airllm
AirLLM 70B inference with single 4GB GPU
☆12Updated 6 months ago
Alternatives and similar repositories for airllm:
Users that are interested in airllm are comparing it to the libraries listed below
- ☆24Updated 3 weeks ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆46Updated last month
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆15Updated 3 months ago
- Yet Another (LLM) Web UI, made with Gemini☆11Updated last month
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆55Updated 2 months ago
- run ollama & gguf easily with a single command☆49Updated 9 months ago
- Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint.☆41Updated 5 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 3 months ago
- Light WebUI for lm.rs☆23Updated 4 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 4 months ago
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆21Updated 6 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 2 months ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13Updated 9 months ago
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆54Updated 2 weeks ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 2 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆31Updated 7 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- ☆27Updated 5 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆53Updated this week
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆21Updated 2 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆72Updated 2 months ago
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the …☆51Updated this week
- ☆16Updated 2 months ago
- ☆28Updated 4 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Updated 2 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Updated 8 months ago
- fork of litellm that is open source☆16Updated 2 months ago
- A unified library for interacting with various AI APIs through a standardized interface.☆28Updated 2 weeks ago