Codys12 / airllmLinks

AirLLM 70B inference with single 4GB GPU

☆14

Alternatives and similar repositories for airllm

Users that are interested in airllm are comparing it to the libraries listed below

Sorting:

rodrigobaron / anthill
☆24Updated 6 months ago
the-crypt-keeper / tcurtsni
Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?
☆22Updated last year
astramind-ai / Pulsar
The hearth of The Pulsar App, fast, secure and shared inference with modern UI
☆55Updated 8 months ago
perk11 / large-model-proxy
Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…
☆67Updated last month
severian42 / Computational-Model-for-Symbolic-Representations
Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …
☆50Updated 5 months ago
FishiaTee / yawullm
Yet Another (LLM) Web UI, made with Gemini
☆12Updated 7 months ago
wavify-labs / wavify-sdks
fast state-of-the-art speech models and a runtime that runs anywhere 💥
☆55Updated last month
charmandercha / ArchiDoc
☆17Updated 7 months ago
calmstate / VisualTagger
Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…
☆10Updated 9 months ago
FishiaTee / Tumera
Yet another frontend for LLM, written using .NET and WinUI 3
☆10Updated 8 months ago
monk1337 / auto-ollama
run ollama & gguf easily with a single command
☆52Updated last year
deepgrove-ai / Bonsai
☆27Updated 4 months ago
QuixiAI / kraken
☆66Updated last year
nath1295 / MLX-Textgen
A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.
☆90Updated last month
samuel-vitorino / lm.rs-webui
Light WebUI for lm.rs
☆24Updated 9 months ago
kyegomez / Exa
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…
☆26Updated 8 months ago
beyondExp / B-Llama3-o
B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.
☆26Updated last year
tolitius / towel
"a towel is about the most massively useful thing an interstellar AI hitchhiker can have"
☆48Updated 9 months ago
bjj / exllamav2-openai-server
An OpenAI API compatible LLM inference server based on ExLlamaV2.
☆25Updated last year
catena-labs / moa-llm
A Python library to orchestrate LLMs in a neural network-inspired structure
☆49Updated 10 months ago
abgulati / hf-waitress
Serving LLMs in the HF-Transformers format via a PyFlask API
☆71Updated 10 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26Updated 9 months ago
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 8 months ago
pnmartinez / simple-computer-use
Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below.
☆27Updated last month
severian42 / Proteus-The-Genesis-LLM
Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine
☆23Updated 7 months ago
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
andrewginns / CoT-at-Home
Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint.
☆43Updated 10 months ago
lorenzodimauro97 / FileCollector
☆15Updated last week
QuixiAI / dolphin-logger
☆102Updated last month
uukuguy / speechless
LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.
☆104Updated last week