AMD-AIG-AIMA / Instella
Fully Open Language Models with Stellar Performance
☆215 · Updated this week
Alternatives and similar repositories for Instella:
Users interested in Instella are comparing it to the repositories listed below.
- Lightweight Inference server for OpenVINO ☆144 · Updated this week
- A pure Rust LLM inference engine (supporting any LLM-based MLLM such as Spark-TTS), powered by the Candle framework ☆86 · Updated last week
- Editor with LLM generation tree exploration ☆65 · Updated last month
- ☆186 · Updated 7 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates ☆128 · Updated this week
- Lightweight Llama 3 8B Inference Engine in CUDA C ☆47 · Updated 2 weeks ago
- Run LLM Agents on Ryzen AI PCs in Minutes ☆288 · Updated last week
- MockLLM, when you want it to do what you tell it to do! ☆46 · Updated this week
- A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support ☆16 · Updated 4 years ago
- llama.cpp fork used by GPT4All ☆54 · Updated last month
- 1.58 Bit LLM on Apple Silicon using MLX ☆194 · Updated 10 months ago
- Kolosal AI is an open-source and lightweight alternative to LM Studio for running LLMs 100% offline on your device ☆172 · Updated last week
- Turns devices into a scalable LLM platform ☆127 · Updated this week
- ☆56 · Updated 8 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with a limited amount of VRAM/other resources by exposing them on differe… ☆55 · Updated last month
- TPI-LLM: Serving 70b-scale LLMs Efficiently on Low-resource Edge Devices ☆171 · Updated 4 months ago
- AI Tensor Engine for ROCm ☆150 · Updated this week
- A Python package for serving LLMs on OpenAI-compatible API endpoints with prompt caching using MLX ☆76 · Updated 3 months ago
- Moxin is a family of fully open-source and reproducible LLMs ☆85 · Updated 3 weeks ago
- A companion toolkit to pico-train for quantifying, comparing, and visualizing how language models evolve during training ☆48 · Updated 2 weeks ago
- LLM inference on consumer devices ☆103 · Updated 2 weeks ago
- llama.cpp fork with additional SOTA quants and improved performance ☆243 · Updated this week
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-… ☆92 · Updated last month
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a… ☆37 · Updated this week
- Granite 3.1 Language Models ☆98 · Updated 3 months ago
- Schola is a plugin for enabling Reinforcement Learning (RL) in Unreal Engine. It provides tools to help developers create environments, d… ☆34 · Updated this week
- Source code for Intel's Polite Guard NLP project ☆29 · Updated this week
- Rust framework for LLM orchestration ☆202 · Updated 8 months ago
- Testing LLM reasoning abilities with family relationship quizzes ☆62 · Updated 2 months ago
- See Through Your Models ☆372 · Updated 3 weeks ago