AMD-AGI / InstellaLinks

Fully Open Language Models with Stellar Performance

☆303

Alternatives and similar repositories for Instella

Users that are interested in Instella are comparing it to the libraries listed below

Sorting:

ibm-granite / granite-4.0-language-models
☆141Updated last month
AMD-AGI / AMD-LLM
☆190Updated last year
NimbleEdge / sparse_transformers
Sparse Inferencing for transformer based LLMs
☆213Updated 3 months ago
exo-explore / mlx-bitnet
1.58 Bit LLM on Apple Silicon using MLX
☆225Updated last year
bentoml / llm-optimizer
Benchmark and optimize LLM inference across frameworks with ease
☆138Updated 2 months ago
codelion / pts
Pivotal Token Search
☆131Updated 4 months ago
microsoft / GRIN-MoE
GRadient-INformed MoE
☆264Updated last year
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆302Updated last month
SakanaAI / evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
☆327Updated last year
sgl-project / sgl-project.github.io
This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.
☆92Updated this week
huawei-csl / SINQ
Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model …
☆578Updated this week
scaleapi / SWE-bench_Pro-os
SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?
☆217Updated last week
Zyphra / Zamba2
PyTorch implementation of models from the Zamba2 series.
☆185Updated 10 months ago
foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆136Updated 5 months ago
antimatter15 / reverse-engineering-gemma-3n
Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model
☆252Updated 6 months ago
WeiboAI / VibeThinker
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
☆506Updated last week
SearchSavior / OpenArc
Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.
☆247Updated 3 weeks ago
onnx / turnkeyml
No-code CLI designed for accelerating ONNX workflows
☆216Updated 5 months ago
cjpais / LocalScore
LocalScore is an open benchmark which helps you understand how well your computer can handle local AI tasks.
☆72Updated 2 months ago
bentoml / llm-inference-handbook
Everything you need to know about LLM inference
☆245Updated this week
ibm-granite / granite-3.0-language-models
☆268Updated 5 months ago
huggingface / kernel-builder
👷 Build compute kernels
☆186Updated this week
google / lmeval
☆234Updated 4 months ago
MoonshotAI / Kimi-Linear
☆1,215Updated last week
swiss-ai / mmore
Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Eve…
☆165Updated this week
NVlabs / Jet-Nemotron
☆703Updated last week
tensorwavecloud / ScalarLM
ScalarLM - a unified training and inference stack
☆93Updated last week
iuliaturc / gguf-docs
Docs for GGUF quantization (unofficial)
☆319Updated 4 months ago
deepseek-ai / DeepSeek-V3.2-Exp
☆1,022Updated last week
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆99Updated 6 months ago