AMD-AGI / InstellaLinks
Fully Open Language Models with Stellar Performance
☆303Updated 2 weeks ago
Alternatives and similar repositories for Instella
Users that are interested in Instella are comparing it to the libraries listed below
Sorting:
- ☆141Updated last month
- ☆190Updated last year
- Sparse Inferencing for transformer based LLMs☆213Updated 3 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆225Updated last year
- Benchmark and optimize LLM inference across frameworks with ease☆138Updated 2 months ago
- Pivotal Token Search☆131Updated 4 months ago
- GRadient-INformed MoE☆264Updated last year
- Simple & Scalable Pretraining for Neural Architecture Research☆302Updated last month
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆327Updated last year
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.☆92Updated this week
- Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model …☆578Updated this week
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆217Updated last week
- PyTorch implementation of models from the Zamba2 series.☆185Updated 10 months ago
- Train, tune, and infer Bamba model☆136Updated 5 months ago
- Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model☆252Updated 6 months ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B☆506Updated last week
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆247Updated 3 weeks ago
- No-code CLI designed for accelerating ONNX workflows☆216Updated 5 months ago
- LocalScore is an open benchmark which helps you understand how well your computer can handle local AI tasks.☆72Updated 2 months ago
- Everything you need to know about LLM inference☆245Updated this week
- ☆268Updated 5 months ago
- 👷 Build compute kernels☆186Updated this week
- ☆234Updated 4 months ago
- ☆1,215Updated last week
- Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Eve…☆165Updated this week
- ☆703Updated last week
- ScalarLM - a unified training and inference stack☆93Updated last week
- Docs for GGUF quantization (unofficial)☆319Updated 4 months ago
- ☆1,022Updated last week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆99Updated 6 months ago