AMD-AGI / InstellaLinks
Fully Open Language Models with Stellar Performance
☆317Updated 2 months ago
Alternatives and similar repositories for Instella
Users that are interested in Instella are comparing it to the libraries listed below
Sorting:
- Welcome to the official repository of SINQ! A novel, fast and high-quality quantization method designed to make any Large Language Model …☆588Updated 2 weeks ago
- ☆191Updated last year
- Sparse Inferencing for transformer based LLMs☆218Updated 5 months ago
- Benchmark and optimize LLM inference across frameworks with ease☆158Updated 4 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆307Updated last month
- Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model☆262Updated 8 months ago
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang☆100Updated last week
- Pivotal Token Search☆144Updated last month
- Code for Bolmo: Byteifying the Next Generation of Language Models☆115Updated last month
- PyTorch implementation of models from the Zamba2 series.☆186Updated last year
- ☆463Updated 2 months ago
- LLM Inference on consumer devices☆128Updated 10 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆347Updated last year
- Train, tune, and infer Bamba model☆138Updated 7 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆251Updated 3 weeks ago
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆283Updated last week
- No-code CLI designed for accelerating ONNX workflows☆226Updated 7 months ago
- ScalarLM - a unified training and inference stack☆96Updated 2 months ago
- ☆165Updated last month
- ☆219Updated last year
- 👷 Build compute kernels☆214Updated last week
- 1.58 Bit LLM on Apple Silicon using MLX☆240Updated last year
- ☆237Updated 2 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆109Updated 8 months ago
- All information and news with respect to Falcon-H1 series☆106Updated 3 months ago
- A companion toolkit to pico-train for quantifying, comparing, and visualizing how language models evolve during training.☆110Updated 2 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆282Updated this week
- Efficient non-uniform quantization with GPTQ for GGUF☆58Updated 4 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆55Updated 5 months ago
- ☆269Updated 7 months ago