ensemble-core / NdLinearLinks
NdLinear by Ensemble is a drop-in PyTorch module that shrinks your models with no accuracy loss. It powers the Ensemble Platform—upload any model and get back a smaller, faster version, ready to deploy. Export to all frameworks (ONNX, TensorRT, SNPE, and more). Try it now → https://app.ensemblecore.ai/signup
☆299Updated 6 months ago
Alternatives and similar repositories for NdLinear
Users that are interested in NdLinear are comparing it to the libraries listed below
Sorting:
- ⏰ AI conference deadline countdowns☆293Updated last week
- List of startups doing AI & ML☆289Updated 3 weeks ago
- Repository for ACM India Summer School on Generative AI for Text☆13Updated last year
- ☆567Updated last month
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆395Updated last month
- ☆45Updated 6 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- 🟣 Pytorch interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.☆274Updated 3 months ago
- ☆89Updated 8 months ago
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning☆565Updated last month
- Resources needed to start deep learning research. ML/DL/CV/NLP/ML-SYS/RL/Graphs/Maths/Med image lecture videos from professors at esteeme…☆90Updated 3 weeks ago
- List of AI Internships☆129Updated 2 years ago
- GPU Kernels☆210Updated 7 months ago
- Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".☆141Updated last month
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆195Updated 6 months ago
- Building GPT ...☆18Updated last year
- 100 days of building GPU kernels!☆552Updated 7 months ago
- Best practices & guides on how to write distributed pytorch training code☆552Updated last month
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated last year
- Fine tune Gemma 3 on an object detection task☆92Updated 5 months ago
- ☆32Updated 5 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆118Updated 5 months ago
- VIT inference in triton because, why not?☆32Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆149Updated 2 months ago
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆128Updated last year
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)☆525Updated 2 months ago
- Tutorials for Triton, a language for writing gpu kernels☆61Updated 2 years ago
- A list of summer schools on Artificial Intelligence, Machine Learning, and Healthcare☆407Updated this week
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆105Updated 11 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last month