ensemble-core / NdLinearLinks
NdLinear by Ensemble is a drop-in PyTorch module that shrinks your models with no accuracy loss. It powers the Ensemble Platform—upload any model and get back a smaller, faster version, ready to deploy. Export to all frameworks (ONNX, TensorRT, SNPE, and more). Try it now → https://app.ensemblecore.ai/signup
☆298Updated 5 months ago
Alternatives and similar repositories for NdLinear
Users that are interested in NdLinear are comparing it to the libraries listed below
Sorting:
- Repository for ACM India Summer School on Generative AI for Text☆13Updated last year
- ⏰ AI conference deadline countdowns☆288Updated 2 weeks ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆390Updated 2 weeks ago
- ☆555Updated 2 weeks ago
- ☆45Updated 6 months ago
- ☆662Updated last week
- ☆32Updated 4 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- List of startups doing AI & ML☆286Updated 11 months ago
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)☆516Updated 2 months ago
- Resources needed to start deep learning research. ML/DL/CV/NLP/ML-SYS/RL/Graphs/Maths/Med image lecture videos from professors at esteeme…☆89Updated 3 months ago
- GPUGrants - a list of GPU grants that I can think of☆49Updated 2 months ago
- Tutorials for Triton, a language for writing gpu kernels☆57Updated 2 years ago
- GPU Kernels☆209Updated 7 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆117Updated 5 months ago
- List of AI Internships☆129Updated 2 years ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆180Updated 4 months ago
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023!☆121Updated last year
- NUS CS5242 Neural Networks and Deep Learning, Xavier Bresson, 2025☆403Updated 7 months ago
- Implementation of Agent Attention in Pytorch☆92Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆121Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆196Updated 5 months ago
- ☆90Updated 7 months ago
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning☆561Updated 2 weeks ago
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆93Updated 5 months ago
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆127Updated last year
- ☆96Updated last month
- 100 days of building GPU kernels!☆540Updated 7 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated last month
- From scratch implementation of a vision language model in pure PyTorch☆250Updated last year