ensemble-core / NdLinear
NdLinear by Ensemble is a drop-in PyTorch module that shrinks your models with no accuracy loss. It powers the Ensemble Platform—upload any model and get back a smaller, faster version, ready to deploy. Export to all frameworks (ONNX, TensorRT, SNPE, and more). Try it now → https://app.ensemblecore.ai/signup
☆302 Updated last month
Alternatives and similar repositories for NdLinear
Users who are interested in NdLinear are comparing it to the repositories listed below.
- Repository for ACM India Summer School on Generative AI for Text ☆13 Updated last year
- ☆459 Updated this week
- ⏰ AI conference deadline countdowns ☆267 Updated last month
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning" ☆101 Updated 3 weeks ago
- List of startups doing AI & ML ☆274 Updated 7 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆140 Updated last month
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch ☆309 Updated this week
- GPU Kernels ☆188 Updated 2 months ago
- List of AI Internships ☆123 Updated last year
- Resources needed to start deep learning research. ML/DL/CV/NLP/ML-SYS/RL/Graphs/Maths/Med image lecture videos from professors at esteeme… ☆84 Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆188 Updated last month
- Fine tune Gemma 3 on an object detection task ☆69 Updated last week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆30 Updated this week
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs ☆38 Updated 8 months ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025. ☆81 Updated 2 months ago
- The AdEMAMix Optimizer: Better, Faster, Older. ☆183 Updated 10 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code. ☆378 Updated 4 months ago
- Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training". ☆130 Updated 2 weeks ago
- ☆43 Updated last month
- Reproduction of DeepSeek-R1 ☆235 Updated 3 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting… ☆174 Updated last week
- An extension of the nanoGPT repository for training small MOE models. ☆160 Updated 4 months ago
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public ☆87 Updated last month
- H-Net: Hierarchical Network with Dynamic Chunking ☆115 Updated this week
- Getting crystal-like representations with harmonic loss ☆191 Updated 3 months ago
- ☆179 Updated 6 months ago
- VIT inference in triton because, why not? ☆30 Updated last year
- 100 days of building GPU kernels! ☆462 Updated 2 months ago
- ☆290 Updated 2 months ago
- Reading list for research topics in state-space models ☆306 Updated last month