ensemble-core / NdLinearLinks
NdLinear by Ensemble is a drop-in PyTorch module that shrinks your models with no accuracy loss. It powers the Ensemble Platform—upload any model and get back a smaller, faster version, ready to deploy. Export to all frameworks (ONNX, TensorRT, SNPE, and more). Try it now → https://app.ensemblecore.ai/signup
☆298Updated 7 months ago
Alternatives and similar repositories for NdLinear
Users that are interested in NdLinear are comparing it to the libraries listed below
Sorting:
- Repository for ACM India Summer School on Generative AI for Text☆13Updated last year
- ⏰ AI conference deadline countdowns☆307Updated last week
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆399Updated 2 months ago
- List of startups doing AI & ML☆293Updated last month
- ☆576Updated 2 months ago
- List of AI Internships☆131Updated 2 years ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆149Updated 3 months ago
- GPU Kernels☆217Updated 8 months ago
- ☆45Updated 7 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆181Updated 5 months ago
- Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".☆141Updated last month
- Fine tune Gemma 3 on an object detection task☆95Updated 5 months ago
- Getting crystal-like representations with harmonic loss☆194Updated 9 months ago
- Best practices & guides on how to write distributed pytorch training code☆562Updated 2 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- React + Next.js template for research websites (for PhD students, researchers, etc)☆217Updated last year
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated last year
- Tutorials for Triton, a language for writing gpu kernels☆65Updated 2 years ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆451Updated 10 months ago
- Building GPT ...☆18Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆195Updated 7 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last week
- Resources needed to start deep learning research. ML/DL/CV/NLP/ML-SYS/RL/Graphs/Maths/Med image lecture videos from professors at esteeme…☆90Updated last month
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆121Updated 6 months ago
- H-Net: Hierarchical Network with Dynamic Chunking☆800Updated last month
- ☆89Updated 9 months ago
- 100 days of building GPU kernels!☆560Updated 8 months ago
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆328Updated 2 months ago
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning☆572Updated last month
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025)☆529Updated 3 months ago