Memory Optimizations for Deep Learning (ICML 2023)
☆122Mar 13, 2024Updated 2 years ago
Alternatives and similar repositories for MODel_opt
Users that are interested in MODel_opt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Directed masked autoencoders☆15Mar 25, 2026Updated 2 months ago
- RDNA3 emulator☆61Apr 16, 2026Updated last month
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14May 20, 2026Updated last week
- ☆45Nov 1, 2022Updated 3 years ago
- ☆21Jan 21, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)☆19Apr 28, 2024Updated 2 years ago
- ☆24Jun 13, 2022Updated 3 years ago
- ILAng documentation☆10Nov 2, 2025Updated 6 months ago
- ☆15Apr 11, 2024Updated 2 years ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆31May 19, 2026Updated last week
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆520May 22, 2026Updated last week
- ☆32Jun 6, 2024Updated last year
- Mirage Persistent Kernel: Compiling LLMs into a MegaKernel☆2,271May 20, 2026Updated last week
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆133Jul 6, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Debug print operator for cudagraph debugging☆15Aug 2, 2024Updated last year
- A weak supervision framework for (partial) labeling functions☆16Jul 15, 2024Updated last year
- ☆59Dec 10, 2025Updated 5 months ago
- A parameter server implement with MPI.☆11Nov 15, 2017Updated 8 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Apr 20, 2024Updated 2 years ago
- ☆13Mar 11, 2024Updated 2 years ago
- ☆35Feb 18, 2026Updated 3 months ago
- ☆53Oct 29, 2024Updated last year
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆34Jul 21, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022☆16Jun 22, 2022Updated 3 years ago
- High performance pytorch modules☆18Jan 14, 2023Updated 3 years ago
- ☆25Oct 17, 2016Updated 9 years ago
- Code for paper Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training☆22Apr 18, 2023Updated 3 years ago
- Reinforcement Learning example in Nim, playing tic tac toe. Based off original C version from the great Antirez☆15Apr 2, 2025Updated last year
- ☆41Jun 18, 2021Updated 4 years ago
- Bottom Up Rewrite Generator☆28Aug 17, 2017Updated 8 years ago
- PyTorch-UVM on super-large language models.☆17Dec 21, 2020Updated 5 years ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆155May 6, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆201Apr 27, 2022Updated 4 years ago
- A blog for LLVM(v11.0.0) beginner, step by step, with detailed documents and comments. Record the way I learn LLVM.☆14Jun 17, 2022Updated 3 years ago
- ☆13Feb 22, 2023Updated 3 years ago
- A block oriented training approach for inference time optimization.☆34Aug 19, 2024Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- Spartan is an algorithm for training sparse neural network models. This repository accompanies the paper "Spartan Differentiable Sparsity…☆25Oct 31, 2022Updated 3 years ago
- An open-source efficient deep learning framework/compiler, written in python.☆742Sep 4, 2025Updated 8 months ago