OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM
☆50Oct 10, 2024Updated last year
Alternatives and similar repositories for only_train_once
Users who are interested in only_train_once are comparing it to the libraries listed below.
- (NeurIPS 2024) One-shot Federated Learning via Synthetic Distiller-Distillate Communication☆19Mar 11, 2025Updated last year
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch)☆34May 21, 2023Updated 2 years ago
- [ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers☆105Dec 30, 2024Updated last year
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆25Jul 6, 2024Updated last year
- ☆13Jul 3, 2024Updated last year
- ☆31Feb 8, 2026Updated 2 months ago
- vortex particles for simulating smoke in 2d☆16Dec 13, 2021Updated 4 years ago
- ☆15Apr 25, 2023Updated 3 years ago
- ICLR 2026☆154Apr 8, 2026Updated 3 weeks ago
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- This is the official repo for "Differentiable Model Scaling using Differentiable Topk"☆12May 16, 2024Updated last year
- Lightweight piece tokenization library☆12Apr 15, 2024Updated 2 years ago
- Awesome Pruning. ✅ Curated Resources for Neural Network Pruning.☆175Aug 30, 2024Updated last year
- EEZ Studio Chinese edition☆13Jun 20, 2025Updated 10 months ago
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)☆65Sep 28, 2024Updated last year
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆258Mar 13, 2025Updated last year
- Thinking is hard - automate it☆18Aug 24, 2022Updated 3 years ago
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…☆15Oct 18, 2022Updated 3 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- The official code of "Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers"☆19Jul 24, 2024Updated last year
- ☆21Feb 11, 2022Updated 4 years ago
- ☆10Oct 20, 2023Updated 2 years ago
- ☆21Oct 1, 2024Updated last year
- [IEEE TIP 2024] Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model☆36Apr 24, 2024Updated 2 years ago
- Code of Robust Lottery Tickets for Pre-trained Language Models (ACL2022)☆20Jul 18, 2022Updated 3 years ago
- ☆38Feb 11, 2026Updated 2 months ago
- Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network (AAAI 2023)☆21Oct 28, 2023Updated 2 years ago
- Official PyTorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"☆81Jul 7, 2025Updated 9 months ago
- Data analysis scripts for Puffer☆12Jun 4, 2025Updated 10 months ago
- Exploring Robustness of Unsupervised Domain Adaptation in Semantic Segmentation (ICCV 2021; Oral)☆12Oct 9, 2022Updated 3 years ago
- Official code for the ECCV2024 paper "Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities"☆26Oct 1, 2024Updated last year
- A Python API for the MiniSat and MiniCard constraint solvers.☆23Jan 1, 2026Updated 3 months ago
- Official Implementation of Curriculum of Data Augmentation for Long-tailed Recognition (CUDA) (ICLR'23 Spotlight)☆23May 26, 2023Updated 2 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- Reading notes and collected resources for Operating Systems: Internals and Design Principles (8th edition)☆19Jun 3, 2021Updated 4 years ago
- Elucidated Dataset Condensation (NeurIPS 2024)☆20Oct 5, 2024Updated last year
- High quality slow motion video generation using deep neural nets and optical flow.☆10Jul 13, 2018Updated 7 years ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition☆12Oct 25, 2020Updated 5 years ago