☆94 · Jul 3, 2022 · Updated 3 years ago
Alternatives and similar repositories for DT-FM
Users interested in DT-FM are comparing it to the libraries listed below
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient" ☆149 · Dec 11, 2023 · Updated 2 years ago
- Website for Systems Research Seminar at UIUC ☆20 · Feb 21, 2026 · Updated last week
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021) ☆117 · Jan 13, 2022 · Updated 4 years ago
- Understanding RL vision Distill article ☆25 · Mar 3, 2023 · Updated 2 years ago
- Federated reconnaissance mini-ImageNet benchmark and baseline models ☆13 · Sep 2, 2021 · Updated 4 years ago
- A resilient distributed training framework ☆97 · Apr 11, 2024 · Updated last year
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but … ☆13 · Dec 1, 2022 · Updated 3 years ago
- Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022 ☆11 · Aug 9, 2022 · Updated 3 years ago
- Sample, estimate, aggregate: A recipe for causal discovery foundation models ☆17 · Jun 21, 2024 · Updated last year
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang ☆16 · May 4, 2023 · Updated 2 years ago
- Unofficial Experiments with AlgebraNets ☆17 · Jun 17, 2020 · Updated 5 years ago
- [NeurIPS 2022] JAX/Haiku implementation of "On Privacy and Personalization in Cross-Silo Federated Learning" ☆27 · Apr 16, 2023 · Updated 2 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper ☆59 · Oct 29, 2023 · Updated 2 years ago
- [CVPRW 2023] "Many-Task Federated Learning: A New Problem Setting and A Simple Baseline" by Ruisi Cai, Xiaohan Chen, Shiwei Liu, Jayanth … ☆13 · Aug 28, 2023 · Updated 2 years ago
- This repository contains all codes for the VLDB 2022 paper "Dynamic Spanning Trees for Connectivity Queries on Fully-dynamic Undirected G… ☆12 · Feb 24, 2025 · Updated last year
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters. ☆44 · Nov 4, 2022 · Updated 3 years ago
- ☆37 · Sep 13, 2025 · Updated 5 months ago
- ☆251 · Jul 25, 2024 · Updated last year
- Memory-efficient transformer. Work in progress. ☆19 · Sep 17, 2022 · Updated 3 years ago
- Code for the paper "Secure Distributed Training at Scale" (ICML 2022) ☆16 · Feb 4, 2025 · Updated last year
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard … ☆41 · Nov 12, 2020 · Updated 5 years ago
- [ICML 2024] Serving LLMs on heterogeneous decentralized clusters. ☆34 · May 6, 2024 · Updated last year
- ☆78 · May 4, 2021 · Updated 4 years ago
- This technique modifies image data so that any model trained on it will bear an identifiable mark. ☆44 · Aug 13, 2021 · Updated 4 years ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles". ☆24 · Mar 25, 2025 · Updated 11 months ago
- Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23] ☆47 · Nov 24, 2022 · Updated 3 years ago
- ☆19 · Jul 1, 2020 · Updated 5 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores. ☆91 · Nov 23, 2022 · Updated 3 years ago
- ☆58 · May 4, 2024 · Updated last year
- My Implementation of "Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML ☆26 · Jan 16, 2024 · Updated 2 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation ☆24 · May 1, 2022 · Updated 3 years ago
- A schedule language for large model training ☆152 · Aug 21, 2025 · Updated 6 months ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset. ☆32 · Mar 21, 2023 · Updated 2 years ago
- ☆26 · Dec 5, 2022 · Updated 3 years ago
- Accommodating Large Language Model Training over Heterogeneous Environment. ☆25 · Mar 13, 2025 · Updated 11 months ago
- ☆145 · Jan 30, 2025 · Updated last year
- FedScale is a scalable and extensible open-source federated learning (FL) platform. ☆410 · Dec 18, 2023 · Updated 2 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective ☆63 · Mar 11, 2025 · Updated 11 months ago
- Privacy Budget Orchestration in Machine Learning Workloads (OSDI '21) ☆26 · Oct 20, 2023 · Updated 2 years ago