NAACL '24 (Best Demo Paper Runner-Up) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
☆69Dec 9, 2024Updated last year
Alternatives and similar repositories for redco
Users who are interested in redco are comparing it to the libraries listed below.
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- Mixed precision training from scratch with Tensors and CUDA☆30May 14, 2024Updated 2 years ago
- MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)☆16Nov 14, 2024Updated last year
- Code for the ACL 2021 paper "Structural Guidance for Transformer Language Models"☆14Sep 17, 2025Updated 8 months ago
- ☆45Oct 15, 2025Updated 7 months ago
- ☆12Dec 13, 2022Updated 3 years ago
- ☆20May 28, 2025Updated 11 months ago
- ☆17Dec 9, 2022Updated 3 years ago
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- ☆28Jul 11, 2024Updated last year
- ☆14Jun 24, 2024Updated last year
- ☆16Dec 9, 2023Updated 2 years ago
- StarCraft 2 Imitation Learning☆29Jul 2, 2021Updated 4 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆137Feb 21, 2022Updated 4 years ago
- Code for MLSys 2024 Paper "SiDA-MoE: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models"☆22Apr 13, 2024Updated 2 years ago
- Pipeline parallelism for the minimalist☆39Aug 6, 2025Updated 9 months ago
- Official repository for DistFlashAttn: Distributed Memory-efficient Attention for Long-context LLMs Training☆221Aug 19, 2024Updated last year
- ☆63Dec 6, 2024Updated last year
- A schedule language for large model training☆152Aug 21, 2025Updated 8 months ago
- Learn online intrinsic rewards from LLM feedback☆45Dec 17, 2024Updated last year
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆43Mar 11, 2025Updated last year
- ☆13May 13, 2025Updated last year
- Scripts for large-scale prediction of lexical semantic change.☆14Feb 9, 2023Updated 3 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)☆11Oct 25, 2021Updated 4 years ago
- StructSR: Refuse Spurious Details in Real-World Image Super-Resolution☆29Jan 16, 2025Updated last year
- The test set for Koala☆45Mar 31, 2023Updated 3 years ago
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆18Jun 11, 2024Updated last year
- Interpretable Word Sense Representations via Definition Generation☆10Mar 6, 2025Updated last year
- ☆21Dec 22, 2020Updated 5 years ago
- Implementation for <Regularizing Neural Networks via Minimizing Hyperspherical Energy> in CVPR'20.☆24Jun 23, 2020Updated 5 years ago
- Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China.☆21Nov 17, 2023Updated 2 years ago
- Learning to Model Editing Processes☆26Aug 3, 2025Updated 9 months ago
- Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training☆1,878Updated this week
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- Findings of ACL 2021☆24May 8, 2021Updated 5 years ago
- ☆22Dec 15, 2023Updated 2 years ago
- ☆22Nov 20, 2020Updated 5 years ago