A plug-in for Microsoft DeepSpeed that fixes a bug in the DeepSpeed pipeline
☆25Apr 16, 2021Updated 5 years ago
Alternatives and similar repositories for TDS
Users that are interested in TDS are comparing it to the libraries listed below.
- Pretrain CPM-1☆53Apr 20, 2021Updated 5 years ago
- Distill CPM-1☆18May 6, 2021Updated 5 years ago
- ☆14Aug 18, 2022Updated 3 years ago
- Introduction to CPM☆164Sep 26, 2021Updated 4 years ago
- The implementation for the paper `Byte-Pair Encoding for Text-to-SQL Generation`.☆14Feb 26, 2020Updated 6 years ago
- ☆13May 26, 2022Updated 4 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated last year
- ☆54Apr 15, 2022Updated 4 years ago
- MATCH-TUNING☆15Aug 6, 2022Updated 3 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- CS294-162; Machine Learning Systems Seminar☆32Apr 11, 2023Updated 3 years ago
- A "gym" style toolkit for building lightweight NAS systems.☆13Jun 13, 2022Updated 3 years ago
- Source code for "Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue Systems"☆10Oct 5, 2020Updated 5 years ago
- Bare-metal software for the TrivialMIPS platform☆11Aug 12, 2019Updated 6 years ago
- A router IP written in Verilog.☆12Dec 20, 2019Updated 6 years ago
- ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing☆31Aug 30, 2021Updated 4 years ago
- Tomasulo Simulator written in React as the project for Computer Architecture course, Spring 2019, Tsinghua University☆12Jun 9, 2019Updated 7 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 10 months ago
- ☆17Jun 21, 2024Updated last year
- A summary of my projects☆49Dec 29, 2025Updated 5 months ago
- reStructured Pre-training☆99Dec 22, 2022Updated 3 years ago
- ☆11Apr 29, 2024Updated 2 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆193Jun 14, 2023Updated 2 years ago
- Tutorial for rCore OS step by step (3rd edition)☆10Apr 24, 2021Updated 5 years ago
- A tiny Catalyst-like experiment runner framework on top of micrograd.☆51Jan 18, 2021Updated 5 years ago
- Word Embeddings for Low Resource Languages: The Case of Buryat☆10Mar 12, 2025Updated last year
- fast trainer for educational purposes☆26Jun 4, 2026Updated last week
- Utility classes for dense and sparse matrices in JCuda☆11Mar 8, 2019Updated 7 years ago
- A mini (consistent-wannabe) proof-assistant with power roughly equivalent to intelligence of a two month old cat☆16Mar 12, 2022Updated 4 years ago
- ☆49Dec 24, 2020Updated 5 years ago
- Minghao Hu's thesis on Machine Reading Comprehension☆37Dec 19, 2019Updated 6 years ago
- distill large scale web page text☆12Jul 29, 2023Updated 2 years ago
- Stock Analysis System based on Spark, Kafka.☆10Dec 22, 2019Updated 6 years ago
- The code of AAAI 2020 paper "Transparent Classification with Multilayer Logical Perceptrons and Random Binarization".☆23Mar 10, 2024Updated 2 years ago
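- This project provides XLNet pre-trained models for Chinese, aiming to enrich Chinese natural language processing resources and offer a more diverse choice of Chinese pre-trained models. We welcome experts and scholars to download and use them, and to jointly promote the development of Chinese-language resources.☆11May 30, 2023Updated 3 years ago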
- Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Jan 12, 2022Updated 4 years ago
- RV32I by cats☆15Sep 4, 2023Updated 2 years ago
- ☆13Nov 1, 2021Updated 4 years ago
- Various data structures implementation, in STL style☆11Jul 26, 2021Updated 4 years ago