A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline
☆25Apr 16, 2021Updated 4 years ago
Alternatives and similar repositories for TDS
Users that are interested in TDS are comparing it to the libraries listed below
Sorting:
- Pretrain CPM-1☆52Apr 20, 2021Updated 4 years ago
- Code for CPM-2 Pre-Train☆158Mar 18, 2023Updated 2 years ago
- Finetune CPM-1☆75Mar 18, 2023Updated 2 years ago
- ☆15Aug 18, 2022Updated 3 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- Introduction to CPM☆165Sep 26, 2021Updated 4 years ago
- ☆54Apr 15, 2022Updated 3 years ago
- CSE201 Objected-Oriented Programming in C++: Teach an AI to produce pieces of music☆12Jan 23, 2019Updated 7 years ago
- FlowDelta: Modeling Flow Information Gain in Reasoning for Conversational Machine Comprehension☆35Oct 4, 2022Updated 3 years ago
- Minghao Hu's thesis on Machine Reading Comprehension☆37Dec 19, 2019Updated 6 years ago
- ☆10Aug 15, 2022Updated 3 years ago
- ☆40Nov 14, 2022Updated 3 years ago
- IIRC baseline☆10Jan 13, 2021Updated 5 years ago
- ☆13Aug 20, 2016Updated 9 years ago
- Unofficial PyTorch Implementation of OpenAI's GPT-3☆13Apr 11, 2022Updated 3 years ago
- A "gym" style toolkit for building lightweight NAS systems.☆13Jun 13, 2022Updated 3 years ago
- The community version of HLS_BLSTM (A BLSTM FPGA accelerator of an OCR appilcation, using CAPI/SNAP))☆11Sep 27, 2019Updated 6 years ago
- A small library that will help you plot and visualize numeric data from serial port.☆12Apr 28, 2018Updated 7 years ago
- VST FM synthesizer fm sound match example experiment☆13May 25, 2020Updated 5 years ago
- Python scripts for modifying Minecraft save files☆12Feb 13, 2024Updated 2 years ago
- 本项目提供了面向中文的XLNet预训练模型,旨在丰富中文自然语言处理资源,提供多元化的中文预训练模型选择。 我们欢迎各位专家学者下载使用,并共同促进和发展中文资源建设。☆11May 30, 2023Updated 2 years ago
- Utility classes for dense and sparse matrices in JCuda☆11Mar 8, 2019Updated 6 years ago
- Data extract of the DoD Procurement (P-1) and RDTE (R-1) justification book exhibits submitted by the US DoD Military Departments and Def…☆13Jan 3, 2019Updated 7 years ago
- ☆12Oct 23, 2018Updated 7 years ago
- distill large scale web page text☆12Jul 29, 2023Updated 2 years ago
- Gets your API token for DuckDuckGo private email redirecting service. Needed for Bitwarden etc.☆13Apr 6, 2023Updated 2 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 9 months ago
- CommonsenseQA☆10Mar 28, 2020Updated 5 years ago
- reStructured Pre-training☆99Dec 22, 2022Updated 3 years ago
- This is a C++ implementation of cocoapi bbox evaluation code.☆11Dec 9, 2021Updated 4 years ago
- golang tun nat☆11Jul 20, 2022Updated 3 years ago
- Legate Hello World Pedagogical Library☆10Apr 5, 2023Updated 2 years ago
- A simple daemon to control fan speed on t2 Macs with patchched kernel. Visit https://t2linux.org for more information on the kernels☆11Aug 17, 2022Updated 3 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- Draw emoji on USTC logo.☆10Sep 15, 2017Updated 8 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41May 31, 2021Updated 4 years ago
- Confident Adaptive Transformers☆14Apr 18, 2021Updated 4 years ago