A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline
☆25Apr 16, 2021Updated 4 years ago
Alternatives and similar repositories for TDS
Users that are interested in TDS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Distill CPM-1☆18May 6, 2021Updated 4 years ago
- Code for CPM-2 Pre-Train☆157Mar 18, 2023Updated 3 years ago
- Finetune CPM-1☆73Mar 18, 2023Updated 3 years ago
- ☆14Aug 18, 2022Updated 3 years ago
- Introduction to CPM☆165Sep 26, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 中国法研杯 CAIL 2019☆13Jun 17, 2019Updated 6 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- ☆15Dec 10, 2021Updated 4 years ago
- The implementation for the paper `Byte-Pair Encoding for Text-to-SQL Generation`.☆14Feb 26, 2020Updated 6 years ago
- ☆13May 26, 2022Updated 3 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41May 31, 2021Updated 4 years ago
- ☆54Apr 15, 2022Updated 3 years ago
- MATCH-TUNING☆15Aug 6, 2022Updated 3 years ago
- MultilingualShareGPT, the free multi-language corpus for LLM training☆73Apr 6, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Source code for "Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue Systems"☆10Oct 5, 2020Updated 5 years ago
- Baremetal softwares for TrivialMIPS platform☆11Aug 12, 2019Updated 6 years ago
- A router IP written in Verilog.☆12Dec 20, 2019Updated 6 years ago
- ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing☆31Aug 30, 2021Updated 4 years ago
- ☆11Dec 9, 2020Updated 5 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 8 months ago
- ☆17Jun 21, 2024Updated last year
- A summary of my projects☆49Dec 29, 2025Updated 3 months ago
- ☆11Apr 29, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [AFK] Hardware router in Chisel (THU Network Joint Lab 2020)☆14Oct 8, 2020Updated 5 years ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- Tutorial for rCore OS step by step (3rd edition)☆10Apr 24, 2021Updated 4 years ago
- A tiny Catalyst-like experiment runner framework on top of micrograd.☆51Jan 18, 2021Updated 5 years ago
- Relaxed Rust (for cats)☆14Nov 20, 2019Updated 6 years ago
- A simple USB to UART board designed with KiCad.☆14May 4, 2023Updated 2 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- Utility classes for dense and sparse matrices in JCuda☆11Mar 8, 2019Updated 7 years ago
- ☆49Dec 24, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The code of AAAI 2020 paper "Transparent Classification with Multilayer Logical Perceptrons and Random Binarization".☆23Mar 10, 2024Updated 2 years ago
- 本项目提供了面向中文的XLNet预训练模型,旨在丰富中文自然语言处理资源,提供多元化的中文预训练模型选择。 我们欢迎各位专家学者下载使用,并共同促进和发展中文资源建设。☆11May 30, 2023Updated 2 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…☆47May 29, 2023Updated 2 years ago
- ☆21Mar 29, 2020Updated 6 years ago
- A Survey of Neural Dialogue Systems☆19Dec 31, 2021Updated 4 years ago
- ☆22Feb 14, 2023Updated 3 years ago
- Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Jan 12, 2022Updated 4 years ago