A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline
☆25Apr 16, 2021Updated 5 years ago
Alternatives and similar repositories for TDS
Users that are interested in TDS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pretrain CPM-1☆53Apr 20, 2021Updated 5 years ago
- Distill CPM-1☆18May 6, 2021Updated 5 years ago
- Code for CPM-2 Pre-Train☆157Mar 18, 2023Updated 3 years ago
- Finetune CPM-1☆73Mar 18, 2023Updated 3 years ago
- Finetune CPM-2☆81Mar 18, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Introduction to CPM☆164Sep 26, 2021Updated 4 years ago
- 中国法研杯 CAIL 2019☆13Jun 17, 2019Updated 6 years ago
- ☆15Dec 10, 2021Updated 4 years ago
- The implementation for the paper `Byte-Pair Encoding for Text-to-SQL Generation`.☆14Feb 26, 2020Updated 6 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated last year
- Inference framework for MoE layers based on TensorRT with Python binding☆41May 31, 2021Updated 4 years ago
- ☆54Apr 15, 2022Updated 4 years ago
- MATCH-TUNING☆15Aug 6, 2022Updated 3 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A "gym" style toolkit for building lightweight NAS systems.☆13Jun 13, 2022Updated 3 years ago
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- Source code for "Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue Systems"☆10Oct 5, 2020Updated 5 years ago
- A hybrid partitioner based quantum circuit simulation system on GPU☆47Aug 17, 2022Updated 3 years ago
- Baremetal softwares for TrivialMIPS platform☆11Aug 12, 2019Updated 6 years ago
- ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing☆31Aug 30, 2021Updated 4 years ago
- ☆224Sep 19, 2023Updated 2 years ago
- ☆11Dec 9, 2020Updated 5 years ago
- Tomasulo Simulator written in React as the project for Computer Architecture course, Spring 2019, Tsinghua University☆11Jun 9, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 9 months ago
- ☆17Jun 21, 2024Updated last year
- reStructured Pre-training☆99Dec 22, 2022Updated 3 years ago
- ☆11Apr 29, 2024Updated 2 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆193Jun 14, 2023Updated 2 years ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- Legate Hello World Pedagogical Library☆10Apr 5, 2023Updated 3 years ago
- ☆17Oct 17, 2022Updated 3 years ago
- A framework for evolving and testing question-answering datasets with various models.☆25Feb 28, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A simple USB to UART board designed with KiCad.☆14May 4, 2023Updated 3 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- ☆49Dec 24, 2020Updated 5 years ago
- Stock Analysis System based on Spark, Kafka.☆10Dec 22, 2019Updated 6 years ago
- Public Inflection Benchmarks☆67Mar 6, 2024Updated 2 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…☆47May 29, 2023Updated 2 years ago
- A Survey of Neural Dialogue Systems☆19Dec 31, 2021Updated 4 years ago