基于PyTorch GPT-2的针对各种数据并行pretrain的研究代码.
☆11Dec 16, 2022Updated 3 years ago
Alternatives and similar repositories for DistributedTrainingGPT2
Users that are interested in DistributedTrainingGPT2 are comparing it to the libraries listed below
Sorting:
- Optimizing loading training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for Quantifying…☆11Jan 1, 2022Updated 4 years ago
- 基于Lean大佬Lede源码编译。使用 Flippy 的 Openwrt 打包源码,主要制作 Phicomm N1、Amlogic S905x3 的 openwrt 固件及CR660X固件。☆12Oct 4, 2025Updated 5 months ago
- d3 plugin for web interfaces☆13Jul 2, 2020Updated 5 years ago
- ☆10Oct 8, 2021Updated 4 years ago
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc…☆13Jun 16, 2023Updated 2 years ago
- It was may. A tiny OS.☆10Apr 13, 2023Updated 2 years ago
- ☆14Oct 7, 2023Updated 2 years ago
- ☆20Oct 8, 2024Updated last year
- This is a adobe amf library for golang, only amf3 supported for now.☆29May 15, 2013Updated 12 years ago
- ☆17Jun 8, 2019Updated 6 years ago
- 深入ElasticSearch☆17Mar 8, 2016Updated 10 years ago
- TREC QA dataset for question answering cleaned for usage in Question Answering☆14Aug 26, 2019Updated 6 years ago
- OpenWrt上Smartdns的自动守护进程,放到/etc/init.d目录用service smartdnsprocd enable开机自启☆19Jun 21, 2021Updated 4 years ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆52May 12, 2025Updated 10 months ago
- Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard☆25Dec 14, 2024Updated last year
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆11Feb 13, 2024Updated 2 years ago
- Official Pytorch implementation of "DBS: Dynamic Batch Size for Distributed Deep Neural Network Training"☆23Sep 30, 2021Updated 4 years ago
- My Assignment for CSE 599w http://dlsys.cs.washington.edu/☆15Dec 2, 2019Updated 6 years ago
- ☆11Dec 15, 2025Updated 3 months ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 8 months ago
- ☆12Jun 15, 2023Updated 2 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- 一个高效的前后端集成框架,基于Vite、Vue、Webpack和Node.js。一键启动,开箱即用。 An efficient front-end and back-end integration framework based on Vite, Vue, Webpack…☆18Feb 11, 2026Updated last month
- A python implementation of exposure fusion☆19Jan 29, 2022Updated 4 years ago
- ☆16May 22, 2023Updated 2 years ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- A tool for PHP multi process asynchronous tasks manage☆18Aug 14, 2018Updated 7 years ago
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.☆33Dec 5, 2025Updated 3 months ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 7 months ago
- Examples of using Cilium for chaos testing and fault injection☆28Sep 12, 2024Updated last year
- ☆28Aug 6, 2020Updated 5 years ago
- ☆25Jun 12, 2023Updated 2 years ago
- 京东小组件☆24Jan 28, 2022Updated 4 years ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆11Apr 14, 2022Updated 3 years ago
- The Go language implementation of gRPC over QUIC.☆30Dec 2, 2021Updated 4 years ago
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆21Oct 14, 2025Updated 5 months ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated 11 months ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 3 years ago
- LCA-on-the-line (ICML 2024 Oral)☆13Feb 13, 2025Updated last year