BiEchi/DistributedTrainingGPT2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/BiEchi/DistributedTrainingGPT2)

BiEchi / DistributedTrainingGPT2

基于PyTorch GPT-2的针对各种数据并行pretrain的研究代码.

☆11

Alternatives and similar repositories for DistributedTrainingGPT2

Users that are interested in DistributedTrainingGPT2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cake-lab / DELI
View on GitHub
Optimizing loading training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for Quantifying…
☆11Jan 1, 2022Updated 4 years ago
rq1025330 / Actions-OP
View on GitHub
基于Lean大佬Lede源码编译。使用 Flippy 的 Openwrt 打包源码，主要制作 Phicomm N1、Amlogic S905x3 的 openwrt 固件及CR660X固件。
☆12Oct 4, 2025Updated 9 months ago
quantmind / d3-view
View on GitHub
d3 plugin for web interfaces
☆14Jul 2, 2020Updated 6 years ago
BiEchi / chipyard
View on GitHub
☆10Oct 8, 2021Updated 4 years ago
Yvan-xy / MayOS
View on GitHub
It was may. A tiny OS.
☆10Apr 13, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
neelnanda-io / neel-plotly
View on GitHub
A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc…
☆15Jun 16, 2023Updated 3 years ago
Unified-Language-Model-Alignment / src
View on GitHub
☆14Oct 7, 2023Updated 2 years ago
initialencounter / chronocat-termux
View on GitHub
☆20Oct 8, 2024Updated last year
cybertronai / bflm
View on GitHub
☆17Jun 8, 2019Updated 7 years ago
KaneGreen / GROMACS-Windows-Builder
View on GitHub
☆22Jul 2, 2026Updated 2 weeks ago
intervention-training / int
View on GitHub
☆16Feb 4, 2026Updated 5 months ago
allwefantasy / elasticsearch-deep
View on GitHub
深入ElasticSearch
☆17Mar 8, 2016Updated 10 years ago
sid7954 / TrecQA
View on GitHub
TREC QA dataset for question answering cleaned for usage in Question Answering
☆14Aug 26, 2019Updated 6 years ago
1phalley / smartdnsprocd
View on GitHub
OpenWrt上Smartdns的自动守护进程，放到/etc/init.d目录用service smartdnsprocd enable开机自启
☆19Jun 21, 2021Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Soptq / Dynamic_Load_Balance_DistributedDNN
View on GitHub
Official Pytorch implementation of "DBS: Dynamic Batch Size for Distributed Deep Neural Network Training"
☆23Sep 30, 2021Updated 4 years ago
sinacms / MultiProcess
View on GitHub
A tool for PHP multi process asynchronous tasks manage
☆18Aug 14, 2018Updated 7 years ago
wyc-ruiker / CSE-599W-2018
View on GitHub
My Assignment for CSE 599w http://dlsys.cs.washington.edu/
☆15Dec 2, 2019Updated 6 years ago
lavinal712 / control-lora-v3
View on GitHub
☆11Dec 15, 2025Updated 7 months ago
rubyangxg / JingBeanAppWidget
View on GitHub
京东小组件
☆24Jan 28, 2022Updated 4 years ago
uct8086 / jsonVee
View on GitHub
一个高效的前后端集成框架，基于Vite、Vue、Webpack和Node.js。一键启动，开箱即用。 An efficient front-end and back-end integration framework based on Vite, Vue, Webpack…
☆18May 8, 2026Updated 2 months ago
DzvinkaYarish / ControlNet-different-backbones
View on GitHub
☆12Jun 15, 2023Updated 3 years ago
lexmen318 / metaCAT
View on GitHub
☆16May 22, 2023Updated 3 years ago
dunzeng / MORE
View on GitHub
Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment
☆16Aug 6, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
cilium / chaos-testing-examples
View on GitHub
Examples of using Cilium for chaos testing and fault injection
☆29Sep 12, 2024Updated last year
kenandaoerdect / exposure-fusion-python
View on GitHub
A python implementation of exposure fusion
☆19Jan 29, 2022Updated 4 years ago
sssgun / grpc-quic
View on GitHub
The Go language implementation of gRPC over QUIC.
☆30Dec 2, 2021Updated 4 years ago
lwplw / darknet2caffe
View on GitHub
Conversion of yolo from DarkNet to Caffe
☆26Nov 26, 2018Updated 7 years ago
LinWeizheDragon / Knowledge-Aware-Graph-Enhanced-GPT-2-for-Dialogue-State-Tracking
View on GitHub
This is the official repository of EMNLP 2021 paper "Knowledge-Aware Graph-Enhanced GPT-2 for Dialogue State Tracking".
☆24Nov 11, 2021Updated 4 years ago
spellml / deeplab-voc-2012
View on GitHub
☆29Aug 6, 2020Updated 5 years ago
Unakar / Bike-REID
View on GitHub
在监控画质下实现对校园自行车的重识别，包含REID模型识别，向量数据库检索，UI展示
☆11Feb 13, 2024Updated 2 years ago
Line-Kite / GraphLayoutLM
View on GitHub
☆14Sep 6, 2024Updated last year
Lunderberg / tvm-gdb-extension
View on GitHub
☆25Jun 12, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OPUS-MaLab / opus_rota4
View on GitHub
OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors
☆11Apr 14, 2022Updated 4 years ago
pengyanghua / DL2
View on GitHub
a deep learning-driven scheduler for elastic training in deep learning clusters
☆31Jan 14, 2021Updated 5 years ago
phonism / CP-Zero
View on GitHub
Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.
☆18Apr 22, 2025Updated last year
WorldEditors / EvolvingPlasticANN
View on GitHub
Codes for Evolving Plastic ANNs
☆15Dec 18, 2022Updated 3 years ago
ScalingIntelligence / caesar
View on GitHub
Throughput-oriented multi-turn inference engine for KernelBench [ICML '25]
☆24May 27, 2025Updated last year
CMU-AIRe / POPE
View on GitHub
☆27Jan 31, 2026Updated 5 months ago
research-outcome / LLM-Game-Benchmark
View on GitHub
Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard
☆25Dec 14, 2024Updated last year