基于PyTorch GPT-2的针对各种数据并行pretrain的研究代码.
☆11Dec 16, 2022Updated 3 years ago
Alternatives and similar repositories for DistributedTrainingGPT2
Users that are interested in DistributedTrainingGPT2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Optimizing loading training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for Quantifying…☆11Jan 1, 2022Updated 4 years ago
- 基于Lean大佬Lede源码编译。使用 Flippy 的 Openwrt 打包源码,主要制作 Phicomm N1、Amlogic S905x3 的 openwrt 固件及CR660X固件。☆12Oct 4, 2025Updated 8 months ago
- d3 plugin for web interfaces☆14Jul 2, 2020Updated 5 years ago
- ☆10Oct 8, 2021Updated 4 years ago
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc…☆13Jun 16, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- It was may. A tiny OS.☆10Apr 13, 2023Updated 3 years ago
- ☆14Oct 7, 2023Updated 2 years ago
- ☆20Oct 8, 2024Updated last year
- This is a adobe amf library for golang, only amf3 supported for now.☆29May 15, 2013Updated 13 years ago
- ☆17Jun 8, 2019Updated 7 years ago
- 深入ElasticSearch☆17Mar 8, 2016Updated 10 years ago
- TREC QA dataset for question answering cleaned for usage in Question Answering☆14Aug 26, 2019Updated 6 years ago
- OpenWrt上Smartdns的自动守护进程,放到/etc/init.d目录用service smartdnsprocd enable开机自启☆19Jun 21, 2021Updated 4 years ago
- Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard☆25Dec 14, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆57May 12, 2025Updated last year
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆11Feb 13, 2024Updated 2 years ago
- Official Pytorch implementation of "DBS: Dynamic Batch Size for Distributed Deep Neural Network Training"☆23Sep 30, 2021Updated 4 years ago
- My Assignment for CSE 599w http://dlsys.cs.washington.edu/☆15Dec 2, 2019Updated 6 years ago
- Compact and Agent-Native MoE Training System☆144Updated this week
- ☆11Dec 15, 2025Updated 5 months ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 10 months ago
- ☆12Jun 15, 2023Updated 2 years ago
- 一个高效的前后端集成框架,基于Vite、Vue、Webpack和Node.js。一键启动,开箱即用。 An efficient front-end and back-end integration framework based on Vite, Vue, Webpack…☆18May 8, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- ☆16May 22, 2023Updated 3 years ago
- Understanding deep networks and large models.☆28Jan 23, 2026Updated 4 months ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- A tool for PHP multi process asynchronous tasks manage☆18Aug 14, 2018Updated 7 years ago
- Implementation for POET and POET-X for LLM pretraining☆34Updated this week
- Measuring the Signal to Noise Ratio in Language Model Evaluation