基于PyTorch GPT-2的针对各种数据并行pretrain的研究代码.
☆11Dec 16, 2022Updated 3 years ago
Alternatives and similar repositories for DistributedTrainingGPT2
Users that are interested in DistributedTrainingGPT2 are comparing it to the libraries listed below
Sorting:
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆52May 12, 2025Updated 9 months ago
- Computational predictor of protein intrinsic disorder and its functions☆10Dec 4, 2023Updated 2 years ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆11Apr 14, 2022Updated 3 years ago
- ☆10Aug 15, 2022Updated 3 years ago
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆10Feb 13, 2024Updated 2 years ago
- ☆11Dec 5, 2024Updated last year
- ☆13May 25, 2022Updated 3 years ago
- Force Fields☆14Oct 25, 2022Updated 3 years ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- AttentionDTA: prediction of drug–target binding affinity using attention model.https://ieeexplore.ieee.org/abstract/document/8983125☆13Aug 29, 2020Updated 5 years ago
- ☆12Jun 15, 2023Updated 2 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- Math evaluations of llama models.☆10Jan 3, 2024Updated 2 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- An Image Recognition tutorial written for the HyperionDev blog☆10Dec 19, 2017Updated 8 years ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆10Apr 14, 2022Updated 3 years ago
- ☆14Sep 6, 2024Updated last year
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 2 years ago
- natural annotated text-category pairs for text classification☆10Sep 10, 2021Updated 4 years ago
- Scripts and data to run AbDesign as described in Tools for protein science 2021☆14Nov 4, 2020Updated 5 years ago
- Ilya Sutskever 推荐的30篇Deep learning 必读论文 (中英文对照翻译版)☆13Dec 18, 2024Updated last year
- ☆11Dec 15, 2025Updated 2 months ago
- ☆10Jun 7, 2025Updated 8 months ago
- Optimizing loading training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for Quantifying…☆11Jan 1, 2022Updated 4 years ago
- ☆15Apr 15, 2024Updated last year
- LCA-on-the-line (ICML 2024 Oral)☆13Feb 13, 2025Updated last year
- [EMNLP 2024] FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents☆21Jan 6, 2025Updated last year
- ☆14Oct 7, 2023Updated 2 years ago
- 基于Lean大佬Lede源码编译。使用 Flippy 的 Openwrt 打包源码,主要制作 Phicomm N1、Amlogic S905x3 的 openwrt 固件及CR660X固件。☆12Oct 4, 2025Updated 4 months ago
- Expert Specialization MoE Solution based on CUTLASS☆27Jan 19, 2026Updated last month
- Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference☆13Jun 7, 2025Updated 8 months ago
- The official code for "OG-HFYOLO :Orientation Gradient Guidance and Heterogeneous Feature Fusion For Deformation Table Cell Instance Segm…☆13Jul 28, 2025Updated 7 months ago
- d3 plugin for web interfaces☆13Jul 2, 2020Updated 5 years ago
- ☆12Jul 2, 2025Updated 7 months ago
- A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorc…☆11Jun 16, 2023Updated 2 years ago
- ☆10Oct 8, 2021Updated 4 years ago
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆21Oct 14, 2025Updated 4 months ago
- ☆16Jan 10, 2023Updated 3 years ago
- The official repository for TherML - a machine learning approach to predict scFv and antibody thermostability☆14Sep 8, 2023Updated 2 years ago