beyondguo / TrainingDynamics
Compute training dynamics, plot data cartography, analyse data quality...
☆41 Updated 2 years ago
Alternatives and similar repositories for TrainingDynamics
Users who are interested in TrainingDynamics are comparing it to the libraries listed below.
- Code for ACL2023 paper: Pre-Training to Learn in Context ☆106 Updated last year
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarity ☆73 Updated last year
- self-adaptive in-context learning ☆45 Updated 2 years ago
- ☆107 Updated 3 months ago
- Released code for our ICLR23 paper. ☆66 Updated 2 years ago
- ☆54 Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning. ☆131 Updated 2 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃 ☆114 Updated 3 years ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement) ☆49 Updated last year
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023. ☆64 Updated 11 months ago
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning". ☆66 Updated 2 years ago
- Retrieval as Attention ☆82 Updated 2 years ago
- Complexity Based Prompting for Multi-Step Reasoning ☆17 Updated 2 years ago
- reStructured Pre-training ☆98 Updated 2 years ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)" ☆46 Updated 4 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆51 Updated 4 months ago
- Code for paper 'Data-Efficient FineTuning' ☆28 Updated 2 years ago
- ACL'23: Unified Demonstration Retriever for In-Context Learning ☆37 Updated last year
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback ☆42 Updated last year
- A collection of instruction data and scripts for machine translation. ☆20 Updated 2 years ago
- Methods and evaluation for aligning language models temporally ☆30 Updated last year
- Lightweight tool to identify Data Contamination in LLMs evaluation ☆52 Updated last year
- On Transferability of Prompt Tuning for Natural Language Processing ☆100 Updated last year
- [EMNLP 2025] Verification Engineering for RL in Instruction Following ☆40 Updated 3 weeks ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie… ☆26 Updated 3 years ago
- Do Large Language Models Know What They Don’t Know? ☆100 Updated 11 months ago
- ☆12 Updated last year
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674 ☆196 Updated 2 years ago
- ☆103 Updated last year
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers ☆38 Updated 3 years ago