beyondguo / TrainingDynamicsLinks
Compute training dynamics, plot data cartography, analysing data quality...
☆41Updated 2 years ago
Alternatives and similar repositories for TrainingDynamics
Users that are interested in TrainingDynamics are comparing it to the libraries listed below
Sorting:
- Released code for our ICLR23 paper.☆65Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning'☆28Updated 2 years ago
- self-adaptive in-context learning☆45Updated 2 years ago
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarity☆73Updated last year
- Code for "Small Models are Valuable Plug-ins for Large Language Models"☆131Updated 2 years ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆107Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆54Updated last year
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆114Updated 2 years ago
- On Transferability of Prompt Tuning for Natural Language Processing☆99Updated last year
- ☆104Updated last month
- reStructured Pre-training☆98Updated 2 years ago
- ☆59Updated 2 years ago
- This respository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluatio…☆79Updated last year
- Lightweight tool to identify Data Contamination in LLMs evaluation☆51Updated last year
- [NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li…☆21Updated last year
- A collection of instruction data and scripts for machine translation.☆20Updated last year
- ☆35Updated last year
- [ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)☆124Updated last year
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆51Updated 2 years ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models☆62Updated 8 months ago
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers☆38Updated 2 years ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆64Updated 9 months ago
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"☆27Updated last year
- Code for embedding and retrieval research.☆17Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆83Updated last year
- Feeling confused about super alignment? Here is a reading list☆43Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆132Updated 2 years ago
- ☆53Updated 3 years ago
- ⚡Research papers about leveraging the capabilities of language models⚡☆52Updated 2 years ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆60Updated 2 years ago