neubig / minllama-assignmentLinks
☆96Updated last year
Alternatives and similar repositories for minllama-assignment
Users that are interested in minllama-assignment are comparing it to the libraries listed below
Sorting:
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆66Updated 6 months ago
- An assignment for building an NLP system from scratch.☆26Updated last year
- ☆186Updated last year
- ☆373Updated 9 months ago
- Notes and commented code for RLHF (PPO)☆110Updated last year
- Direct Preference Optimization from scratch in PyTorch☆113Updated 6 months ago
- NeurIPS 2024 tutorial on LLM Inference☆47Updated 9 months ago
- Notes on Direct Preference Optimization☆23Updated last year
- A brief and partial summary of RLHF algorithms.☆132Updated 7 months ago
- ☆143Updated 6 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆66Updated 5 months ago
- The official evaluation suite and dynamic data release for MixEval.☆249Updated 10 months ago
- ☆86Updated last year
- ☆76Updated last year
- minimal GRPO implementation from scratch☆98Updated 6 months ago
- ☆53Updated 2 months ago
- Resources for cultural NLP research☆103Updated last week
- LLM-Merging: Building LLMs Efficiently through Merging☆203Updated last year
- The Paper List on Data Contamination for Large Language Models Evaluation.☆100Updated last month
- Distributed training (multi-node) of a Transformer model☆83Updated last year
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆63Updated 5 months ago
- Reproducible, flexible LLM evaluations☆251Updated 2 months ago
- ☆192Updated 5 months ago
- Critique-out-Loud Reward Models☆70Updated 11 months ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".☆177Updated 6 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆256Updated last year
- ☆222Updated this week
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆211Updated 2 years ago
- ☆82Updated 3 months ago
- A simplified implementation for experimenting with RLVR on GSM8K, This repository provides a starting point for exploring reasoning.☆129Updated 8 months ago