neubig / minllama-assignment
☆88 Updated 9 months ago
Alternatives and similar repositories for minllama-assignment
Users that are interested in minllama-assignment are comparing it to the libraries listed below
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/ ☆58 Updated 3 months ago
- An assignment for building an NLP system from scratch. ☆26 Updated last year
- ☆181 Updated last year
- ☆316 Updated 6 months ago
- Notes and commented code for RLHF (PPO) ☆97 Updated last year
- Notes on Direct Preference Optimization ☆19 Updated last year
- Direct Preference Optimization from scratch in PyTorch ☆101 Updated 3 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation ☆56 Updated 2 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆242 Updated 8 months ago
- ☆122 Updated 3 months ago
- NeurIPS 2024 tutorial on LLM Inference ☆45 Updated 7 months ago
- Website ☆53 Updated 2 years ago
- A brief and partial summary of RLHF algorithms. ☆131 Updated 4 months ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed". ☆174 Updated 3 months ago
- Reproducible, flexible LLM evaluations ☆219 Updated this week
- minimal GRPO implementation from scratch ☆92 Updated 4 months ago
- ☆86 Updated last year
- ☆135 Updated 8 months ago
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022) ☆206 Updated 2 years ago
- Critique-out-Loud Reward Models ☆67 Updated 8 months ago
- LLM-Merging: Building LLMs Efficiently through Merging ☆201 Updated 9 months ago
- PyTorch building blocks for the OLMo ecosystem ☆258 Updated this week
- Evaluating LLMs with fewer examples ☆160 Updated last year
- Distributed training (multi-node) of a Transformer model ☆72 Updated last year
- ☆144 Updated 7 months ago
- ☆53 Updated last year
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI ☆110 Updated 3 weeks ago
- ☆124 Updated 9 months ago
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch ☆348 Updated 3 months ago
- ☆198 Updated 5 months ago