neubig / minllama-assignmentLinks
☆92Updated 11 months ago
Alternatives and similar repositories for minllama-assignment
Users that are interested in minllama-assignment are comparing it to the libraries listed below
Sorting:
- An assignment for building an NLP system from scratch.☆26Updated last year
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆64Updated 5 months ago
- ☆182Updated last year
- Notes and commented code for RLHF (PPO)☆104Updated last year
- ☆349Updated 8 months ago
- Direct Preference Optimization from scratch in PyTorch☆107Updated 4 months ago
- ☆132Updated 5 months ago
- NeurIPS 2024 tutorial on LLM Inference☆47Updated 8 months ago
- ☆38Updated last month
- A brief and partial summary of RLHF algorithms.☆132Updated 5 months ago
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆209Updated 2 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆62Updated 3 months ago
- Reproducible, flexible LLM evaluations☆238Updated last month
- Minimalist BERT implementation assignment for CS11-711☆83Updated 2 years ago
- ☆86Updated last year
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆123Updated last month
- A simplified implementation for experimenting with RLVR on GSM8K, This repository provides a starting point for exploring reasoning.☆121Updated 6 months ago
- ☆150Updated 9 months ago
- minimal GRPO implementation from scratch☆96Updated 5 months ago
- Notes on Direct Preference Optimization☆21Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆244Updated 9 months ago
- ☆135Updated 9 months ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆61Updated 4 months ago
- Distributed training (multi-node) of a Transformer model☆79Updated last year
- ☆214Updated 6 months ago
- ☆76Updated last year
- The Paper List on Data Contamination for Large Language Models Evaluation.☆99Updated last week
- Critique-out-Loud Reward Models☆70Updated 10 months ago
- ☆100Updated last year
- Website☆54Updated 2 years ago