neubig / minllama-assignment
☆95Updated 11 months ago
Alternatives and similar repositories for minllama-assignment
Users that are interested in minllama-assignment are comparing it to the libraries listed below
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/ ☆65Updated 5 months ago
- An assignment for building an NLP system from scratch.☆26Updated last year
- ☆184Updated last year
- ☆360Updated 8 months ago
- Direct Preference Optimization from scratch in PyTorch☆110Updated 5 months ago
- Notes and commented code for RLHF (PPO)☆107Updated last year
- ☆86Updated last year
- NeurIPS 2024 tutorial on LLM Inference☆47Updated 9 months ago
- A brief and partial summary of RLHF algorithms.☆132Updated 6 months ago
- minimal GRPO implementation from scratch☆97Updated 6 months ago
- Notes on Direct Preference Optimization☆21Updated last year
- Distributed training (multi-node) of a Transformer model☆83Updated last year
- ☆133Updated 5 months ago
- The official evaluation suite and dynamic data release for MixEval.☆245Updated 10 months ago
- ☆45Updated last month
- ☆52Updated last year
- ☆217Updated 7 months ago
- CS 224N Winter 2023 Default Final Project: Multitask BERT☆25Updated 2 years ago
- Website☆55Updated 2 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆64Updated 4 months ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".☆176Updated 5 months ago
- ☆60Updated last year
- ☆190Updated 9 months ago
- Resources for cultural NLP research☆103Updated 4 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆125Updated 2 months ago
- a curated list of the role of small models in the LLM era☆104Updated 11 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆215Updated last month
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆105Updated 2 years ago
- Critique-out-Loud Reward Models☆70Updated 10 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆256Updated last year