neubig / minllama-assignmentLinks
☆100Updated last year
Alternatives and similar repositories for minllama-assignment
Users that are interested in minllama-assignment are comparing it to the libraries listed below
Sorting:
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆70Updated 9 months ago
- An assignment for building an NLP system from scratch.☆27Updated last year
- ☆189Updated 2 years ago
- ☆405Updated last year
- Notes and commented code for RLHF (PPO)☆121Updated last year
- NeurIPS 2024 tutorial on LLM Inference☆47Updated last year
- ☆166Updated 3 months ago
- CS 224N Winter 2023 Default Final Project: Multitask BERT☆25Updated 2 years ago
- Direct Preference Optimization from scratch in PyTorch☆123Updated 9 months ago
- ☆85Updated 2 years ago
- Resources for cultural NLP research☆113Updated 3 months ago
- Notes on Direct Preference Optimization☆23Updated last year
- Distributed training (multi-node) of a Transformer model☆91Updated last year
- Minimalist BERT implementation assignment for CS11-711☆84Updated 3 years ago
- ☆82Updated last year
- A brief and partial summary of RLHF algorithms.☆142Updated 10 months ago
- minimal GRPO implementation from scratch☆102Updated 10 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆73Updated 8 months ago
- ☆71Updated last year
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆134Updated 6 months ago
- The official evaluation suite and dynamic data release for MixEval.☆253Updated last year
- LLM-Merging: Building LLMs Efficiently through Merging☆208Updated last year
- ☆33Updated 7 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆114Updated this week
- ☆37Updated 11 months ago
- ☆139Updated last year
- ☆107Updated 7 months ago
- ☆94Updated 5 months ago
- Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2…☆269Updated last year
- Solutions for CS224n (2022)☆72Updated last year