neubig / minllama-assignmentLinks
☆99Updated last year
Alternatives and similar repositories for minllama-assignment
Users that are interested in minllama-assignment are comparing it to the libraries listed below
Sorting:
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆66Updated 7 months ago
- An assignment for building an NLP system from scratch.☆27Updated last year
- ☆189Updated last year
- Notes and commented code for RLHF (PPO)☆114Updated last year
- ☆393Updated 10 months ago
- Direct Preference Optimization from scratch in PyTorch☆120Updated 7 months ago
- Notes on Direct Preference Optimization☆23Updated last year
- NeurIPS 2024 tutorial on LLM Inference☆47Updated 11 months ago
- Resources for cultural NLP research☆106Updated last month
- ☆71Updated 3 months ago
- ☆86Updated last year
- minimal GRPO implementation from scratch☆99Updated 8 months ago
- Distributed training (multi-node) of a Transformer model☆86Updated last year
- ☆154Updated last month
- ☆64Updated last year
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆105Updated 2 years ago
- Minimalist BERT implementation assignment for CS11-711☆84Updated 3 years ago
- Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI m…☆224Updated 2 years ago
- A brief and partial summary of RLHF algorithms.☆136Updated 8 months ago
- The official evaluation suite and dynamic data release for MixEval.☆252Updated last year
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".☆179Updated 7 months ago
- ☆76Updated last year
- Website☆56Updated 2 years ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆68Updated 6 months ago
- LLM-Merging: Building LLMs Efficiently through Merging☆205Updated last year
- ☆94Updated 5 months ago
- ☆139Updated last year
- ☆225Updated 3 weeks ago
- Prune transformer layers☆74Updated last year
- It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) i…☆66Updated last year