neubig / minllama-assignment
☆87 Updated 8 months ago
Alternatives and similar repositories for minllama-assignment
Users who are interested in minllama-assignment are comparing it to the repositories listed below
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/ ☆53 Updated 2 months ago
- An assignment for building an NLP system from scratch. ☆26 Updated last year
- Notes and commented code for RLHF (PPO) ☆94 Updated last year
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch ☆115 Updated last month
- NeurIPS 2024 tutorial on LLM Inference ☆45 Updated 5 months ago
- Minimalist BERT implementation assignment for CS11-711 ☆83 Updated 2 years ago
- ☆177 Updated last year
- Direct Preference Optimization from scratch in PyTorch ☆92 Updated last month
- ☆286 Updated 5 months ago
- Notes on Direct Preference Optimization ☆19 Updated last year
- A brief and partial summary of RLHF algorithms. ☆128 Updated 3 months ago
- Simple and efficient pytorch-native transformer training and inference (batched) ☆75 Updated last year
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation ☆50 Updated last month
- CS 224N Winter 2023 Default Final Project: Multitask BERT ☆25 Updated 2 years ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models ☆54 Updated last month
- ☆253 Updated last week
- ☆87 Updated last year
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ☆72 Updated 2 years ago
- ☆45 Updated 10 months ago
- ☆131 Updated 6 months ago
- ☆97 Updated 11 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers". ☆76 Updated last year
- Code and written solutions of the assignments of the Stanford CS224N: Natural Language Processing with Deep Learning course from winter 2… ☆247 Updated last year
- The official evaluation suite and dynamic data release for MixEval. ☆242 Updated 6 months ago
- minimal GRPO implementation from scratch ☆90 Updated 2 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆107 Updated last year
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆106 Updated 5 months ago
- Website ☆53 Updated 2 years ago
- Distributed training (multi-node) of a Transformer model ☆68 Updated last year
- ☆174 Updated last month