cmu-l3 / anlp-spring2025-code
Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/
☆29Updated this week
Alternatives and similar repositories for anlp-spring2025-code:
Users that are interested in anlp-spring2025-code are comparing it to the libraries listed below
- ☆81Updated 6 months ago
- Notes and commented code for RLHF (PPO)☆79Updated last year
- ☆73Updated 8 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆39Updated last month
- Distributed training (multi-node) of a Transformer model☆62Updated 11 months ago
- ☆60Updated 11 months ago
- ☆40Updated 10 months ago
- An assignment for building an NLP system from scratch.☆24Updated last year
- ☆48Updated last year
- A brief and partial summary of RLHF algorithms.☆127Updated 3 weeks ago
- ☆158Updated last month
- An extension of the nanoGPT repository for training small MOE models.☆109Updated 3 weeks ago
- ☆90Updated last week
- This repository is maintained to release dataset and models for multimodal puzzle reasoning.☆76Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- ☆160Updated 3 weeks ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆161Updated 3 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆169Updated last week
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆72Updated 7 months ago
- The official evaluation suite and dynamic data release for MixEval.☆233Updated 4 months ago
- NeurIPS 2024 tutorial on LLM Inference☆39Updated 3 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆44Updated last month
- model activation visualiser☆90Updated this week
- ☆27Updated this week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆140Updated this week
- minimal GRPO implementation from scratch☆65Updated 2 weeks ago
- 🚢 Data Toolkit for Sailor Language Models☆88Updated last month
- Official Implementation of "Reasoning Language Models: A Blueprint"☆54Updated last month
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆170Updated 3 weeks ago
- ☆52Updated 3 weeks ago