neubig / minllama-assignmentLinks

☆90

Alternatives and similar repositories for minllama-assignment

Users that are interested in minllama-assignment are comparing it to the libraries listed below

Sorting:

cmu-l3 / anlp-spring2025-code
Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/
☆61Updated 4 months ago
neubig / anlp-code
☆181Updated last year
stanford-cs336 / spring2024-lectures
☆334Updated 7 months ago
neubig / nlp-from-scratch-assignment-spring2024
An assignment for building an NLP system from scratch.
☆26Updated last year
hkproj / rlhf-ppo
Notes and commented code for RLHF (PPO)
☆101Updated last year
0xallam / Direct-Preference-Optimization
Direct Preference Optimization from scratch in PyTorch
☆103Updated 4 months ago
stanford-cs336 / assignment5-alignment
☆33Updated 2 weeks ago
yihedeng9 / rlhf-summary-notes
A brief and partial summary of RLHF algorithms.
☆131Updated 5 months ago
hkproj / dpo-notes
Notes on Direct Preference Optimization
☆21Updated last year
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆94Updated 4 months ago
humza909 / LLM_Survey
☆86Updated last year
CASE-Lab-UMD / LLM-Drop
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
☆174Updated 4 months ago
hkproj / pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model
☆76Updated last year
DataArcTech / LLM-as-a-Judge
☆128Updated 4 months ago
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆164Updated 4 months ago
simran-khanuja / awesome-cultural-nlp
Resources for cultural NLP research
☆101Updated 3 months ago
huggingface / picotron_tutorial
☆206Updated 5 months ago
stanford-cs336 / spring2024-assignment1-basics
☆58Updated last year
CodeCreator / WebOrganizer
Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
☆58Updated 3 months ago
neubig / minbert-assignment
Minimalist BERT implementation assignment for CS11-711
☆83Updated 2 years ago
llm-merging / LLM-Merging
LLM-Merging: Building LLMs Efficiently through Merging
☆202Updated 10 months ago
gpoesia / minbert-default-final-project
CS 224N Winter 2023 Default Final Project: Multitask BERT
☆25Updated 2 years ago
arpita8 / Awesome-Mixture-of-Experts-Papers
Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.
☆128Updated 11 months ago
allenai / OLMo-core
PyTorch building blocks for the OLMo ecosystem
☆269Updated this week
Dakingrai / awesome-mechanistic-interpretability-lm-papers
☆180Updated 8 months ago
cmu-l3 / neurips2024-inference-tutorial-code
NeurIPS 2024 tutorial on LLM Inference
☆45Updated 7 months ago
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256Updated last year
Glaciohound / LM-Steer
Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)
☆123Updated 3 weeks ago
stanford-cs324 / winter2022
Website
☆53Updated 2 years ago
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆242Updated 8 months ago