stanford-cs336 / spring2024-assignment1-basicsLinks

☆58

Alternatives and similar repositories for spring2024-assignment1-basics

Users that are interested in spring2024-assignment1-basics are comparing it to the libraries listed below

Sorting:

stanford-cs336 / spring2024-lectures
☆334Updated 7 months ago
neubig / minllama-assignment
☆90Updated 10 months ago
cmu-l3 / anlp-spring2025-code
Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/
☆61Updated 4 months ago
marin-community / marin
☆353Updated this week
hamishivi / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆75Updated 11 months ago
EleutherAI / delphi
Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …
☆202Updated this week
srush / LLM-Talk
☆51Updated last year
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256Updated last year
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆128Updated 2 years ago
srush / GPTWorld
A puzzle to learn about prompting
☆132Updated 2 years ago
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆221Updated 3 weeks ago
callummcdougall / sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆207Updated 7 months ago
ARBORproject / arborproject.github.io
☆81Updated 5 months ago
safety-research / safety-tooling
Inference API for many LLMs and other useful tools for empirical research
☆61Updated this week
METR / RE-Bench
☆95Updated 3 months ago
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆63Updated last year
callummcdougall / ARENA_2.0
Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.
☆220Updated last year
srush / do-we-need-attention
☆166Updated 2 years ago
ServiceNow / PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆135Updated this week
normster / llm_rules
RuLES: a benchmark for evaluating rule-following in language models
☆228Updated 5 months ago
llm-merging / LLM-Merging
LLM-Merging: Building LLMs Efficiently through Merging
☆202Updated 10 months ago
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆157Updated 3 months ago
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆247Updated last year
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆164Updated 4 months ago
Dakingrai / awesome-mechanistic-interpretability-lm-papers
☆180Updated 8 months ago
ckkissane / crosscoder-model-diff-replication
Open source replication of Anthropic's Crosscoders for Model Diffing
☆57Updated 9 months ago
srush / raspy
An interactive exploration of Transformer programming.
☆267Updated last year
da-fr / arc-prize-2024
Our solution for the arc challenge 2024
☆166Updated last month
neelnanda-io / 1L-Sparse-Autoencoder
☆124Updated last year
jonhue / activeft
PyTorch library for Active Fine-Tuning
☆87Updated 5 months ago