stanford-cs336 / spring2024-assignment1-basicsLinks
☆49Updated 11 months ago
Alternatives and similar repositories for spring2024-assignment1-basics
Users that are interested in spring2024-assignment1-basics are comparing it to the libraries listed below
Sorting:
- ☆89Updated 9 months ago
- ☆298Updated 6 months ago
- ☆181Updated 2 months ago
- ☆200Updated this week
- ☆25Updated 8 months ago
- A mechanistic approach for understanding and detecting factual errors of large language models.☆46Updated 11 months ago
- NeurIPS 2024 tutorial on LLM Inference☆45Updated 6 months ago
- PyTorch library for Active Fine-Tuning☆80Updated 4 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆125Updated this week
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing☆55Updated 7 months ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆105Updated 2 years ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆127Updated 2 years ago
- An extension of the nanoGPT repository for training small MOE models.☆152Updated 3 months ago
- ☆87Updated 2 months ago
- ☆97Updated 11 months ago
- Direct Preference Optimization from scratch in PyTorch☆98Updated 2 months ago
- Understand and test language model architectures on synthetic tasks.☆218Updated 2 weeks ago
- 🧠 Starter templates for doing interpretability research☆71Updated last year
- LOFT: A 1 Million+ Token Long-Context Benchmark☆202Updated last week
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆23Updated 6 months ago
- ☆51Updated last year
- ☆163Updated 7 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆219Updated 6 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆109Updated last year
- ☆53Updated last year
- Notes on Direct Preference Optimization☆19Updated last year
- A MAD laboratory to improve AI architecture designs 🧪☆122Updated 6 months ago
- ☆26Updated 2 years ago
- LLM-Merging: Building LLMs Efficiently through Merging☆199Updated 9 months ago