stanford-cs336 / spring2024-assignment1-basics
☆57 Updated last year
Alternatives and similar repositories for spring2024-assignment1-basics
Users who are interested in spring2024-assignment1-basics are comparing it to the repositories listed below
- ☆89 Updated 9 months ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022) ☆105 Updated 2 years ago
- ☆51 Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆75 Updated 11 months ago
- ☆266 Updated this week
- Sparse and discrete interpretability tool for neural networks ☆63 Updated last year
- A mechanistic approach for understanding and detecting factual errors of large language models. ☆46 Updated last year
- NeurIPS 2024 tutorial on LLM Inference ☆45 Updated 7 months ago
- ☆321 Updated 6 months ago
- A puzzle to learn about prompting ☆131 Updated 2 years ago
- Understanding how features learned by neural networks evolve throughout training ☆36 Updated 8 months ago
- ☆32 Updated last month
- Simple and efficient pytorch-native transformer training and inference (batched) ☆77 Updated last year
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆127 Updated 2 years ago
- ML/DL Math and Method notes ☆61 Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆57 Updated 8 months ago
- An interactive exploration of Transformer programming. ☆265 Updated last year
- Website ☆53 Updated 2 years ago
- ☆37 Updated last year
- Code for the ACL 2023 paper: "Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Sc… ☆31 Updated last year
- PyTorch library for Active Fine-Tuning ☆87 Updated 5 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆256 Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders" ☆22 Updated 5 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆193 Updated this week
- Easily run PyTorch on multiple GPUs & machines ☆46 Updated 3 weeks ago
- An extension of the nanoGPT repository for training small MOE models. ☆162 Updated 4 months ago
- Compiling useful links, papers, benchmarks, ideas, etc. ☆45 Updated 4 months ago
- LLM-Merging: Building LLMs Efficiently through Merging ☆201 Updated 9 months ago
- we got you bro ☆35 Updated 11 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments. ☆31 Updated 2 months ago