stanford-cs336 / spring2024-assignment1-basicsLinks
☆58Updated last year
Alternatives and similar repositories for spring2024-assignment1-basics
Users that are interested in spring2024-assignment1-basics are comparing it to the libraries listed below
Sorting:
- ☆334Updated 7 months ago
- ☆90Updated 10 months ago
- Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/☆61Updated 4 months ago
- ☆353Updated this week
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated 11 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆202Updated this week
- ☆51Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆256Updated last year
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆128Updated 2 years ago
- A puzzle to learn about prompting☆132Updated 2 years ago
- Understand and test language model architectures on synthetic tasks.☆221Updated 3 weeks ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆207Updated 7 months ago
- ☆81Updated 5 months ago
- Inference API for many LLMs and other useful tools for empirical research☆61Updated this week
- ☆95Updated 3 months ago
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.☆220Updated last year
- ☆166Updated 2 years ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆135Updated this week
- RuLES: a benchmark for evaluating rule-following in language models☆228Updated 5 months ago
- LLM-Merging: Building LLMs Efficiently through Merging☆202Updated 10 months ago
- Open source interpretability artefacts for R1.☆157Updated 3 months ago
- Extract full next-token probabilities via language model APIs☆247Updated last year
- An extension of the nanoGPT repository for training small MOE models.☆164Updated 4 months ago
- ☆180Updated 8 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆57Updated 9 months ago
- An interactive exploration of Transformer programming.☆267Updated last year
- Our solution for the arc challenge 2024☆166Updated last month
- ☆124Updated last year
- PyTorch library for Active Fine-Tuning☆87Updated 5 months ago