stanford-cs336 / spring2024-assignment1-basics
☆28Updated 6 months ago
Alternatives and similar repositories for spring2024-assignment1-basics:
Users that are interested in spring2024-assignment1-basics are comparing it to the libraries listed below
- ☆200Updated 3 weeks ago
- ☆48Updated 11 months ago
- Textbook on reinforcement learning from human feedback☆112Updated this week
- Sparse and discrete interpretability tool for neural networks☆58Updated 11 months ago
- A puzzle to learn about prompting☆123Updated last year
- Extract full next-token probabilities via language model APIs☆230Updated 10 months ago
- Utilities for Training Very Large Models☆57Updated 3 months ago
- ☆51Updated last week
- A library to create and manage configuration files, especially for machine learning projects.☆76Updated 2 years ago
- PyTorch library for Active Fine-Tuning☆52Updated last week
- ☆136Updated 3 months ago
- Understand and test language model architectures on synthetic tasks.☆175Updated this week
- ☆76Updated 3 months ago
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆106Updated 5 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆66Updated 9 months ago
- Can Language Models Solve Olympiad Programming?☆108Updated this week
- ☆135Updated this week
- PyTorch building blocks for OLMo☆47Updated this week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆154Updated 2 months ago
- A set of Python scripts that makes your experience on TPU better☆44Updated 6 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆28Updated 2 months ago
- Our solution for the arc challenge 2024☆84Updated last month
- A mechanistic approach for understanding and detecting factual errors of large language models.☆39Updated 6 months ago
- ☆32Updated 11 months ago
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.☆56Updated 2 months ago
- Discovering Data-driven Hypotheses in the Wild☆51Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆90Updated 2 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆40Updated last month
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆81Updated last year