haizelabs / nyc-ai-readingLinks
nyc is so back
☆19Updated 6 months ago
Alternatives and similar repositories for nyc-ai-reading
Users that are interested in nyc-ai-reading are comparing it to the libraries listed below
Sorting:
- Extract full next-token probabilities via language model APIs☆248Updated last year
- A domain-specific probabilistic programming language for modeling and inference with language models☆140Updated 8 months ago
- ☆126Updated 2 months ago
- METR Task Standard☆169Updated 10 months ago
- Mechanistic Interpretability Visualizations using React☆303Updated last year
- A reading list of relevant papers and projects on foundation model annotation☆28Updated 10 months ago
- ☆283Updated last year
- ☆75Updated last week
- ☆144Updated 5 months ago
- ☆318Updated last year
- A puzzle to learn about prompting☆135Updated 2 years ago
- ☆116Updated 3 weeks ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆233Updated last year
- ☆132Updated 2 years ago
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…☆216Updated 6 months ago
- Draw more samples☆198Updated last year
- ☆185Updated last year
- Redwood Research's transformer interpretability tools☆14Updated 3 years ago
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.☆236Updated 4 months ago
- ☆286Updated last year
- [NeurIPS 2023] Learning Transformer Programs☆162Updated last year
- A toolkit for describing model features and intervening on those features to steer behavior.☆224Updated 2 weeks ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆216Updated last week
- Neural theorem proving tutorial, version II☆40Updated last year
- 🧱 Modula software package☆316Updated 4 months ago
- ☆59Updated 3 months ago
- Stochastic Parameter Decomposition☆59Updated this week
- Tools for studying developmental interpretability in neural networks.☆117Updated 6 months ago
- ☆112Updated 10 months ago
- Resources from the EleutherAI Math Reading Group☆54Updated 10 months ago