craffel / llm-seminar
Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)
☆308Updated last year
Related projects ⓘ
Alternatives and complementary repositories for llm-seminar
- Tools for understanding how transformer predictions are built layer-by-layer☆430Updated 5 months ago
- An interactive exploration of Transformer programming.☆246Updated last year
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…☆196Updated 5 months ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆172Updated 2 years ago
- ☆253Updated 8 months ago
- PAIR.withgoogle.com and friend's work on interpretability methods☆149Updated 3 weeks ago
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆563Updated last year
- git extension for {collaborative, communal, continual} model development☆205Updated this week
- Resources from the EleutherAI Math Reading Group☆51Updated last month
- Erasing concepts from neural representations with provable guarantees☆209Updated last week
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆309Updated last year
- Puzzles for exploring transformers☆325Updated last year
- See the issue board for the current status of active and prospective projects!☆65Updated 2 years ago
- MinT: Minimal Transformer Library and Tutorials☆248Updated 2 years ago
- Extract full next-token probabilities via language model APIs☆229Updated 8 months ago
- Official repository for CMU Machine Learning Department's 10721: "Philosophical Foundations of Machine Intelligence".☆260Updated last year
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆119Updated 3 weeks ago
- ☆161Updated last year
- Interpretability for sequence generation models 🐛 🔍☆377Updated last week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆516Updated this week
- Repository for research in the field of Responsible NLP at Meta.☆186Updated this week
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆252Updated last year
- A puzzle to learn about prompting☆121Updated last year
- Mechanistic Interpretability Visualizations using React☆198Updated 4 months ago
- Implementation of https://srush.github.io/annotated-s4☆469Updated last year
- Scaling Data-Constrained Language Models☆321Updated last month
- A prize for finding tasks that cause large language models to show inverse scaling☆597Updated last year
- ☆73Updated last year
- ☆239Updated 4 months ago
- Annotations of the interesting ML papers I read☆214Updated last week