craffel / llm-seminar
Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)
☆309Updated last year
Related projects:
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…☆555Updated 10 months ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆309Updated last year
- Interpretability for sequence generation models 🐛 🔍☆361Updated 3 weeks ago
- MinT: Minimal Transformer Library and Tutorials☆247Updated 2 years ago
- Tools for understanding how transformer predictions are built layer-by-layer☆408Updated 3 months ago
- Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.☆270Updated 2 months ago
- Official repository for CMU Machine Learning Department's 10721: "Philosophical Foundations of Machine Intelligence".☆260Updated last year
- A library for finding knowledge neurons in pretrained transformer models.☆145Updated 2 years ago
- ☆246Updated 6 months ago
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…☆190Updated 3 months ago
- Task-based datasets, preprocessing, and evaluation for sequence models.☆552Updated this week
- An interactive exploration of Transformer programming.☆243Updated 10 months ago
- PAIR.withgoogle.com and friends' work on interpretability methods☆138Updated last week
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆168Updated 2 years ago
- Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆456Updated last year
- Aligning AI With Shared Human Values (ICLR 2021)☆230Updated last year
- Repository for research in the field of Responsible NLP at Meta.☆180Updated last month
- ☆229Updated 2 months ago
- Erasing concepts from neural representations with provable guarantees☆202Updated 3 months ago
- Puzzles for exploring transformers☆293Updated last year
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆118Updated 3 weeks ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆424Updated last year
- git extension for {collaborative, communal, continual} model development☆202Updated 3 months ago
- ☆322Updated 5 months ago
- Few-shot Learning of GPT-3☆337Updated last year
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.☆525Updated 3 months ago
- Robustness Gym is an evaluation toolkit for machine learning.☆439Updated 2 years ago
- Mechanistic Interpretability Visualizations using React☆175Updated 2 months ago
- Adversarial Natural Language Inference Benchmark☆388Updated 2 years ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆770Updated 4 months ago