craffel / llm-seminar

Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)

☆311

Alternatives and similar repositories for llm-seminar

Users who are interested in llm-seminar are comparing it to the repositories listed below.

stanford-crfm / mistral
Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging F…
☆575 Updated last year
google-research / cascades
Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…
☆208 Updated 2 months ago
mega002 / lm-debugger
The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.
☆178 Updated 3 years ago
krishnap25 / mauve
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
☆294 Updated last year
collin-burns / discovering_latent_knowledge
☆274 Updated last year
inseq-team / inseq
Interpretability for sequence generation models 🐛 🔍
☆432 Updated 3 months ago
IntelLabs / academic-budget-bert
Repository containing code for "How to Train BERT with an Academic Budget" paper
☆314 Updated last year
nostalgebraist / transformer-utils
Utilities for the HuggingFace transformers library
☆70 Updated 2 years ago
EleutherAI / project-menu
See the issue board for the current status of active and prospective projects!
☆65 Updated 3 years ago
acmi-lab / cmu-10721-philosophy-machine-intelligence
Official repository for CMU Machine Learning Department's 10721: "Philosophical Foundations of Machine Intelligence".
☆262 Updated 2 years ago
EleutherAI / concept-erasure
Erasing concepts from neural representations with provable guarantees
☆231 Updated 6 months ago
inverse-scaling / prize
A prize for finding tasks that cause large language models to show inverse scaling
☆613 Updated last year
PAIR-code / interpretability
PAIR.withgoogle.com and friends' work on interpretability methods
☆195 Updated 3 weeks ago
google / seqio
Task-based datasets, preprocessing, and evaluation for sequence models.
☆583 Updated last week
bigscience-workshop / t-zero
Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
☆462 Updated 2 years ago
srush / raspy
An interactive exploration of Transformer programming.
☆267 Updated last year
google-deepmind / transformer_grammars
Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)
☆127 Updated last month
fdalvi / NeuroX
A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.
☆102 Updated last year
AlignmentResearch / tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
☆512 Updated last year
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆247 Updated last year
shayne-longpre / a-pretrainers-guide
☆72 Updated 2 years ago
anthropics / evals
☆287 Updated last year
r-three / git-theta
git extension for {collaborative, communal, continual} model development
☆217 Updated 8 months ago
facebookresearch / ResponsibleNLP
Repository for research in the field of Responsible NLP at Meta.
☆202 Updated 2 months ago
TomFrederik / unseal
Mechanistic Interpretability for Transformer Models
☆51 Updated 3 years ago
hendrycks / ethics
Aligning AI With Shared Human Values (ICLR 2021)
☆290 Updated 2 years ago
TransformerLensOrg / CircuitsVis
Mechanistic Interpretability Visualizations using React
☆272 Updated 7 months ago
huggingface / olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
☆177 Updated 2 years ago
tech-srl / RASP
An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"
☆317 Updated 10 months ago
zphang / minimal-opt
☆67 Updated 2 years ago