craffel / comp790-deep-learning-spring-2022Links
Course repository for the Spring 2022 COMP790 course "Deep Learning" at UNC
☆19Updated 3 years ago
Alternatives and similar repositories for comp790-deep-learning-spring-2022
Users that are interested in comp790-deep-learning-spring-2022 are comparing it to the libraries listed below
Sorting:
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆15Updated 2 years ago
- ☆52Updated last year
- Google Research☆46Updated 2 years ago
- Embedding Recycling for Language models☆38Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆44Updated 7 months ago
- Minimum Description Length probing for neural network representations☆20Updated 8 months ago
- ☆44Updated 11 months ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Updated 2 years ago
- Utilities for Training Very Large Models☆58Updated last year
- Helper scripts and notes that were used while porting various nlp models☆48Updated 3 years ago
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024☆22Updated last year
- A diff tool for language models☆44Updated last year
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10Updated 5 months ago
- ML/DL Math and Method notes☆64Updated last year
- Measuring if attention is explanation with ROAR☆22Updated 2 years ago
- A library to create and manage configuration files, especially for machine learning projects.☆79Updated 3 years ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆81Updated 3 years ago
- ☆20Updated last year
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Updated 4 years ago
- Minimalist BERT implementation assignment for CS11-711☆83Updated 3 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 2 years ago
- Using FlexAttention to compute attention with different masking patterns☆46Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated 2 years ago
- Recycling diverse models☆45Updated 2 years ago
- ☆43Updated 3 years ago
- ☆22Updated 2 years ago
- ☆38Updated 2 years ago
- Transformers at any scale☆41Updated last year