amazon-science / SetLexSem-ChallengeLinks
☆18Updated last year
Alternatives and similar repositories for SetLexSem-Challenge
Users that are interested in SetLexSem-Challenge are comparing it to the libraries listed below
Sorting:
- ☆15Updated last year
- Minimum Description Length probing for neural network representations☆20Updated last year
- Implementations of growing and pruning in neural networks☆22Updated 2 years ago
- ☆20Updated 3 months ago
- Understanding how features learned by neural networks evolve throughout training☆41Updated last year
- Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Langu…☆39Updated 2 years ago
- Embedding Recycling for Language models☆38Updated 2 years ago
- Fluent dreaming for language models☆13Updated last year
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆26Updated 3 years ago
- Understanding the correlation between different LLM benchmarks☆29Updated 2 years ago
- Revisiting Hierarchical Text Classification : Inference and Metrics☆16Updated last year
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated 2 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆16Updated 4 years ago
- Updated code base for GlanceNets: Interpretable, Leak-proof Concept-based models☆25Updated 2 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Updated 2 years ago
- Utilities for Training Very Large Models☆58Updated last year
- A pure-Python Beaker client☆17Updated 3 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆44Updated 2 months ago
- Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"☆19Updated 3 years ago
- ☆10Updated 2 years ago
- Hyperparameter tuning via uncertainty modeling☆49Updated last year
- Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"☆24Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated 2 years ago
- Sparse and discrete interpretability tool for neural networks☆64Updated last year
- Google Research☆46Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated 2 years ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆61Updated 3 years ago
- Few-shot Learning with Auxiliary Data☆31Updated 2 years ago
- ☆10Updated last year