ltgoslo / elc-bertLinks
☆21Updated 9 months ago
Alternatives and similar repositories for elc-bert
Users that are interested in elc-bert are comparing it to the libraries listed below
Sorting:
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Updated last year
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆134Updated last week
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆66Updated 3 years ago
- ☆53Updated last year
- Simple-to-use scoring function for arbitrarily tokenized texts.☆47Updated 11 months ago
- Experiments for efforts to train a new and improved t5☆76Updated last year
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆116Updated last year
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆33Updated last year
- Extract full next-token probabilities via language model APIs☆248Updated last year
- Code for Zero-Shot Tokenizer Transfer☆142Updated last year
- ☆167Updated 2 years ago
- A domain-specific probabilistic programming language for modeling and inference with language models☆141Updated 9 months ago
- ☆53Updated 2 years ago
- The evaluation pipeline for the 2024 BabyLM Challenge.☆33Updated last year
- ☆38Updated last year
- Code repository for the c-BTM paper☆108Updated 2 years ago
- Understand and test language model architectures on synthetic tasks.☆252Updated last month
- Erasing concepts from neural representations with provable guarantees☆243Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆45Updated 2 years ago
- Probabilistic programming with large language models☆160Updated 2 months ago
- Universal Neurons in GPT2 Language Models☆30Updated last year
- Rust library for indexing and quickly searching large pretraining corpora☆30Updated 3 months ago
- Official implementation of "GPT or BERT: why not both?"☆61Updated 6 months ago
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆113Updated 3 months ago
- Evaluation pipeline for the BabyLM Challenge 2023.☆77Updated 2 years ago
- ☆67Updated 3 years ago
- ☆53Updated 2 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Updated last year
- ☆78Updated 3 years ago
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"☆248Updated 8 months ago