ltgoslo / elc-bert
☆20 Updated 6 months ago
Alternatives and similar repositories for elc-bert
Users who are interested in elc-bert are comparing it to the repositories listed below
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022) ☆130 Updated 4 months ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆60 Updated last year
- Extract full next-token probabilities via language model APIs ☆247 Updated last year
- Simple-to-use scoring function for arbitrarily tokenized texts. ☆46 Updated 7 months ago
- Code for Zero-Shot Tokenizer Transfer ☆138 Updated 9 months ago
- Rust library for indexing and quickly searching large pretraining corpora ☆29 Updated this week
- Erasing concepts from neural representations with provable guarantees ☆236 Updated 8 months ago
- Experiments for efforts to train a new and improved t5 ☆75 Updated last year
- ☆53 Updated last year
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE … ☆114 Updated last year
- ☆65 Updated 2 years ago
- Utilities for the HuggingFace transformers library ☆72 Updated 2 years ago
- Evaluation pipeline for the BabyLM Challenge 2023. ☆76 Updated 2 years ago
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22 ☆66 Updated 2 years ago
- ☆39 Updated last year
- ☆67 Updated 3 years ago
- ☆19 Updated last year
- ☆52 Updated last year
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks ☆48 Updated 10 months ago
- Official implementation of "GPT or BERT: why not both?" ☆61 Updated 2 months ago
- Composable inference algorithms with LLMs and programmable logic ☆69 Updated 10 months ago
- The evaluation pipeline for the 2024 BabyLM Challenge. ☆33 Updated 11 months ago
- The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations". ☆83 Updated last year
- Code repository for the c-BTM paper ☆107 Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023) ☆26 Updated 10 months ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023 ☆135 Updated last year
- Understand and test language model architectures on synthetic tasks. ☆233 Updated 3 weeks ago
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024) ☆33 Updated last year
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers. ☆46 Updated 3 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆192 Updated last year