ltgoslo / elc-bertLinks
☆20Updated 4 months ago
Alternatives and similar repositories for elc-bert
Users that are interested in elc-bert are comparing it to the libraries listed below
Sorting:
- Simple-to-use scoring function for arbitrarily tokenized texts.☆46Updated 6 months ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆58Updated last year
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆128Updated 2 months ago
- Code for Zero-Shot Tokenizer Transfer☆135Updated 7 months ago
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆114Updated last year
- The evaluation pipeline for the 2024 BabyLM Challenge.☆33Updated 9 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆44Updated last year
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆45Updated last month
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆24Updated 9 months ago
- Experiments for efforts to train a new and improved t5☆76Updated last year
- Erasing concepts from neural representations with provable guarantees☆232Updated 7 months ago
- Code for SaGe subword tokenizer (EACL 2023)☆26Updated 9 months ago
- Extract full next-token probabilities via language model APIs☆247Updated last year
- ☆42Updated 5 months ago
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆33Updated last year
- ☆66Updated 2 years ago
- Evaluation pipeline for the BabyLM Challenge 2023.☆77Updated last year
- ☆51Updated 7 months ago
- Rust library for indexing and quickly searching large pretraining corpora☆28Updated last week
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆186Updated last month
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆66Updated 2 years ago
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆107Updated 5 months ago
- Probabilistic programming with large language models☆134Updated last month
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆46Updated 9 months ago
- ☆39Updated last year
- Code repository for the c-BTM paper☆107Updated last year
- ☆67Updated 3 years ago
- ☆69Updated last year
- A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.☆104Updated last year
- How do transformer LMs encode relations?☆52Updated last year