aiintelligentsystems / next-level-bertLinks

☆15

Alternatives and similar repositories for next-level-bert

Users that are interested in next-level-bert are comparing it to the libraries listed below

Sorting:

JHU-CLSP / ettin-encoder-vs-decoder
State-of-the-art paired encoder and decoder models (17M-1B params)
☆38Updated last week
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆39Updated 2 years ago
Leukas / CUTE
☆14Updated 2 months ago
ltgoslo / ltg-bert
LTG-Bert
☆33Updated last year
bltlab / seqscore
SeqScore: Scoring for named entity recognition and other sequence labeling tasks
☆23Updated 4 months ago
thevasudevgupta / bigbird
Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers
☆49Updated 2 years ago
malteos / scincl
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)
☆71Updated 2 years ago
MeLeLBGU / SaGe
Code for SaGe subword tokenizer (EACL 2023)
☆25Updated 8 months ago
TristanThrush / perplexity-correlations
Simple and scalable tools for data-driven pretraining data selection.
☆25Updated 2 months ago
ltgoslo / bert-in-context
Official implementation of "BERTs are Generative In-Context Learners"
☆32Updated 4 months ago
orevaahia / magnet-tokenization
☆13Updated 8 months ago
g8a9 / ear
Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"
☆49Updated 3 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆93Updated 2 years ago
ielab / Starbucks
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆21Updated last month
LuisaMaerz / KnowMAN
KnowMAN: Weakly Supervised Multinomial Adversarial Networks
☆12Updated 3 years ago
ltgoslo / gpt-bert
Official implementation of "GPT or BERT: why not both?"
☆57Updated 2 weeks ago
alon-albalak / FLAD
Few-shot Learning with Auxiliary Data
☆31Updated last year
catie-aq / flashT5
A fast implementation of T5/UL2 in PyTorch using Flash Attention
☆107Updated 4 months ago
gchhablani / multilingual-vqa
Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.
☆34Updated 4 years ago
guy-dar / embedding-space
☆54Updated 2 years ago
cisnlp / MEXA
🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
☆12Updated 4 months ago
stanfordnlp / ColBERT-QA
Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)
☆41Updated 4 years ago
kayoyin / interpret-lm
Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)
☆62Updated 3 years ago
Knowledgator / FlashDeBERTa
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆63Updated 2 months ago
konstantinjdobler / focus
[EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"
☆32Updated 2 months ago
ottowg / gsap-ner
☆10Updated 10 months ago
flairNLP / familiarity
Label shift estimation for transfer difficulty with Familiarity.
☆10Updated 6 months ago
Mihir3009 / In-BoXBART
In-BoXBART: Get Instructions into Biomedical Multi-task Learning
☆14Updated 2 years ago
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆49Updated last year
davidheineman / thresh
🌾 Universal, customizable and deployable fine-grained evaluation for text generation.
☆23Updated last year