chandar-lab / NeoBERTLinks

☆79

Alternatives and similar repositories for NeoBERT

Users that are interested in NeoBERT are comparing it to the libraries listed below

Sorting:

Knowledgator / FlashDeBERTa
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆62Updated 2 months ago
AnswerDotAI / fastkmeans
☆64Updated last month
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49Updated 5 months ago
jxmorris12 / bm25_pt
minimal pytorch implementation of bm25 (with sparse tensors)
☆104Updated last year
LAGoM-NLP / transtokenizer
☆51Updated 6 months ago
mixedbread-ai / mxbai-rerank
Crispy reranking models by Mixedbread
☆33Updated 3 weeks ago
ltgoslo / gpt-bert
Official implementation of "GPT or BERT: why not both?"
☆57Updated last week
bminixhofer / tokenkit
A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.
☆40Updated last month
MinishLab / tokenlearn
Pre-train Static Word Embeddings
☆85Updated 2 months ago
s-smits / modernbert-finetune
Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training
☆67Updated 6 months ago
catie-aq / flashT5
A fast implementation of T5/UL2 in PyTorch using Flash Attention
☆107Updated 4 months ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆27Updated last year
Zyphra / Zyda_processing
☆37Updated last year
Knowledgator / LiqFit
Efficient few-shot learning with cross-encoders.
☆56Updated last year
trapoom555 / Language-Model-STS-CFT
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆64Updated last year
SeunghyunSEO / optimized_hf_llama_class_for_training
☆48Updated 11 months ago
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆196Updated 2 months ago
Aleph-Alpha-Research / trigrams
☆56Updated 3 months ago
mixedbread-ai / batched
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆142Updated 3 weeks ago
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33Updated 2 months ago
ibm-granite / granite-embedding-models
☆29Updated last month
Knowledgator / TurboT5
Truly flash T5 realization!
☆68Updated last year
Nicolas-BZRD / EuroBERT
Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…
☆66Updated last month
MeLeLBGU / SaGe
Code for SaGe subword tokenizer (EACL 2023)
☆25Updated 8 months ago
jfkback / hypencoder-paper
Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"
☆27Updated 5 months ago
sileod / tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
☆185Updated last month
JHU-CLSP / ettin-encoder-vs-decoder
State-of-the-art paired encoder and decoder models (17M-1B params)
☆38Updated last week
princeton-nlp / LitSearch
[EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search
☆93Updated 8 months ago
microsoft / encoder-decoder-slm
Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…
☆29Updated 6 months ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆93Updated 2 years ago