chandar-lab / NeoBERTLinks
☆106Updated 8 months ago
Alternatives and similar repositories for NeoBERT
Users that are interested in NeoBERT are comparing it to the libraries listed below
Sorting:
- ☆91Updated 7 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated 3 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆76Updated 2 weeks ago
- Pre-train Static Word Embeddings☆94Updated 5 months ago
- ☆53Updated last year
- A massively multilingual modern encoder language model☆126Updated 3 weeks ago
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆113Updated 3 months ago
- Official implementation of "GPT or BERT: why not both?"☆61Updated 6 months ago
- Truly flash T5 realization!☆72Updated 2 weeks ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆62Updated 7 months ago
- Datamodels for hugging face tokenizers☆99Updated this week
- Crispy reranking models by Mixedbread☆45Updated 4 months ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆124Updated last month
- State-of-the-art paired encoder and decoder models (17M-1B params)☆58Updated 6 months ago
- ☆57Updated last month
- Code for SaGe subword tokenizer (EACL 2023)☆27Updated last year
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Updated 3 weeks ago
- Efficient few-shot learning with cross-encoders.☆62Updated last year
- ☆96Updated 2 weeks ago
- ☆41Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Updated last year
- ☆53Updated 3 months ago
- ☆59Updated 2 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Updated last year
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆68Updated 2 months ago
- An introduction to LLM Sampling☆79Updated last year
- Model implementation for the contextual embeddings project☆40Updated 8 months ago
- PyLate efficient inference engine☆71Updated last month
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- ☆48Updated last year