chandar-lab / NeoBERTLinks
☆56Updated 3 weeks ago
Alternatives and similar repositories for NeoBERT
Users that are interested in NeoBERT are comparing it to the libraries listed below
Sorting:
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆58Updated last month
- ☆61Updated last week
- ☆47Updated 4 months ago
- Pre-train Static Word Embeddings☆79Updated 3 weeks ago
- ☆48Updated 5 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆27Updated last year
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…☆27Updated 4 months ago
- Crispy reranking models by Mixedbread☆32Updated 3 weeks ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆65Updated 4 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆101Updated last year
- Official implementation of "GPT or BERT: why not both?"☆53Updated 2 weeks ago
- Model implementation for the contextual embeddings project☆33Updated 3 weeks ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆25Updated 3 months ago
- ☆29Updated 5 months ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆94Updated this week
- ☆52Updated 3 weeks ago
- Efficient few-shot learning with cross-encoders.☆53Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆78Updated 2 weeks ago
- Analysis on the cost of encoder based models☆11Updated 4 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆64Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆137Updated last month
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- NLP with Rust for Python 🦀🐍☆62Updated last month
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆33Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Code for Zero-Shot Tokenizer Transfer☆133Updated 5 months ago
- My NER Experiments with ModernBERT☆21Updated last month
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆21Updated 4 months ago
- ☆38Updated last month