chandar-lab / NeoBERT
☆49Updated 2 months ago
Alternatives and similar repositories for NeoBERT
Users who are interested in NeoBERT are comparing it to the libraries listed below.
- ☆43Updated 3 months ago
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆48Updated last week
- ☆42Updated last week
- Pre-train Static Word Embeddings☆60Updated last month
- ☆45Updated 3 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆26Updated last year
- Crispy reranking models by Mixedbread☆31Updated 2 weeks ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Efficient few-shot learning with cross-encoders.☆51Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)☆101Updated last year
- Using short models to classify long texts☆21Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated last week
- ☆29Updated 4 months ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆24Updated 2 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆19Updated 3 months ago
- ☆39Updated last week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆132Updated 4 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…☆23Updated 3 months ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆92Updated 10 months ago
- ☆47Updated 8 months ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆66Updated 3 months ago
- ☆33Updated 10 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆61Updated last year
- Library for fast text representation and classification.☆28Updated last year
- Official implementation of "GPT or BERT: why not both?"☆53Updated 2 months ago
- QLoRA with Enhanced Multi GPU Support☆37Updated last year
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆36Updated 3 years ago
- ☆57Updated 7 months ago