chandar-lab / NeoBERTLinks
☆83Updated 4 months ago
Alternatives and similar repositories for NeoBERT
Users that are interested in NeoBERT are comparing it to the libraries listed below
Sorting:
- ☆77Updated 3 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated last week
- ☆49Updated 8 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated last year
- A massively multilingual modern encoder language model☆92Updated 2 weeks ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆67Updated 8 months ago
- ☆52Updated 8 months ago
- Official implementation of "GPT or BERT: why not both?"☆59Updated 2 months ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆46Updated 3 months ago
- Pre-train Static Word Embeddings☆86Updated last month
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆107Updated 6 months ago
- Efficient few-shot learning with cross-encoders.☆59Updated last year
- ☆57Updated last week
- Datamodels for hugging face tokenizers☆77Updated 2 weeks ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆113Updated last month
- ☆39Updated last year
- Crispy reranking models by Mixedbread☆36Updated 3 weeks ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆27Updated last year
- ☆48Updated last year
- Generalist and Lightweight Model for Text Classification☆163Updated 3 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 weeks ago
- QLoRA with Enhanced Multi GPU Support☆37Updated 2 years ago
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆43Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆66Updated last year
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆67Updated 3 months ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆50Updated 2 months ago
- ☆71Updated 2 months ago
- Truly flash T5 realization!☆70Updated last year
- An introduction to LLM Sampling☆79Updated 9 months ago