State-of-the-art paired encoder and decoder models (17M-1B params)
☆59 Aug 6, 2025 Updated 6 months ago
Alternatives and similar repositories for ettin-encoder-vs-decoder
Users who are interested in ettin-encoder-vs-decoder are comparing it to the libraries listed below
- ☆52 Jul 10, 2025 Updated 7 months ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval" ☆33 Sep 20, 2025 Updated 5 months ago
- ☆92 Jul 4, 2025 Updated 7 months ago
- Code for SaGe subword tokenizer (EACL 2023) ☆27 Nov 30, 2024 Updated last year
- FlexiTokens ☆18 Dec 27, 2025 Updated 2 months ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi… ☆14 Jun 6, 2025 Updated 8 months ago
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking ☆13 Feb 5, 2023 Updated 3 years ago
- Documenting large text datasets 🖼️ 📚 ☆14 Dec 17, 2024 Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆18 May 23, 2025 Updated 9 months ago
- Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP … ☆17 Oct 17, 2023 Updated 2 years ago
- The training codes of Jasper-Token-Compression-600M ☆19 Nov 19, 2025 Updated 3 months ago
- Tool to perform paired evaluation of automatic systems ☆13 Oct 20, 2021 Updated 4 years ago
- PathPiece tokenizer ☆13 Nov 10, 2024 Updated last year
- triple-encoders is a library for contextualizing distributed Sentence Transformers representations. ☆15 Sep 3, 2024 Updated last year
- SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL) ☆16 Jul 27, 2024 Updated last year
- ☆13 Oct 2, 2023 Updated 2 years ago
- The official repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI… ☆16 May 4, 2022 Updated 3 years ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost ☆42 Nov 15, 2023 Updated 2 years ago
- The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25). ☆22 Aug 2, 2025 Updated 7 months ago
- This repository helps you evaluate your models on the FreshStack benchmark! ☆33 Dec 9, 2025 Updated 2 months ago
- Experiments for efforts to train a new and improved t5 ☆76 Apr 15, 2024 Updated last year
- Code and datasets for EMNLP 2022 paper: Beyond prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering Repr… ☆19 Jan 1, 2024 Updated 2 years ago
- ☆107 Jun 2, 2025 Updated 9 months ago
- ☆22 Jun 10, 2025 Updated 8 months ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training. ☆26 Nov 25, 2024 Updated last year
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety" ☆30 Jun 23, 2025 Updated 8 months ago
- Data for the HIPE 2022 shared task. ☆21 Nov 29, 2023 Updated 2 years ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025) ☆21 Jun 2, 2025 Updated 9 months ago
- ☆44 Feb 11, 2026 Updated 2 weeks ago
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling' ☆27 May 16, 2025 Updated 9 months ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT. ☆25 Dec 16, 2024 Updated last year
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers. ☆64 Jul 6, 2025 Updated 7 months ago
- My NER Experiments with ModernBERT and Ettin ☆26 Jul 17, 2025 Updated 7 months ago
- Minimalist implementation of a GPT2 with Language Model Head with PyTorch Lightning, Transformers and PyTorch-NLP. ☆24 Jun 12, 2023 Updated 2 years ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval ☆63 Updated this week
- ☆41 May 27, 2025 Updated 9 months ago
- Official code release for "SuperBPE: Space Travel for Language Models" ☆89 Jan 9, 2026 Updated last month
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs ☆43 Feb 11, 2026 Updated 2 weeks ago
- A Text2SQL benchmark for evaluation of Large Language Models ☆41 Feb 24, 2026 Updated last week