State-of-the-art paired encoder and decoder models (17M-1B params)
☆69Aug 6, 2025Updated 8 months ago
Alternatives and similar repositories for ettin-encoder-vs-decoder
Users that are interested in ettin-encoder-vs-decoder are comparing it to the libraries listed below.
- This repository helps you evaluate your models on the FreshStack benchmark!☆34Dec 9, 2025Updated 4 months ago
- ☆98Jul 4, 2025Updated 10 months ago
- Documenting large text datasets 🖼️ 📚☆14Dec 17, 2024Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- Better Live Text for MacOS☆35Feb 8, 2026Updated 2 months ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆65Apr 15, 2026Updated 2 weeks ago
- Tool to perform paired evaluation of automatic systems☆13Oct 20, 2021Updated 4 years ago
- The training code for Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 5 months ago
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆41Oct 29, 2025Updated 6 months ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- The official repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 4 years ago
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking☆13Feb 5, 2023Updated 3 years ago
- FlexiTokens☆21Dec 27, 2025Updated 4 months ago
- ☆111Jun 2, 2025Updated 11 months ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 8 months ago
- Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP …☆17Oct 17, 2023Updated 2 years ago
- triple-encoders is a library for contextualizing distributed Sentence Transformers representations.☆15Sep 3, 2024Updated last year
- SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL)☆16Jul 27, 2024Updated last year
- ☆12Jan 25, 2026Updated 3 months ago
- MiniLM (BERT) embeddings from scratch☆20Aug 14, 2025Updated 8 months ago
- ☆16Jun 14, 2024Updated last year
- Minimalist implementation of a GPT2 with Language Model Head with PyTorch Lightning, Transformers and PyTorch-NLP.☆24Jun 12, 2023Updated 2 years ago
- Resk is a robust Python library designed to enhance security and manage context when interacting with LLMs. It provides a protective …☆19Apr 13, 2026Updated 3 weeks ago
- ☆15Jun 19, 2025Updated 10 months ago
- ☆11Feb 9, 2024Updated 2 years ago
- Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".☆13Nov 28, 2024Updated last year
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 6 years ago
- Code and datasets for EMNLP 2022 paper: Beyond prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering Repr…☆19Jan 1, 2024Updated 2 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆34Oct 9, 2023Updated 2 years ago
- ☆114Jun 9, 2022Updated 3 years ago
- ☆14Nov 2, 2022Updated 3 years ago
- Interactive documentation and programming with Scala, iPython notebook style.☆19Mar 9, 2016Updated 10 years ago
- ALBERT Persian Playground☆13Jun 12, 2023Updated 2 years ago
- Baselines for Model-Based Optimization installation fixes and compatible with newer AMPERE+ GPUs (e.g. 3090)☆11Apr 30, 2023Updated 3 years ago
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 10 months ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆67Jul 6, 2025Updated 9 months ago
- My NER Experiments with ModernBERT and Ettin☆27Jul 17, 2025Updated 9 months ago
- An unofficial Python 3 version of jemdoc.☆11Feb 8, 2026Updated 2 months ago