State-of-the-art paired encoder and decoder models (17M-1B params)
☆73Aug 6, 2025Updated 10 months ago
Alternatives and similar repositories for ettin-encoder-vs-decoder
Users that are interested in ettin-encoder-vs-decoder are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆57Jul 10, 2025Updated 11 months ago
- This repository helps you evaluate your models on the FreshStack benchmark!☆34Dec 9, 2025Updated 6 months ago
- ☆101Jul 4, 2025Updated 11 months ago
- Documenting large text datasets 🖼️ 📚☆14Dec 17, 2024Updated last year
- KalDB is a cloud-native polystore for search and analytics☆41Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆40Sep 20, 2025Updated 8 months ago
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- Experiments for efforts to train a new and improved t5☆76Apr 15, 2024Updated 2 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆19May 23, 2025Updated last year
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆65Updated this week
- Tool to perform paired evaluation of automatic systems☆13Oct 20, 2021Updated 4 years ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 6 months ago
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆43Oct 29, 2025Updated 7 months ago
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking☆13Feb 5, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- FlexiTokens☆23Dec 27, 2025Updated 5 months ago
- ☆13Oct 2, 2023Updated 2 years ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 9 months ago
- Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP …☆17Oct 17, 2023Updated 2 years ago
- triple-encoders is a library for contextualizing distributed Sentence Transformers representations.☆15Sep 3, 2024Updated last year
- SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL)☆16Jul 27, 2024Updated last year
- ☆12Jan 25, 2026Updated 4 months ago
- Contextualized per-token embeddings☆36Updated this week
- [TMLR 2025 & ICLR 2025 DeLTa] Official Implementation of Design Editing for Offline Model-based Optimization 🧬 🤖☆10Apr 17, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- a benchmark to evaluate the situated inductive reasoning☆16Jan 7, 2025Updated last year
- ☆16Jun 14, 2024Updated 2 years ago
- ADAG: Transluce's MLP neuron-level circuit tracing library☆28Apr 10, 2026Updated 2 months ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆28Dec 16, 2024Updated last year
- Resk is a robust Python library designed to enhance security and manage context when interacting with LLMs. It provides a protective …☆20Jun 6, 2026Updated last week
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".☆13Nov 28, 2024Updated last year
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 6 years ago
- Code and datasets for EMNLP 2022 paper: Beyond prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering Repr…☆19Jan 1, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Phi-2 Fine Tuning to build a mental health GPT.☆11Jan 6, 2024Updated 2 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆34Oct 9, 2023Updated 2 years ago
- Scaling Laws for Mixture of Experts Models☆15Feb 25, 2025Updated last year
- ☆114Jun 9, 2022Updated 4 years ago
- ☆15Dec 15, 2025Updated 6 months ago
- Interactive documentation and programming with Scala, iPython notebook style.☆19Mar 9, 2016Updated 10 years ago
- A python script to write a report automatically in docx for a twitter-graph☆14Apr 14, 2022Updated 4 years ago