ltgoslo / gpt-bertView external linksLinks
Official implementation of "GPT or BERT: why not both?"
☆61Jul 28, 2025Updated 6 months ago
Alternatives and similar repositories for gpt-bert
Users that are interested in gpt-bert are comparing it to the libraries listed below
Sorting:
- ☆23Aug 19, 2025Updated 5 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Jun 30, 2025Updated 7 months ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆42Jun 10, 2025Updated 8 months ago
- ☆40May 27, 2025Updated 8 months ago
- DPO, but faster 🚀☆47Dec 6, 2024Updated last year
- ☆13Feb 2, 2025Updated last year
- A context-aware embedding similarity score☆11Aug 23, 2023Updated 2 years ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- ☆106Jun 2, 2025Updated 8 months ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆14Jun 6, 2025Updated 8 months ago
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆16Jun 3, 2024Updated last year
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated last year
- Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings (ACL 2025 Main)☆40May 16, 2025Updated 8 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆13Mar 17, 2025Updated 10 months ago
- GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)☆18Jun 24, 2024Updated last year
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆16Nov 19, 2024Updated last year
- ☆13Dec 6, 2024Updated last year
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 4 months ago
- RWKV-7: Surpassing GPT☆104Nov 17, 2024Updated last year
- Visualise, evaluate, and manage annotated data☆34Nov 10, 2022Updated 3 years ago
- ☆59Nov 18, 2025Updated 2 months ago
- ☆24May 23, 2025Updated 8 months ago
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs☆37Jan 22, 2026Updated 3 weeks ago
- ☆16May 14, 2024Updated last year
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆38Jan 22, 2026Updated 3 weeks ago
- Interspeech Tutorial - Resource Efficient and Cross-Modal Learning Toward Foundation Modeling☆15Oct 9, 2023Updated 2 years ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated last year
- ☆13Dec 17, 2021Updated 4 years ago
- German Language Understanding Evaluation Benchmark @NAACL24☆22Dec 11, 2025Updated 2 months ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆58Aug 6, 2025Updated 6 months ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL)☆16Jul 27, 2024Updated last year
- Analyzing mBERT's multilinguality in a small laboratory setting☆13Jun 12, 2023Updated 2 years ago
- Repo for Turkish Wiki NER dataset.☆11Jul 11, 2023Updated 2 years ago
- ☆15Jun 14, 2024Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Updated this week
- A byte-level decoder architecture that matches the performance of tokenized Transformers.☆67Apr 24, 2024Updated last year