Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding
☆207Jan 12, 2026Updated last month
Alternatives and similar repositories for DroPE
Users that are interested in DroPE are comparing it to the libraries listed below
Sorting:
- MatFormer repo☆72Dec 9, 2024Updated last year
- ☆16Jun 12, 2025Updated 8 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Jan 30, 2026Updated last month
- decontamination☆26Dec 3, 2025Updated 3 months ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- ☆15Nov 10, 2025Updated 3 months ago
- ☆13Jun 8, 2024Updated last year
- Sound field reconstruction using neural processes with dynamic kernels☆15Mar 25, 2025Updated 11 months ago
- Speaker adaptive forced alignment (phonetic segmentation) using Wav2Vec2☆23Updated this week
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- 44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。☆16May 21, 2023Updated 2 years ago
- ☆16Dec 18, 2023Updated 2 years ago
- Convert Korean to Katakana☆13Dec 13, 2023Updated 2 years ago
- AI based singing voice synthesis☆37Jun 10, 2024Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- 💥 Make peer-2-peer global works☆47Jan 29, 2026Updated last month
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆49Updated this week
- ☆19Mar 25, 2025Updated 11 months ago
- ☆24Dec 11, 2024Updated last year
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- Googleの音声復元モデルMiipher-2の再現実装の学習および推論コード。学習済みモデルも公開しています。☆31Feb 7, 2026Updated last month
- ☆49Sep 26, 2025Updated 5 months ago
- ☆27Nov 25, 2025Updated 3 months ago
- CycleQD is a framework for parameter space model merging.☆48Feb 1, 2025Updated last year
- [ICLR 2026] GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805)☆79Jan 27, 2026Updated last month
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆55Feb 20, 2025Updated last year
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- ☆43Aug 5, 2025Updated 7 months ago
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆88Sep 12, 2025Updated 5 months ago
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆23Apr 24, 2024Updated last year
- Python library to use Pleias-RAG models☆68May 1, 2025Updated 10 months ago
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch☆20Mar 2, 2024Updated 2 years ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆110Mar 7, 2025Updated last year
- A Japanese G2P tool based on pyopenjtalk☆25Aug 6, 2022Updated 3 years ago
- Official implementation of Log-linear Sparse Attention (LLSA).☆58Feb 2, 2026Updated last month
- RVCで音声学習をするための便利スクリプト集☆26Apr 8, 2023Updated 2 years ago
- research impl of Native Sparse Attention (2502.11089)☆63Feb 19, 2025Updated last year
- ☆49Jul 22, 2024Updated last year
- Precise Anime face detection☆24May 26, 2024Updated last year