Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding
☆218Jan 12, 2026Updated 5 months ago
Alternatives and similar repositories for DroPE
Users that are interested in DroPE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Large language models to diffusion finetuning code☆26Jun 2, 2025Updated last year
- Sound field reconstruction using neural processes with dynamic kernels☆16Mar 25, 2025Updated last year
- MatFormer repo☆76Dec 9, 2024Updated last year
- Speaker adaptive forced alignment (phonetic segmentation) using Wav2Vec2☆23May 7, 2026Updated last month
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)☆74May 13, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for "What really matters in matrix-whitening optimizers?"☆24Oct 31, 2025Updated 7 months ago
- ☆21Jun 12, 2025Updated last year
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆57Feb 20, 2025Updated last year
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆19Mar 1, 2021Updated 5 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 3 years ago
- Minimalistic Google Docs based workflow for Distill.pub☆10Jun 14, 2023Updated 3 years ago
- ☆27May 12, 2026Updated last month
- Giving Up Control: Neurons as Reinforcement Learning Agents☆13May 6, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆24Dec 11, 2024Updated last year
- Open Character Training☆88Apr 4, 2026Updated 2 months ago
- Model souping for LLMs☆73Nov 18, 2025Updated 6 months ago
- Official Implementation of "Maximum Likelihood Reinforcement Learning (MaxRL)"☆188May 28, 2026Updated 2 weeks ago
- Data and code for understanding and generation of Kamon.☆35Mar 10, 2026Updated 3 months ago
- 为 RWKV 设计的「Deep Think」实现。☆27Dec 7, 2025Updated 6 months ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆21Nov 3, 2025Updated 7 months ago
- [ICLR 2026] GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805)☆95May 13, 2026Updated last month
- Code and training scripts for FlexOlmo☆150Apr 20, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official repository Flash Local Linear Attention☆36May 28, 2026Updated 2 weeks ago
- ☆35Apr 12, 2024Updated 2 years ago
- ☆17Apr 25, 2023Updated 3 years ago
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆29Nov 20, 2024Updated last year
- OpenFLAM: Framewise Language Audio Model☆108Jun 4, 2026Updated last week
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)☆18Jun 18, 2024Updated last year
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 8 months ago
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆93Sep 12, 2025Updated 9 months ago
- ☆69Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"☆46May 13, 2026Updated last month
- 💥 Make peer-2-peer global works☆53Jan 29, 2026Updated 4 months ago
- An MCP tool server that provides stateful, TUI-compatible terminal sessions.☆15Feb 3, 2025Updated last year
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆26Jun 9, 2021Updated 5 years ago
- Official repository for the paper "Automating Continual Learning"☆20Jun 11, 2025Updated last year
- A frontend for your PDS☆25Oct 20, 2025Updated 7 months ago
- ☆16Apr 2, 2025Updated last year