Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024
☆22 Jun 26, 2024 Updated last year
Alternatives and similar repositories for BiPE
Users that are interested in BiPE are comparing it to the libraries listed below.
- ☆16 Jun 24, 2024 Updated last year
- LongAttn: Selecting Long-context Training Data via Token-level Attention ☆15 Jul 16, 2025 Updated 8 months ago
- FLOPS counter for all your GPU benchmarking needs ☆13 Aug 8, 2024 Updated last year
- ☆39 Nov 13, 2025 Updated 4 months ago
- ☆18 Apr 30, 2024 Updated last year
- ☆17 Jun 3, 2024 Updated last year
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution ☆103 Sep 24, 2025 Updated 6 months ago
- A summary of MoE experimental setups across a number of different papers. ☆16 Feb 16, 2023 Updated 3 years ago
- Repository of IPBench ☆19 Jan 4, 2026 Updated 2 months ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" ☆79 Nov 25, 2024 Updated last year
- [SIGIR 2024] This is the official PyTorch implementation for the paper: "EulerFormer: Sequential User Behavior Modeling with Complex Vect… ☆17 Oct 5, 2024 Updated last year
- ☆13 Apr 2, 2024 Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"