TsinghuaC3I/Fourier-Position-Embedding



[ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization

☆119

Alternatives and similar repositories for Fourier-Position-Embedding

Users that are interested in Fourier-Position-Embedding are comparing it to the libraries listed below.


zhixuan-lin / forgetting-transformer
[ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning
☆150 · Feb 25, 2026 · Updated 5 months ago
chuanyang-Zheng / DAPE
This is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"
☆41 · Oct 11, 2024 · Updated last year
zigzagcai / varlen_mamba
Mamba SSM architecture that supports training on variable-length sequences
☆12 · Sep 1, 2025 · Updated 10 months ago
archinetai / aligner-pytorch
Sequence alignment methods with helpers for PyTorch.
☆24 · Nov 30, 2022 · Updated 3 years ago
liuhuang31 / Megatts2_HierSpeechpp
Megatts2 using HierSpeechpp's vocoder
☆18 · Dec 2, 2024 · Updated last year
TsinghuaC3I / Intuitive-Fine-Tuning
[ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
☆30 · Aug 2, 2024 · Updated last year
Ingrid725 / LaPE
☆19 · Mar 28, 2024 · Updated 2 years ago
Lyun0912-wu / LongAttn
LongAttn: Selecting Long-context Training Data via Token-level Attention
☆15 · Jul 16, 2025 · Updated last year
BaohaoLiao / ApiQ
[EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs
☆15 · Jul 18, 2024 · Updated 2 years ago
MingyuJ666 / Rope_with_LLM
[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…
☆87 · Jun 20, 2025 · Updated last year
zerolllin / Delta-L-Normalization
☆16 · Oct 11, 2025 · Updated 9 months ago
zhoujiahuan1991 / ICML2025-TCPA
☆23 · May 8, 2025 · Updated last year
hrlics / HoPE
[NeurIPS 2025] HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models
☆29 · Feb 19, 2026 · Updated 5 months ago
BryceZhuo / HybridNorm
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19 · Mar 7, 2025 · Updated last year
NingMiao / InteL-VAEs
Code for the paper "InteL-VAEs: Adding Inductive Biases to Variational Auto-Encoders via Intermediary Latents".
☆18 · Jun 25, 2021 · Updated 5 years ago
thunlp / SparsingLaw
The open-source materials for the paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".
☆32 · Nov 12, 2024 · Updated last year
Jackson-Kang / VQVC-Pytorch
An unofficial implementation of Vector Quantization Voice Conversion (VQVC).
☆29 · Apr 12, 2021 · Updated 5 years ago
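DragonAura / EE_DA_OJ
Reference solutions for the online judge (OJ) of the Data and Algorithms course, Department of Electronic Engineering, Tsinghua University, Fall 2022 semester
☆10 · Jun 18, 2023 · Updated 3 years ago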
Doraemonzzz / hgru2-pytorch
☆24 · Sep 25, 2024 · Updated last year
0417keito / UTAUTAI
UTAUTAI (Unrestricted Tune Automated Technology Artificial Interigence)
☆17 · Oct 27, 2023 · Updated 2 years ago
jshuadvd / LongRoPE
Implementation of the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"
☆154 · Jul 20, 2024 · Updated 2 years ago
7Xin / DPI-TTS
☆13 · Sep 12, 2024 · Updated last year
FrontierLabs / F5R-TTS
Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"
☆169 · Mar 3, 2026 · Updated 4 months ago
mush42 / istft-onnx
Export an ONNX graph that performs ISTFT. Designed for TTS models.
☆28 · Apr 23, 2024 · Updated 2 years ago
RobertCsordas / switchhead
☆16 · Jun 11, 2025 · Updated last year
kensho-technologies / pathpiece
PathPiece tokenizer
☆14 · Nov 10, 2024 · Updated last year
fla-org / flash-linear-attention
🚀 Efficient implementations for emerging model architectures
☆5,473 · Updated this week
liuhuadai / ViT-TTS
PyTorch Implementation of ViT-TTS (EMNLP'23)
☆11 · Oct 20, 2023 · Updated 2 years ago
AwesomeSeq / Comba-triton
☆47 · Jun 16, 2025 · Updated last year
philsyn / DiffWave-Vocoder
PyTorch reimplementation of DiffWave Vocoder: a high-quality, fast, and small neural vocoder.
☆90 · Apr 13, 2021 · Updated 5 years ago
gmongaras / Cottention_Transformer
Code for the paper "Cottention: Linear Transformers With Cosine Attention"
☆20 · Nov 15, 2025 · Updated 8 months ago
revsic / torch-nansy
Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513
☆64 · Feb 13, 2023 · Updated 3 years ago
DAMO-NLP-SG / CLEX
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
☆78 · Mar 12, 2024 · Updated 2 years ago
revsic / torch-diffusion-wavegan
Parallel waveform generation with DiffusionGAN
☆17 · Mar 26, 2022 · Updated 4 years ago
RobertCsordas / llm_effective_depth
Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"
☆29 · Jun 25, 2025 · Updated last year
Scarfmonster / HiFiPLN
Multispeaker Community Vocoder Model for DiffSinger
☆39 · Aug 11, 2025 · Updated 11 months ago
EvanZhuang / mixinputs
Official implementation for Text Generation Beyond Discrete Token Sampling
☆26 · Aug 11, 2025 · Updated 11 months ago
C0-Design / MemoryFormer
An implementation is provided here for the NeurIPS2024 paper "MemoryFormer: Minimize Transformer Computation by Removing Fully-Connected…
☆16 · Mar 24, 2026 · Updated 4 months ago
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆164 · Jul 8, 2025 · Updated last year