NormXU/Consistent-DynamicNTKRoPE

An Experiment on Dynamic NTK Scaling RoPE

☆65

Alternatives and similar repositories for Consistent-DynamicNTKRoPE

Users who are interested in Consistent-DynamicNTKRoPE are comparing it to the repositories listed below.

keezen / ntk_alibi
View on GitHub
NTK scaled version of ALiBi position encoding in Transformer.
☆69 · Aug 16, 2023 · Updated 2 years ago
bigai-nlco / CREAM
View on GitHub
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆22 · Oct 10, 2024 · Updated last year
jquesnelle / yarn
View on GitHub
YaRN: Efficient Context Window Extension of Large Language Models
☆1,739 · Apr 17, 2024 · Updated 2 years ago
dwzhu-pku / PoSE
View on GitHub
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆208 · May 20, 2024 · Updated 2 years ago
sheryc / resonance_rope
View on GitHub
[ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.
☆24 · Mar 5, 2024 · Updated 2 years ago
bojone / rerope
View on GitHub
Rectified Rotary Position Embeddings
☆394 · May 20, 2024 · Updated 2 years ago
tongmeihan1995 / DocEE
View on GitHub
DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction
☆42 · Apr 19, 2023 · Updated 3 years ago
bdusell / stack-attention
View on GitHub
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18 · Mar 15, 2024 · Updated 2 years ago
janphilippfranken / sami
View on GitHub
Self-Supervised Alignment with Mutual Information
☆20 · May 24, 2024 · Updated 2 years ago
zexuanqiu / CLongEval
View on GitHub
CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models
☆49 · Mar 7, 2024 · Updated 2 years ago
RenzeLou / Muffin
View on GitHub
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
☆16 · Oct 31, 2024 · Updated last year
ArthurLeoM / peft-givens
View on GitHub
source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib
☆16 · Mar 13, 2025 · Updated last year
M3-IT / YING-VLM
View on GitHub
Vision Large Language Models trained on M3IT instruction tuning dataset
☆17 · Aug 16, 2023 · Updated 2 years ago
Leooyii / LCEG
View on GitHub
[COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs
☆65 · Mar 9, 2026 · Updated 4 months ago
GATECH-EIC / Linearized-LLM
View on GitHub
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
☆35 · Jun 12, 2024 · Updated 2 years ago
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20 · Oct 9, 2022 · Updated 3 years ago
tencent-ailab / OASum
View on GitHub
☆15 · Oct 20, 2023 · Updated 2 years ago
tml-epfl / icl-alignment
View on GitHub
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆33 · Jan 23, 2025 · Updated last year
chtmp223 / suri
View on GitHub
Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]
☆27 · Oct 3, 2025 · Updated 9 months ago
October2001 / ProLong
View on GitHub
[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models
☆61 · Jul 23, 2024 · Updated 2 years ago
seanzhang-zhichen / baichuan-Dynamic-NTK-ALiBi
View on GitHub
Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning
☆49 · Aug 27, 2023 · Updated 2 years ago
chenllliang / ATP-AMR
View on GitHub
Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022
☆15 · Mar 31, 2023 · Updated 3 years ago
ThisIsHwang / EXIT
View on GitHub
Official code and resources for the paper "EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation."
☆25 · Jul 15, 2026 · Updated last week
yangjianxin1 / LongQLoRA
View on GitHub
LongQLoRA: Extent Context Length of LLMs Efficiently
☆170 · Nov 12, 2023 · Updated 2 years ago
liyucheng09 / llm-compressive
View on GitHub
Longitudinal Evaluation of LLMs via Data Compression
☆32 · May 29, 2024 · Updated 2 years ago
HKUNLP / ChunkLlama
View on GitHub
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆450 · Oct 16, 2024 · Updated last year
IronBeliever / CaR
View on GitHub
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
☆91 · Nov 13, 2024 · Updated last year
THUDM / LongReward
View on GitHub
☆63 · Oct 29, 2024 · Updated last year
hugcis / evolving-structures-in-complex-systems
View on GitHub
Dataset and code to reproduce the results of the paper "Evolving Structures in Complex Systems"
☆11 · Dec 16, 2019 · Updated 6 years ago
W4ngatang / qags
View on GitHub
Question Answering and Generation for Summarization
☆72 · Nov 27, 2022 · Updated 3 years ago
xphung / plan9_webasm
View on GitHub
WebAssembly port of Plan9 (fourth edition) libraries, device drivers, file systems and Inferno kernel
☆20 · Jan 30, 2023 · Updated 3 years ago
tianyi-lab / Cherry_LLM
View on GitHub
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆416 · Jun 25, 2025 · Updated last year
huggingface / gaia
View on GitHub
Hugging Face and Pyserini interoperability
☆20 · May 18, 2023 · Updated 3 years ago
NLP-Core-Team / mmlu_ru
View on GitHub
MMLU eval for RU/EN
☆16 · Jul 31, 2023 · Updated 2 years ago
Zce1112zslx / IKE
View on GitHub
☆41 · Nov 30, 2023 · Updated 2 years ago
leezythu / FocusLLM
View on GitHub
FocusLLM: Scaling LLM’s Context by Parallel Decoding
☆45 · Dec 8, 2024 · Updated last year
RyokoAI / BigKnow2022
View on GitHub
BigKnow2022: Bringing Language Models Up to Speed
☆16 · Mar 27, 2023 · Updated 3 years ago
Zoeyyao27 / SirLLM
View on GitHub
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆60 · May 28, 2024 · Updated 2 years ago
OpenNLPLab / lightning-attention
View on GitHub
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
☆344 · Feb 23, 2025 · Updated last year