sheryc/resonance_rope

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sheryc/resonance_rope)

sheryc / resonance_rope

[ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.

☆24

Alternatives and similar repositories for resonance_rope

Users that are interested in resonance_rope are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

codefuse-ai / Collinear-Constrained-Attention
View on GitHub
☆62Jun 17, 2024Updated 2 years ago
linkedin / ControlLLM
View on GitHub
Control LLM
☆23Apr 6, 2025Updated last year
zhiyuanhubj / LongRecipe
View on GitHub
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
☆79Oct 16, 2024Updated last year
kiaia / GIRAFFE
View on GitHub
Extending context length of visual language models
☆12Dec 18, 2024Updated last year
AntNLP / nope_head_scale
View on GitHub
☆29May 4, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dwzhu-pku / PoSE
View on GitHub
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆208May 20, 2024Updated 2 years ago
DAMO-NLP-SG / CLEX
View on GitHub
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
☆78Mar 12, 2024Updated 2 years ago
apple / ml-dataset-decomposition
View on GitHub
Official repo of dataset-decomposition paper [NeurIPS 2024]
☆21Jan 8, 2025Updated last year
MayDomine / Burst-Attention
View on GitHub
Distributed IO-aware Attention algorithm
☆24Sep 24, 2025Updated 10 months ago
alexnix300 / neural-render
View on GitHub
Upscale, enhance, and reimagine your renders with a single prompt using Stable Diffusion and FLUX.
☆14Aug 26, 2024Updated last year
NormXU / Consistent-DynamicNTKRoPE
View on GitHub
An Experiment on Dynamic NTK Scaling RoPE
☆65Nov 26, 2023Updated 2 years ago
EMNLP-2024-CritiCS / Collective-Critics-for-Creative-Story-Generation
View on GitHub
☆14Jan 10, 2025Updated last year
October2001 / ProLong
View on GitHub
[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models
☆61Jul 23, 2024Updated 2 years ago
icip-cas / SSO
View on GitHub
A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…
☆20Nov 21, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
theAdamColton / ijepa-enhanced
View on GitHub
recipe for training fully-featured self supervised image jepa models
☆14Jun 4, 2025Updated last year
aykutcayir34 / DifferentialTransformer
View on GitHub
☆13Oct 14, 2024Updated last year
TaiMingLu / know-dont-tell
View on GitHub
☆19Oct 14, 2024Updated last year
psunlpgroup / FoVer
View on GitHub
This repository includes code and materials for the paper "Efficient PRM Training Data Synthesis via Formal Verification" (ACL 2026 Findi…
☆19Apr 7, 2026Updated 3 months ago
MiuLab / FactAlign
View on GitHub
Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"
☆19Oct 3, 2024Updated last year
delcypher / nsolv
View on GitHub
Nsolv - A front-end that allows multiple SMTLIBv2 compliant solvers to executed in parallel.
☆11Dec 7, 2012Updated 13 years ago
kyegomez / SelfExtend
View on GitHub
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta
☆13Nov 11, 2024Updated last year
jzhang38 / EasyContext
View on GitHub
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆760Sep 27, 2024Updated last year
shoaibahmed / llm_depth_pruning
View on GitHub
Official implementation of the paper: "A deeper look at depth pruning of LLMs"
☆15Jul 24, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
princeton-nlp / ProLong
View on GitHub
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
☆261Sep 12, 2025Updated 10 months ago
WangWenhao0716 / PDF-Embedding
View on GitHub
[NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"
☆18Oct 1, 2024Updated last year
astramind-ai / Mixture-of-depths
View on GitHub
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
☆175Jun 20, 2024Updated 2 years ago
HKUNLP / ChunkLlama
View on GitHub
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆451Oct 16, 2024Updated last year
muellerzr / import-timer
View on GitHub
Pragmatic approach to parsing import profiles for CI's
☆12Jul 1, 2024Updated 2 years ago
bluvolve-dev / reactive-course-service-with-nextjs-ui-
View on GitHub
☆11Oct 15, 2020Updated 5 years ago
theAdamColton / vq-clip
View on GitHub
Train vector quantized CLIP models using pytorch lightning
☆21Jul 14, 2024Updated 2 years ago
leezythu / FocusLLM
View on GitHub
FocusLLM: Scaling LLM’s Context by Parallel Decoding
☆45Dec 8, 2024Updated last year
jquesnelle / yarn
View on GitHub
YaRN: Efficient Context Window Extension of Large Language Models
☆1,743Apr 17, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zexuanqiu / CLongEval
View on GitHub
CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models
☆49Mar 7, 2024Updated 2 years ago
caskcsg / TextSmoothing
View on GitHub
☆36Mar 15, 2022Updated 4 years ago
haasn / -g-pl
View on GitHub
/g/ programming language
☆13Nov 9, 2011Updated 14 years ago
RyanLiu112 / GenPRM
View on GitHub
[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
☆102Nov 8, 2025Updated 8 months ago
zjunlp / OneEdit
View on GitHub
OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.
☆20Oct 14, 2024Updated last year
liushulinle / MarsRL
View on GitHub
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
☆18Nov 18, 2025Updated 8 months ago
Agnishom / IOITC16
View on GitHub
Problems from IOITC'16 (India)
☆10Jan 12, 2022Updated 4 years ago