[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models
☆59 · Jul 23, 2024 · Updated last year
Alternatives and similar repositories for ProLong
Users who are interested in ProLong are comparing it to the libraries listed below
- (ACL 2025) 🔥🔥🔥 Code for "Empowering Multimodal Large Language Models with Evol-Instruct" ☆20 · May 15, 2025 · Updated 9 months ago
- LongAttn: Selecting Long-context Training Data via Token-level Attention ☆15 · Jul 16, 2025 · Updated 7 months ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception. ☆48 · Jul 1, 2025 · Updated 7 months ago
- [COLING 2024 (Oral)] PromISe: Releasing the Capabilities of LLMs with Prompt Introspective Search ☆23 · Aug 26, 2024 · Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models. ☆10 · May 16, 2024 · Updated last year
- ☆48 · Nov 25, 2024 · Updated last year
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models ☆41 · Sep 30, 2024 · Updated last year
- ☆31 · Sep 12, 2025 · Updated 5 months ago
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents ☆48 · Feb 2, 2026 · Updated 3 weeks ago
- ☆62 · Oct 29, 2024 · Updated last year
- ☆18 · Oct 14, 2024 · Updated last year
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision ☆18 · Apr 1, 2025 · Updated 10 months ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling ☆22 · Dec 16, 2024 · Updated last year
- Codebase for Instruction Following without Instruction Tuning ☆36 · Sep 24, 2024 · Updated last year
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆31 · Apr 8, 2024 · Updated last year
- ☆38 · Nov 13, 2025 · Updated 3 months ago
- Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness" ☆21 · Feb 17, 2025 · Updated last year
- Repository of IPBench ☆19 · Jan 4, 2026 · Updated last month
- Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration ☆29 · Nov 22, 2025 · Updated 3 months ago
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving ☆24 · Aug 25, 2025 · Updated 6 months ago
- Aioli: A unified optimization framework for language model data mixing ☆32 · Jan 17, 2025 · Updated last year
- This is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation" ☆41 · Oct 11, 2024 · Updated last year
- ☆11 · Aug 20, 2025 · Updated 6 months ago
- ☆28 · Oct 28, 2024 · Updated last year
- (NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align… ☆127 · Nov 8, 2025 · Updated 3 months ago
- ☆14 · Jan 24, 2025 · Updated last year
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech" ☆13 · Jul 27, 2023 · Updated 2 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability ☆13 · Jun 15, 2024 · Updated last year
- ☆47 · Oct 2, 2025 · Updated 4 months ago
- Official repository of the AAAI'2022 paper "GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning… ☆109 · Jul 15, 2022 · Updated 3 years ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA ☆147 · Dec 22, 2025 · Updated 2 months ago
- LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models ☆79 · Oct 16, 2024 · Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning ☆31 · Jan 25, 2026 · Updated last month
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection ☆55 · Oct 29, 2024 · Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆91 · Feb 14, 2025 · Updated last year
- Dataset Reset Policy Optimization ☆31 · Apr 12, 2024 · Updated last year
- Source code of paper "KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing" ☆31 · Oct 24, 2024 · Updated last year
- ☆15 · Oct 20, 2023 · Updated 2 years ago
- ☆21 · Jul 21, 2025 · Updated 7 months ago