[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models
☆60 · Jul 23, 2024 · Updated last year
Alternatives and similar repositories for ProLong
Users that are interested in ProLong are comparing it to the libraries listed below.
- (ACL 2025) 🔥🔥🔥 Code for "Empowering Multimodal Large Language Models with Evol-Instruct" ☆22 · May 15, 2025 · Updated last year
- LongAttn: Selecting Long-context Training Data via Token-level Attention ☆15 · Jul 16, 2025 · Updated 10 months ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models. ☆10 · May 16, 2024 · Updated 2 years ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception. ☆51 · Jul 1, 2025 · Updated 11 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models ☆41 · Sep 30, 2024 · Updated last year
- ☆31 · Sep 12, 2025 · Updated 8 months ago
- [COLING 2024 (Oral)] PromISe: Releasing the Capabilities of LLMs with Prompt Introspective Search ☆23 · Aug 26, 2024 · Updated last year
- ☆47 · Nov 25, 2024 · Updated last year
- [ACL 2026] Repository of IPBench ☆22 · Apr 6, 2026 · Updated 2 months ago
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving ☆25 · Apr 6, 2026 · Updated 2 months ago
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents ☆50 · Feb 2, 2026 · Updated 4 months ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision ☆19 · Apr 1, 2025 · Updated last year
- Official repository of the AAAI'2022 paper "GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning… ☆109 · Jul 15, 2022 · Updated 3 years ago
- Aioli: A unified optimization framework for language model data mixing ☆32 · Jan 17, 2025 · Updated last year
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆32 · Apr 8, 2024 · Updated 2 years ago
- A toolkit for modeling and simulation of cloud-native applications. ☆16 · Aug 4, 2025 · Updated 10 months ago
- Dataset and baseline for COLING 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech" ☆13 · Jul 27, 2023 · Updated 2 years ago
- ☆19 · Oct 14, 2024 · Updated last year
- ☆62 · Oct 29, 2024 · Updated last year
- This is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation" ☆41 · Oct 11, 2024 · Updated last year
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs ☆39 · Mar 9, 2025 · Updated last year
- (NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align… ☆140 · May 9, 2026 · Updated last month
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling ☆22 · Dec 16, 2024 · Updated last year
- Efficient retrieval head analysis with triton flash attention that supports topK probability ☆13 · Jun 15, 2024 · Updated last year
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA ☆153 · Dec 22, 2025 · Updated 5 months ago
- Codebase for Instruction Following without Instruction Tuning ☆36 · Sep 24, 2024 · Updated last year
- LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models ☆78 · Oct 16, 2024 · Updated last year
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark ☆404 · Jul 9, 2024 · Updated last year
- 📰 Must-read papers on KV Cache Compression (constantly updating 🤗). ☆713 · Apr 15, 2026 · Updated last month
- ☆11 · Aug 20, 2025 · Updated 9 months ago
- Open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality ☆240 · Aug 2, 2024 · Updated last year
- [EMNLP'24 (Main)] DRPO (Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-… ☆25 · Nov 17, 2024 · Updated last year
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection ☆55 · Oct 29, 2024 · Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆92 · Feb 14, 2025 · Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆23 · Aug 18, 2024 · Updated last year
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models ☆195 · Oct 8, 2024 · Updated last year
- DeFacto - Demonstrations and Feedback for improving factual consistency of text summarization ☆30 · Dec 19, 2022 · Updated 3 years ago
- [ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc… ☆32 · Apr 14, 2026 · Updated last month
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression." ☆18 · Dec 13, 2024 · Updated last year