QwenLM / Self-Lengthen
☆77Updated 2 months ago
Alternatives and similar repositories for Self-Lengthen:
Users that are interested in Self-Lengthen are comparing it to the libraries listed below
- Reformatted Alignment☆113Updated 3 months ago
- Code implementation of synthetic continued pretraining☆79Updated 2 weeks ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated 10 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆88Updated 3 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆42Updated 6 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆58Updated 2 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆108Updated 2 months ago
- This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"☆90Updated last month
- The demo, code and data of FollowRAG☆68Updated last month
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning"☆97Updated 6 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆107Updated 2 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆147Updated last month
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆20Updated 3 months ago
- ☆48Updated 10 months ago
- The official repository of the Omni-MATH benchmark.☆67Updated 3 weeks ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆118Updated 5 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆106Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆129Updated 2 months ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆93Updated 3 weeks ago
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models☆84Updated 9 months ago
- The code and data of DPA-RAG☆54Updated 3 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated 11 months ago
- ☆69Updated this week
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆172Updated 9 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆20Updated this week
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆126Updated 2 months ago
- Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆68Updated last month
- Large Language Models Can Self-Improve in Long-context Reasoning☆61Updated last month
- ☆98Updated last month