☆25Dec 13, 2024Updated last year
Alternatives and similar repositories for Evolution-Analysis
Users that are interested in Evolution-Analysis are comparing it to the libraries listed below
Sorting:
- ☆16Feb 22, 2025Updated last year
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆73May 25, 2025Updated 9 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 10 months ago
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…☆13Nov 11, 2024Updated last year
- ☆12Mar 7, 2024Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 4 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆27Mar 1, 2025Updated last year
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated 10 months ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- ☆46Oct 28, 2025Updated 4 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆52Jul 3, 2024Updated last year
- Exploration of automated dataset selection approaches at large scales.☆52Mar 4, 2025Updated 11 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- ☆24Apr 3, 2025Updated 10 months ago
- Official Repository for Task-Circuit Quantization☆24Jun 1, 2025Updated 9 months ago
- ☆17Aug 1, 2025Updated 7 months ago
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- Implementation and datasets for "Training Language Models to Generate Quality Code with Program Analysis Feedback"☆41Jul 21, 2025Updated 7 months ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆50Sep 4, 2025Updated 5 months ago
- Unified MultiWOZ evaluation scripts for the context-to-response task.☆59Oct 11, 2023Updated 2 years ago
- [AAAI'26, Oral] Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learni…☆43Jul 16, 2025Updated 7 months ago
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)☆35Jul 16, 2025Updated 7 months ago
- The official implementation of Self-Exploring Language Models (SELM)☆63Jun 4, 2024Updated last year
- ☆47Aug 5, 2025Updated 6 months ago
- ☆28Nov 10, 2025Updated 3 months ago
- [EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge☆29Oct 30, 2023Updated 2 years ago
- ☆34Feb 6, 2026Updated 3 weeks ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 2 months ago
- 3D Gaussian Splatting for underwater scene reconstruction via physcial-based appearance-medium decoupling☆23Feb 13, 2026Updated 2 weeks ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆33May 1, 2025Updated 10 months ago
- ☆29Dec 16, 2025Updated 2 months ago
- The demo, code and data of FollowRAG☆75Jun 30, 2025Updated 8 months ago
- Library to facilitate pruning of LLMs based on context☆32Jan 31, 2024Updated 2 years ago
- PLATO dialog model with pre-trained parameters in pytorch version☆29May 20, 2022Updated 3 years ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- ☆35May 16, 2025Updated 9 months ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆42Apr 22, 2025Updated 10 months ago
- ☆39May 2, 2024Updated last year