☆54 Mar 6, 2025 Updated last year
Alternatives and similar repositories for Reasoning-Self-Evolution-Survey
Users that are interested in Reasoning-Self-Evolution-Survey are comparing it to the libraries listed below.
- ☆52 Feb 12, 2025 Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆55 Nov 29, 2024 Updated last year
- Papers of Implicit Reasoning in LLMs. ☆25 Mar 13, 2025 Updated last year
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers" ☆19 Mar 11, 2024 Updated 2 years ago
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025) ☆37 Jul 16, 2025 Updated 10 months ago
- Latest Advances on Long Chain-of-Thought Reasoning ☆634 Jul 18, 2025 Updated 10 months ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge" ☆17 Mar 2, 2026 Updated 2 months ago
- ☆20 Nov 3, 2024 Updated last year
- ☆20 May 28, 2025 Updated last year
- Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1 ☆14 Mar 27, 2024 Updated 2 years ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆67 Feb 13, 2023 Updated 3 years ago
- ☆36 Jul 8, 2025 Updated 10 months ago
- Paper List of Inference/Test Time Scaling/Computing ☆381 May 17, 2026 Updated last week
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models ☆16 May 4, 2024 Updated 2 years ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆113 Sep 28, 2024 Updated last year
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents ☆34 Nov 24, 2025 Updated 6 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering ☆212 Apr 22, 2025 Updated last year
- [NeurIPS 25] The official implementation of SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning ☆27 Sep 21, 2025 Updated 8 months ago
- The repository of CLEME (EMNLP 2023) and CLEME2.0 (ACL 2025) ☆12 May 17, 2025 Updated last year
- ☆11 Dec 20, 2024 Updated last year
- This repo lists some research and applications in PU learning. ☆12 Mar 12, 2020 Updated 6 years ago
- A Sober Look at Language Model Reasoning ☆94 Nov 18, 2025 Updated 6 months ago
- ☆55 Apr 7, 2026 Updated last month
- ☆49 Apr 11, 2025 Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale" ☆27 Mar 30, 2023 Updated 3 years ago
- Latest Advances on System-2 Reasoning ☆1,352 Jun 8, 2025 Updated 11 months ago
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023 ☆12 Sep 21, 2023 Updated 2 years ago
- Collection of papers and repos for multimodal chain-of-thought ☆89 Nov 6, 2024 Updated last year
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models ☆25 Sep 26, 2024 Updated last year
- ☆29 Mar 17, 2025 Updated last year
- The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction" ☆12 Nov 25, 2023 Updated 2 years ago
- A controlled benchmark on evaluating and studying the dynamics of Long Context Language Models ☆26 Oct 17, 2025 Updated 7 months ago
- ☆554 Jan 2, 2025 Updated last year
- ☆18 Sep 3, 2024 Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 Feb 23, 2024 Updated 2 years ago
- ☆17 Jun 4, 2024 Updated last year
- ☆21 Dec 14, 2024 Updated last year
- ☆20 Jan 16, 2024 Updated 2 years ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large … ☆106 Dec 24, 2024 Updated last year