YuxiXie/SelfEval-Guided-Decoding

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YuxiXie/SelfEval-Guided-Decoding)

YuxiXie / SelfEval-Guided-Decoding

☆103

Alternatives and similar repositories for SelfEval-Guided-Decoding

Users that are interested in SelfEval-Guided-Decoding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

XuZhao0 / Model-Selection-Reasoning
View on GitHub
Model Selection with Large Language Models for Reasoning (EMNLP2023 Findings)
☆30Dec 23, 2023Updated 2 years ago
chang-github-00 / LLM-Predictive-Decoding
View on GitHub
☆16Jul 9, 2025Updated last year
WING-NUS / ELCo
View on GitHub
The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024
☆16May 11, 2024Updated 2 years ago
YuxiXie / MCTS-DPO
View on GitHub
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
☆331Jan 29, 2026Updated 5 months ago
XinyuanLu00 / QACheck
View on GitHub
About Data and Codes for EMNLP 2023 System Demo Paper "QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking"
☆19Dec 19, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
zwhe99 / LLM-MT-Eval
View on GitHub
{DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}
☆14Jun 18, 2023Updated 3 years ago
Yiwei98 / ESC
View on GitHub
☆14Jul 17, 2025Updated last year
maitrix-org / llm-reasoners
View on GitHub
A library for advanced large language model reasoning
☆2,341Jun 10, 2025Updated last year
declare-lab / SAT
View on GitHub
Code for the EMNLP 2022 Findings short paper "SAT: Improving Semi-Supervised Text Classification with Simple Instance-Adaptive Self-Train…
☆12Feb 25, 2023Updated 3 years ago
SiyuanWangw / ULogic
View on GitHub
☆23Aug 1, 2024Updated last year
qtli / GSM-Plus
View on GitHub
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆66Jul 8, 2024Updated 2 years ago
TIGER-AI-Lab / Program-of-Thoughts
View on GitHub
Data and Code for Program of Thoughts [TMLR 2023]
☆317May 15, 2024Updated 2 years ago
LZhengisme / self-infilling
View on GitHub
[ICML 2024] Self-Infilling Code Generation
☆18May 5, 2024Updated 2 years ago
moqingyan / dsr-lm
View on GitHub
☆13Jul 8, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Aatlantise / syntactic-augmentation-nli
View on GitHub
Create augmentation examples from MultiNLI by subject-object inversion and passivizing.
☆17Feb 22, 2021Updated 5 years ago
qishenghu / InstructCoder
View on GitHub
InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw
☆66Oct 4, 2024Updated last year
isle-dev / MetricEval
View on GitHub
MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…
☆12Nov 6, 2023Updated 2 years ago
BKHMSI / cultural-trends
View on GitHub
Investigating Cultural Alignment of Large Language Models
☆13Aug 14, 2024Updated last year
chchenhui / fabscore
View on GitHub
FabScore: Fine-Grained Evaluation of Fabrications in Automated AI Research
☆20Updated this week
Tiiiger / templm
View on GitHub
Code release for "TempLM: Distilling Language Models into Template-Based Generators"
☆14Jul 21, 2022Updated 4 years ago
allenai / openpi-dataset
View on GitHub
OpenPI dataset for tracking entities in open domain procedural text
☆24Aug 13, 2024Updated last year
xingyaoww / LeTI
View on GitHub
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆66Jun 29, 2023Updated 3 years ago
OFA-Sys / gsm8k-ScRel
View on GitHub
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
☆269Sep 12, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
veronica320 / Faithful-COT
View on GitHub
Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".
☆169May 7, 2024Updated 2 years ago
TianHongZXY / CoRe
View on GitHub
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)
☆51Dec 15, 2023Updated 2 years ago
Aman-4-Real / MMTG
View on GitHub
[ACM MM 2022] (Oral): Multi-Modal Experience Inspired AI Creation
☆21Nov 27, 2024Updated last year
jeffhj / LM-reasoning
View on GitHub
This repository contains a collection of papers and resources on Reasoning in Large Language Models.
☆572Nov 13, 2023Updated 2 years ago
cindermond / leap
View on GitHub
Implementation of the methods described in our paper "Explicit Planning Helps Language Models in Logical Reasoning"
☆23Apr 12, 2023Updated 3 years ago
princeton-nlp / WhatICLLearns
View on GitHub
[ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning
☆21Jul 9, 2023Updated 3 years ago
keven980716 / weak-to-strong-deception
View on GitHub
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15Jun 21, 2024Updated 2 years ago
hkust-nlp / dart-math
View on GitHub
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
☆120Dec 10, 2024Updated last year
wenhuchen / TheoremQA
View on GitHub
The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset
☆161Apr 23, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
abhinavkashyap / domadapter
View on GitHub
Domain Adaptation and Adapters
☆16Feb 28, 2023Updated 3 years ago
teacherpeterpan / Unsupervised-Multi-hop-QA
View on GitHub
Codes for NAACL 2021 Paper "Unsupervised Multi-hop Question Answering by Question Generation"
☆92Nov 16, 2022Updated 3 years ago
reasoning-machines / pal
View on GitHub
PaL: Program-Aided Language Models (ICML 2023)
☆525Jun 30, 2023Updated 3 years ago
lupantech / PromptPG
View on GitHub
Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".
☆165Dec 27, 2023Updated 2 years ago
Timothyxxx / Chain-of-ThoughtsPapers
View on GitHub
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
☆2,105Oct 5, 2023Updated 2 years ago
RUCAIBox / Erya
View on GitHub
☆19Oct 6, 2023Updated 2 years ago
duyngtr16061999 / KDMCSE
View on GitHub
☆10Apr 7, 2024Updated 2 years ago