DAMO-NLP-SG / contrastive-cotLinks

Contrastive Chain-of-Thought Prompting

☆68

Alternatives and similar repositories for contrastive-cot

Users that are interested in contrastive-cot are comparing it to the libraries listed below

Sorting:

ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆51Updated 6 months ago
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58Updated last year
kaistAI / Janus
[NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
☆52Updated 3 months ago
RUCAIBox / BAMBOO
☆35Updated last year
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆149Updated last year
Alsace08 / SumCoT
[ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"
☆53Updated last year
shizhediao / R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…
☆126Updated last year
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54Updated last year
qinyiwei / InfoBench
☆57Updated last year
csitfun / LogiCoT
the instructions and demonstrations for building a formal logical reasoning capable GLM
☆55Updated last year
chujiezheng / LLM-Extrapolation
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆76Updated 6 months ago
princeton-nlp / LLMBar
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
☆134Updated last year
WHGTyen / BIG-Bench-Mistake
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆84Updated last year
liyucheng09 / Contamination_Detector
Lightweight tool to identify Data Contamination in LLMs evaluation
☆52Updated last year
BeastyZ / LLM-Verified-Retrieval
Repo for Llatrieval
☆31Updated last year
HillZhang1999 / ICD
Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"
☆69Updated last year
FreedomIntelligence / OVM
☆68Updated last year
LuLuLuyi / LongHeads
[EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor
☆31Updated last year
WadeYin9712 / Dynosaur
Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)
☆64Updated 2 years ago
SparkJiao / dpo-trajectory-reasoning
[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
☆82Updated 10 months ago
zjunlp / TRICE
[NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback
☆42Updated last year
starrYYxuan / LeCo
This the implementation of LeCo
☆31Updated 10 months ago
lifan-yuan / CRAFT
Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"
☆60Updated last year
chaochun / nlu-asdiv-dataset
☆50Updated 2 years ago
abhika-m / FAVA
☆75Updated last year
qhjqhj00 / WebBrain
☆68Updated 2 years ago
Junjie-Ye / ToolEyes
[COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
☆71Updated 6 months ago
Re-Align / just-eval
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
☆89Updated last year
KwanWaiChung / MT-Eval
Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models"
☆49Updated 3 weeks ago
yifanzhang-pro / AutoMathText
[ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (As Huggingface Daily Papers: …
☆88Updated 2 weeks ago