SeungoneKim/CoTEVer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SeungoneKim/CoTEVer)

SeungoneKim / CoTEVer

[EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification

☆42

Alternatives and similar repositories for CoTEVer

Users that are interested in CoTEVer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

joeljang / FLM
View on GitHub
All-in-one repository for Fine-tuning & Pretraining (Large) Language Models
☆15Mar 8, 2023Updated 3 years ago
kaistAI / KtrlF
View on GitHub
[NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"
☆23Oct 11, 2024Updated last year
kaistAI / GAP
View on GitHub
[ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization
☆29Sep 12, 2024Updated last year
SeungoneKim / SICK_Summarization
View on GitHub
[COLING 2022] Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization
☆25Mar 28, 2024Updated 2 years ago
soheeyang / unified-prompt-selection
View on GitHub
[TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis
☆11Nov 14, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
joeljang / ELM
View on GitHub
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99Apr 26, 2023Updated 3 years ago
convei-lab / BotsTalk
View on GitHub
🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…
☆16Oct 7, 2024Updated last year
shizhediao / automate-cot
View on GitHub
Source code for the paper "Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data"
☆20Feb 24, 2024Updated 2 years ago
joeljang / Pretraining_T5_custom_dataset
View on GitHub
Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints
☆38Mar 21, 2021Updated 5 years ago
prometheus-eval / cmu-paper-reviewer
View on GitHub
Code repository for the "CMU Paper Reviewer System", a agentic system that generates reviews for academic papers.
☆25Jun 9, 2026Updated last month
kaistAI / Knowledge-Entropy
View on GitHub
[ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
☆17Nov 25, 2024Updated last year
haven-jeon / KoGPT2-subtasks
View on GitHub
NSMC, KorSTS ... fine-tunings
☆18Feb 23, 2022Updated 4 years ago
amy-hyunji / Contextualized-Generative-Retrieval
View on GitHub
☆16Oct 6, 2022Updated 3 years ago
kyle8581 / DialogueCoT
View on GitHub
[EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)
☆11Nov 15, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
naver-ai / ALMoST
View on GitHub
☆24Dec 2, 2023Updated 2 years ago
guijinSON / MM-Eval
View on GitHub
Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"
☆20Oct 26, 2024Updated last year
thu-ml / LM-Calibration
View on GitHub
☆17May 31, 2023Updated 3 years ago
Silin159 / PeaCoK
View on GitHub
☆35Jan 7, 2026Updated 6 months ago
haven-jeon / KoBART-chatbot
View on GitHub
KoBART chatbot
☆45Jun 22, 2021Updated 5 years ago
MattYoon / reasoning-models-confidence
View on GitHub
[NeurIPS 2025] Reasoning Models Better Express Their Confidence"
☆23Nov 19, 2025Updated 8 months ago
seonghyeonye / TAPP
View on GitHub
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
☆79Sep 13, 2024Updated last year
unbiarirang / Fixed-Input-Parameterization
View on GitHub
This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"
☆32Sep 13, 2024Updated last year
facebookresearch / lss_eval
View on GitHub
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31Aug 25, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
abaheti95 / LoL-RL
View on GitHub
Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients
☆26Sep 10, 2024Updated last year
naver-ai / KoBBQ
View on GitHub
Official code and dataset repository of KoBBQ (TACL 2024)
☆19May 13, 2024Updated 2 years ago
zhao-zilong / ssc-cot
View on GitHub
Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"
☆12Nov 26, 2024Updated last year
seonghyeonye / Flipped-Learning
View on GitHub
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆117Jun 28, 2025Updated last year
PierreColombo / RankingNLPSystems
View on GitHub
What are the best Systems? New Perspectives on NLP Benchmarking
☆13Mar 16, 2023Updated 3 years ago
tangjialong / Event-Schema-Harvester
View on GitHub
☆32Jul 31, 2023Updated 2 years ago
sczzz3 / EHRDiff
View on GitHub
An offical implementation of EHRDiff [TMLR]
☆33Jun 25, 2024Updated 2 years ago
liujch1998 / rainier
View on GitHub
☆29Feb 17, 2024Updated 2 years ago
kaistAI / CoT-Collection
View on GitHub
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
☆258Oct 31, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
neulab / data-agora
View on GitHub
[ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"
☆40Dec 13, 2024Updated last year
SciPhi-AI / RAG-Performance
View on GitHub
Measuring RAG solutions throughput and latency
☆19Jul 23, 2024Updated last year
kaistAI / InstructIR
View on GitHub
IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32Jun 13, 2024Updated 2 years ago
interview-eval / interview-eval
View on GitHub
Interview-based evaluation of LLMs
☆30May 21, 2026Updated last month
joeljang / negated-prompts-for-llms
View on GitHub
[NeurIPS 2022 Workshop] A Case Study with Negated Prompts using T0 (3B, 11B), InstructGPT (350M-175B), GPT-3 (350M - 175B) & OPT (125M - …
☆24Sep 27, 2022Updated 3 years ago
joeljang / temporalwiki
View on GitHub
[EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models
☆75May 15, 2024Updated 2 years ago
lbox-kr / lbox-open
View on GitHub
☆108Apr 11, 2025Updated last year