FranxYao/FlanT5-CoT-Specialization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FranxYao/FlanT5-CoT-Specialization)

FranxYao / FlanT5-CoT-Specialization

Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.

☆132

Alternatives and similar repositories for FlanT5-CoT-Specialization

Users that are interested in FlanT5-CoT-Specialization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nlx-group / Shortcutted-Commonsense-Reasoning
View on GitHub
Code for the article "Shortcutted Commonsense: Data Spuriousness in Deep Learning of Commonsense Reasoning", Outstanding Paper at EMNLP20…
☆10Nov 7, 2021Updated 4 years ago
itsnamgyu / reasoning-teacher
View on GitHub
Large Language Models Are Reasoning Teachers (ACL 2023)
☆345Mar 7, 2025Updated last year
sunlab-osu / Understanding-CoT
View on GitHub
☆88Jun 1, 2023Updated 3 years ago
KyujinHan / Korean_selenium_DeepL
View on GitHub
DeepL을 통한 한국 번역 자동화 코드
☆12Jul 27, 2023Updated 2 years ago
HKUST-KnowComp / SubeventWriter
View on GitHub
Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere…
☆11Oct 16, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
FranxYao / chain-of-thought-hub
View on GitHub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
☆2,777Aug 4, 2024Updated last year
zjunlp / Prompt4ReasoningPapers
View on GitHub
[ACL 2023] Reasoning with Language Model Prompting: A Survey
☆1,009May 21, 2025Updated last year
ServiceNow / promptmix-emnlp-2023
View on GitHub
Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023
☆12Dec 13, 2023Updated 2 years ago
allenai / DecomP
View on GitHub
Repository for Decomposed Prompting
☆99Nov 15, 2023Updated 2 years ago
hkust-nlp / dart-math
View on GitHub
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
☆120Dec 10, 2024Updated last year
peterljq / Tutorial-of-Data-Distillation-and-Condensation
View on GitHub
A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but …
☆13Dec 1, 2022Updated 3 years ago
sangmichaelxie / doremi
View on GitHub
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
☆357Dec 26, 2023Updated 2 years ago
allenai / feb
View on GitHub
Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"
☆12Apr 27, 2022Updated 4 years ago
chaochun / nlu-asdiv-dataset
View on GitHub
☆52Jul 4, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xlang-ai / xlang-paper-reading
View on GitHub
Paper collection on building and evaluating language model agents via executable language grounding
☆364Apr 29, 2024Updated 2 years ago
fansunqi / AKeyS
View on GitHub
Agentic Keyframe Search for Video Question Answering
☆18Jun 30, 2026Updated 2 weeks ago
arkilpatel / SVAMP
View on GitHub
NAACL 2021: Are NLP Models really able to Solve Simple Math Word Problems?
☆142Jun 30, 2022Updated 4 years ago
aitsc / GLMKD
View on GitHub
Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method ; GKD: A General Knowledge Distillation…
☆34Aug 4, 2023Updated 2 years ago
FreedomIntelligence / OVM
View on GitHub
☆74Apr 2, 2024Updated 2 years ago
sail-sg / symbolic-instruction-tuning
View on GitHub
The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".
☆65Apr 18, 2023Updated 3 years ago
YJiangcm / Lion
View on GitHub
[EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models
☆210Feb 11, 2024Updated 2 years ago
jeffhj / LM-reasoning
View on GitHub
This repository contains a collection of papers and resources on Reasoning in Large Language Models.
☆572Nov 13, 2023Updated 2 years ago
hkust-nlp / deita
View on GitHub
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆599Dec 9, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ylsung / vl-merging
View on GitHub
PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"
☆37Oct 11, 2023Updated 2 years ago
princeton-nlp / continual-factoid-memorization
View on GitHub
Continual Memorization of Factoids in Large Language Models
☆12Nov 20, 2024Updated last year
bigai-nlco / LooGLE
View on GitHub
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
☆199Oct 8, 2024Updated last year
liutiedong / goat
View on GitHub
a Fine-tuned LLaMA that is Good at Arithmetic Tasks
☆178Sep 15, 2023Updated 2 years ago
GanjinZero / math401-llm
View on GitHub
Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?
☆57Apr 17, 2023Updated 3 years ago
HKUST-KnowComp / PseudoReasoner
View on GitHub
Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…
☆11Oct 18, 2022Updated 3 years ago
Xuekai-Zhu / key-configuration-of-llms
View on GitHub
☆22Mar 18, 2024Updated 2 years ago
JiaQiSJTU / FaithEval-FFLM
View on GitHub
A zero-shot faithfulness evaluation metric for text summarization
☆11Oct 17, 2023Updated 2 years ago
NJU-LINK / MVU-Eval
View on GitHub
MVU-Eval @NeurIPS DB 2025
☆18Nov 11, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
MadryLab / datamodels-data
View on GitHub
Data for "Datamodels: Predicting Predictions with Training Data"
☆97May 25, 2023Updated 3 years ago
txsun1997 / Metric-Fairness
View on GitHub
EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation
☆41Oct 19, 2022Updated 3 years ago
HKUNLP / ProGen
View on GitHub
[EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.
☆27Feb 4, 2023Updated 3 years ago
CogComp / APSI
View on GitHub
Code for EMNLP 2020 paper: Analogous Process Structure Induction for Sub-event Sequence Prediction
☆11Oct 19, 2020Updated 5 years ago
HKUST-KnowComp / AbsPyramid
View on GitHub
Official code repository for the paper: AbsPyramid: Benchmarking the Abstration Ability of Language Models with a Unified Entailment Grap…
☆13Oct 30, 2024Updated last year
rosewang2008 / zero-shot-teacher-feedback
View on GitHub
🏆 Ambassador Paper for Innovative Use of NLP for Building Educational Applications 2023: Is ChatGPT a Good Teacher Coach? Measuring Zero…
☆14Jul 21, 2024Updated last year
hccngu / DialCoT
View on GitHub
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
☆13Nov 2, 2023Updated 2 years ago