shizhediao / active-promptLinks

Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models"

☆243

Alternatives and similar repositories for active-prompt

Users that are interested in active-prompt are comparing it to the libraries listed below

Sorting:

night-chen / ToolQA
ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …
☆272Updated last year
AI21Labs / in-context-ralm
☆284Updated last year
veronica320 / Faithful-COT
Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".
☆161Updated last year
TIGER-AI-Lab / Program-of-Thoughts
Data and Code for Program of Thoughts [TMLR 2023]
☆279Updated last year
yueyu1030 / AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
☆150Updated last year
kaistAI / CoT-Collection
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
☆244Updated last year
bhargaviparanjape / language-programmes
☆172Updated 2 years ago
princeton-nlp / ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
☆490Updated 9 months ago
HITsz-TMG / awesome-llm-attributions
A Survey of Attributions for Large Language Models
☆205Updated 10 months ago
Leezekun / Directional-Stimulus-Prompting
[NeurIPS 2023] Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting"
☆111Updated 2 years ago
GAIR-NLP / auto-j
Generative Judge for Evaluating Alignment
☆244Updated last year
FreedomIntelligence / ReasoningNLP
paper list on reasoning in NLP
☆190Updated 3 months ago
freshllms / freshqa
Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214)
☆364Updated last week
facebookresearch / Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆219Updated last year
xlang-ai / Binder
[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"
☆319Updated last year
mingkaid / rl-prompt
Accompanying repo for the RLPrompt paper
☆334Updated last year
anchen1011 / FireAct
FireAct: Toward Language Agent Fine-tuning
☆280Updated last year
oriyor / reasoning-on-cots
Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"
☆96Updated last year
RUCAIBox / HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
☆488Updated last year
sambanova / toolbench
ToolBench, an evaluation suite for LLM tool manipulation capabilities.
☆154Updated last year
FranxYao / GPT-Bargaining
Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback
☆207Updated 2 years ago
Ber666 / ToolkenGPT
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)
☆262Updated last year
jinlanfu / GPTScore
Source Code of Paper "GPTScore: Evaluate as You Desire"
☆252Updated 2 years ago
kojima-takeshi188 / zero_shot_cot
Prod Env
☆423Updated last year
StonyBrookNLP / ircot
Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23
☆222Updated last year
raunak-agarwal / instruction-datasets
All available datasets for Instruction Tuning of Large Language Models
☆254Updated last year
zorazrw / filco
[Preprint] Learning to Filter Context for Retrieval-Augmented Generaton
☆192Updated last year
nelson-liu / lost-in-the-middle
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
☆351Updated last year
salesforce / BOLAA
☆183Updated 5 months ago
i-Eval / FairEval
☆139Updated last year