lz1oceani / verify_cot
☆129Updated last year
Alternatives and similar repositories for verify_cot:
Users who are interested in verify_cot are comparing it to the repositories listed below.
- Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts…☆185Updated 5 months ago
- ☆120Updated 7 months ago
- Self-Alignment with Principle-Following Reward Models☆150Updated 10 months ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning☆193Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆157Updated 8 months ago
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆274Updated last year
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs☆42Updated 6 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆134Updated last month
- ☆81Updated last year
- Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback☆204Updated last year
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- ☆171Updated last year
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆175Updated last month
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆93Updated 11 months ago
- A codebase for "Language Models can Solve Computer Tasks"☆230Updated 8 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆221Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆148Updated last year
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆153Updated last month
- Simple next-token-prediction for RLHF☆222Updated last year
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆203Updated 8 months ago
- ☆89Updated this week
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)☆250Updated 9 months ago
- PASTA: Post-hoc Attention Steering for LLMs☆109Updated last month
- ☆113Updated 2 months ago
- ☆137Updated 9 months ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆106Updated 8 months ago
- Code for paper "LEVER: Learning to Verify Language-to-Code Generation with Execution" (ICML'23)☆81Updated last year
- ☆77Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users☆206Updated 2 months ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆217Updated last year