soochan-lee / RoT
Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with Language Models"
☆43Updated last year
Alternatives and similar repositories for RoT:
Users that are interested in RoT are comparing it to the libraries listed below
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆113Updated 6 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆44Updated 3 months ago
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Updated last year
- Evaluating LLMs with CommonGen-Lite☆89Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated last year
- ☆73Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- ☆48Updated 4 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- ☆20Updated last year
- Camel-Coder: Collaborative task completion with multiple agents. Role-based prompts, intervention mechanism, and thoughtful suggestions☆33Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated last year
- ☆37Updated last year
- ☆74Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆52Updated 3 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- ☆119Updated 5 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆59Updated last year
- Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models☆17Updated 2 months ago
- ☆44Updated 4 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…☆28Updated last year
- ☆36Updated 2 years ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆67Updated last year
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆53Updated 9 months ago
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)☆54Updated 11 months ago