soochan-lee / RoT
Official code for the ACL 2023 (short, Findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with Language Models"
☆42Updated last year
Related projects
Alternatives and complementary repositories for RoT
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 9 months ago
- An experiment to see if ChatGPT can improve the output of the Stanford Alpaca dataset☆12Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆87Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Updated 8 months ago
- ☆72Updated last year
- ☆20Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated last year
- Multi-Domain Expert Learning☆67Updated 9 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated last week
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- This is the official repository for Inheritune.☆105Updated last month
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆37Updated last month
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆74Updated 10 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆41Updated 10 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- Spherical merge of PyTorch/HF-format language models with minimal feature loss.☆112Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆41Updated last month
- Evaluating LLMs with CommonGen-Lite☆85Updated 8 months ago
- ☆41Updated 2 weeks ago
- ☆112Updated last month
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆115Updated 10 months ago
- ☆41Updated 3 weeks ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆129Updated this week
- Experiments with generating open-source language model assistants☆97Updated last year
- Code repository for the c-BTM paper☆105Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆39Updated 10 months ago
- ☆42Updated 4 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆80Updated 2 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆101Updated 3 months ago