soochan-lee / RoT
Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with Language Models"
☆45 · Updated 2 years ago
Alternatives and similar repositories for RoT
Users who are interested in RoT are comparing it to the repositories listed below
- Small and Efficient Mathematical Reasoning LLMs ☆73 · Updated 2 years ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆116 · Updated 7 months ago
- ☆74 · Updated 2 years ago
- Code repository for the c-BTM paper ☆108 · Updated 2 years ago
- Multi-Domain Expert Learning ☆67 · Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆143 · Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆104 · Updated 8 months ago
- 🍼 Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models (BabyLM Challenge) ☆17 · Updated last year
- ☆129 · Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ☆116 · Updated 2 years ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al… ☆111 · Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆112 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆79 · Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs ☆47 · Updated 2 years ago
- Retrieval Augmented Generation Generalized Evaluation Dataset ☆60 · Updated 6 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆45 · Updated 2 years ago
- ☆84 · Updated 2 years ago
- Patch for MPT-7B which allows using and training a LoRA ☆58 · Updated 2 years ago
- ☆78 · Updated 2 years ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind ☆66 · Updated 2 years ago
- ☆32 · Updated 2 years ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models" ☆40 · Updated last year
- A repository for transformer critique learning and generation ☆89 · Updated 2 years ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆66 · Updated 2 years ago
- ☆173 · Updated 2 years ago
- A set of utilities for running few-shot prompting experiments on large-language models ☆126 · Updated 2 years ago
- ☆95 · Updated 2 years ago
- ☆80 · Updated 10 months ago
- ☆21 · Updated 2 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆191 · Updated 6 months ago