soochan-lee / RoTLinks
Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with Language Models"
☆43Updated last year
Alternatives and similar repositories for RoT
Users that are interested in RoT are comparing it to the libraries listed below
Sorting:
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆116Updated 8 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- ☆120Updated 8 months ago
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 8 months ago
- ☆49Updated 6 months ago
- ☆72Updated last year
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Updated 2 years ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆25Updated 3 months ago
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆41Updated 6 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated 2 years ago
- Multi-Domain Expert Learning☆66Updated last year
- ☆44Updated 6 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆47Updated 6 months ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 8 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆123Updated last year
- ☆19Updated last year
- QLoRA with Enhanced Multi GPU Support☆37Updated last year
- Code repository for the c-BTM paper☆106Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- A repository for transformer critique learning and generation☆89Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆43Updated last year
- ☆34Updated 11 months ago
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)☆60Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆26Updated 5 months ago
- Evaluating LLMs with CommonGen-Lite☆90Updated last year
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆70Updated 2 years ago