xf-zhao / LoTLinks
Official implementation of LoT paper: "Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic"
☆25Updated last year
Alternatives and similar repositories for LoT
Users that are interested in LoT are comparing it to the libraries listed below
Sorting:
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>☆49Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆82Updated 2 weeks ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆59Updated 4 months ago
- ☆45Updated last year
- About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…☆70Updated last year
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆36Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆65Updated last year
- Discriminator-Guided Chain-of-Thought Reasoning☆47Updated 7 months ago
- ☆32Updated 3 months ago
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆27Updated last month
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆89Updated last week
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"☆20Updated last month
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 3 months ago
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆14Updated last week
- Codebase for Inference-Time Policy Adapters☆23Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- ☆42Updated 2 months ago
- ☆16Updated 2 months ago
- ☆13Updated 9 months ago
- ☆47Updated 5 months ago
- Repository for Skill Set Optimization☆13Updated 10 months ago
- ☆40Updated 6 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆142Updated 7 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆57Updated last year
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆55Updated 11 months ago
- ☆69Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆53Updated 9 months ago