kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆70Updated last month
Related projects: ⓘ
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆30Updated last month
- ☆40Updated 4 months ago
- ☆29Updated 2 weeks ago
- ☆68Updated 2 months ago
- ☆74Updated 9 months ago
- A repository for research on medium sized language models.☆71Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap☆74Updated last month
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆37Updated 3 months ago
- ☆50Updated last month
- ☆62Updated 5 months ago
- ☆91Updated last month
- Attribute (or cite) statements generated by LLMs back to in-context information.☆107Updated 2 weeks ago
- Flow of Reasoning: Efficient Training of LLM Policy with Diverse Thinking☆25Updated this week
- Small, simple agent task environments for training and evaluation☆13Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆39Updated 2 weeks ago
- ☆77Updated 3 weeks ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆57Updated last week
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆73Updated 2 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆55Updated last week
- Evaluation of neuro-symbolic engines☆29Updated last month
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆81Updated last month
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆86Updated 3 months ago
- Can Language Models Solve Olympiad Programming?☆92Updated last month
- ☆87Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper.☆65Updated 2 months ago
- ☆68Updated last month
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆76Updated 6 months ago
- Code for the paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆140Updated 2 months ago
- The official implementation of Self-Exploring Language Models (SELM)☆55Updated 3 months ago
- 📜 [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswa…☆36Updated 10 months ago