rohinmanvi / Capability-Aware_and_Mid-Generation_Self-Evaluations
☆18 · Updated last month
Related projects
Alternatives and complementary repositories for Capability-Aware_and_Mid-Generation_Self-Evaluations
- ☆41 · Updated 2 weeks ago
- A repository for research on medium sized language models. ☆74 · Updated 6 months ago
- ☆28 · Updated 5 months ago
- The official implementation of Self-Exploring Language Models (SELM) ☆55 · Updated 5 months ago
- ☆62 · Updated 3 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆21 · Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆46 · Updated 2 months ago
- ☆33 · Updated 6 months ago
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ☆39 · Updated last month
- ☆57 · Updated 2 weeks ago
- ☆18 · Updated 2 weeks ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆41 · Updated last month
- ☆35 · Updated 3 weeks ago
- ☆22 · Updated 2 months ago
- ☆37 · Updated this week
- ☆103 · Updated last month
- DPO, but faster 🚀 ☆23 · Updated 3 weeks ago
- This is the official repository for Inheritune. ☆105 · Updated last month
- ☆90 · Updated 4 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044 ☆30 · Updated last month
- ☆33 · Updated last month
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆39 · Updated last month
- ☆55 · Updated last month
- Codebase accompanying the Summary of a Haystack paper. ☆72 · Updated 2 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆75 · Updated last month
- "Improving Mathematical Reasoning with Process Supervision" by OpenAI ☆83 · Updated 2 weeks ago
- ☆20 · Updated 3 months ago
- ☆64 · Updated last month
- Replicating O1 inference-time scaling laws ☆51 · Updated last month
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models ☆57 · Updated 7 months ago