menloresearch / verifiers-deepresearchLinks
Verifiers for LLM Reinforcement Learning
☆75Updated 2 weeks ago
Alternatives and similar repositories for verifiers-deepresearch
Users that are interested in verifiers-deepresearch are comparing it to the libraries listed below
Sorting:
- ☆155Updated 4 months ago
- ☆167Updated 5 months ago
- ☆66Updated 3 months ago
- An OpenSource Deep Research library with reasoning☆153Updated 3 weeks ago
- ☆180Updated 6 months ago
- ☆290Updated 2 weeks ago
- ☆136Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models☆266Updated last month
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆73Updated 5 months ago
- A user interface for DSPy☆172Updated 3 months ago
- Agentic RAG to help you build a startup🚀☆54Updated 4 months ago
- ☆102Updated 2 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated 7 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆466Updated 3 weeks ago
- Metadspy: The framework for specifying—not programming—language models☆88Updated 2 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 4 months ago
- The State Of The Art, intelligence☆151Updated 2 weeks ago
- An Automatic Prompt Optimization Framework for Large Language Models☆100Updated 3 weeks ago
- ☆79Updated 2 weeks ago
- Train Large Language Models on MLX.☆146Updated 3 weeks ago
- ☆130Updated 5 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆133Updated 2 months ago
- Prompt design in Python☆62Updated 8 months ago
- Context Engineering Course with DSPy☆164Updated 3 weeks ago
- Routing on Random Forest (RoRF)☆195Updated 11 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated 7 months ago
- ☆74Updated 6 months ago
- Turn topics, links, and files into AI-generated research notebooks — summarize, explore, and ask anything.☆133Updated 2 months ago
- A framework for optimizing DSPy programs with RL☆150Updated this week
- ☆89Updated 7 months ago