menloresearch / verifiers-deepresearch

Verifiers for LLM Reinforcement Learning

☆77

Alternatives and similar repositories for verifiers-deepresearch

Users who are interested in verifiers-deepresearch are comparing it to the repositories listed below.


SalesforceAIResearch / enterprise-deep-research
Salesforce Enterprise Deep Research
☆147 Updated this week
menloresearch / ReZero
☆158 Updated 6 months ago
brendanhogan / picoDeepResearch
☆68 Updated 5 months ago
BhabhaAI / dataformer
Solving data for LLMs - Create quality synthetic datasets!
☆151 Updated 9 months ago
LLMSELECTOR / LLMSELECTOR
☆79 Updated 3 weeks ago
JigsawStack / deep-research
An OpenSource Deep Research library with reasoning
☆161 Updated last month
MetaStone-AI / XBai-o4
☆300 Updated 2 months ago
SakanaAI / natural_niches
The code repository of the paper: Competition and Attraction Improve Model Fusion
☆161 Updated 2 months ago
alexzhang13 / rlm
Super basic implementation (gist-like) of RLMs with REPL environments.
☆204 Updated last week
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆56 Updated 11 months ago
agora-protocol / paper-demo
☆170 Updated 7 months ago
NVlabs / UniversalDeepResearch
Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)
☆446 Updated 2 months ago
huggingface / huggingface-gemma-recipes
Inference, Fine Tuning and many more recipes with Gemma family of models
☆273 Updated 3 months ago
philschmid / mcp-openai-gemini-llama-example
☆181 Updated 8 months ago
tom-doerr / mnist_dspy
☆36 Updated 8 months ago
google-deepmind / latent-multi-hop-reasoning
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
☆79 Updated 7 months ago
Arize-ai / prompt-learning
☆113 Updated this week
Intelligent-Internet / ii-researcher
II-Researcher: a new open-source framework designed to aid building search / research agents
☆475 Updated 2 months ago
SalesforceAIResearch / promptomatix
An Automatic Prompt Optimization Framework for Large Language Models
☆130 Updated 2 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆108 Updated 7 months ago
avbiswas / context-engineering-dspy
Context Engineering Course with DSPy
☆195 Updated 2 months ago
QuixiAI / dolphin-logger
☆104 Updated 4 months ago
Royaltyprogram / Crux
The State Of The Art, intelligence
☆154 Updated 2 months ago
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91 Updated 9 months ago
obalcells / hallucination_probes
Real-Time Detection of Hallucinated Entities in Long-Form Generation
☆260 Updated last week
weaviate / gorilla
Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.
☆136 Updated 2 months ago
hammer-mt / DSPyUI
A user interface for DSPy
☆195 Updated 3 weeks ago
PrimeIntellect-ai / genesys
☆135 Updated 7 months ago
haizelabs / Awesome-LLM-Judges
⚖️ Awesome LLM Judges ⚖️
☆132 Updated 5 months ago
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆78 Updated 11 months ago