janhq / verifiers-deepresearchLinks
Verifiers for LLM Reinforcement Learning
☆79Updated 3 months ago
Alternatives and similar repositories for verifiers-deepresearch
Users that are interested in verifiers-deepresearch are comparing it to the libraries listed below
Sorting:
- ☆159Updated 8 months ago
- Deep research agents using MiniMax M2.1 interleaved thinking☆176Updated last week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆455Updated 4 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- ☆107Updated 2 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆273Updated last month
- Simple examples using Argilla tools to build AI☆57Updated last year
- ☆301Updated 4 months ago
- ☆68Updated 7 months ago
- An OpenSource Deep Research library with reasoning☆172Updated 3 weeks ago
- AI Agent that researches the lives of historical figures and extracts events into structured JSON timelines using LangGraph multi-agent o…☆218Updated 2 months ago
- MLX-based QA pair generator and LLM finetuning tool in Streamlit☆42Updated 2 months ago
- ☆173Updated 9 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆290Updated 2 months ago
- Train Large Language Models on MLX.☆239Updated 3 weeks ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆41Updated 2 months ago
- ☆79Updated 3 months ago
- OmniDaemon is a Universal Event-Driven Runtime for AI Agents, it's framework-agnostic, event-driven runtime that turns AI agents into pr…☆44Updated 2 weeks ago
- Solving data for LLMs - Create quality synthetic datasets!☆150Updated 11 months ago
- ☆90Updated 11 months ago
- ☆182Updated 10 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 4 months ago
- Turn topics, links, and files into AI-generated research notebooks — summarize, explore, and ask anything.☆144Updated 6 months ago
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"☆114Updated 2 weeks ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆491Updated 4 months ago
- Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX☆172Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆249Updated this week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
- ☆193Updated 5 months ago
- ☆36Updated 10 months ago