janhq / verifiers-deepresearchView external linksLinks
Verifiers for LLM Reinforcement Learning
☆82Sep 11, 2025Updated 5 months ago
Alternatives and similar repositories for verifiers-deepresearch
Users that are interested in verifiers-deepresearch are comparing it to the libraries listed below
Sorting:
- aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-firs…☆10May 9, 2015Updated 10 years ago
- ☆17Jul 9, 2025Updated 7 months ago
- ☆14Apr 16, 2025Updated 9 months ago
- ☆15Apr 10, 2024Updated last year
- [ICML 2025] Logits are All We Need to Adapt Closed Models☆21May 2, 2025Updated 9 months ago
- ☆26Oct 26, 2025Updated 3 months ago
- Evals that meet you where you are. For AI that's grounded.☆45Feb 6, 2026Updated last week
- Automatically annotates YOLO dataset using Moondream visual model☆20Aug 24, 2025Updated 5 months ago
- Create synthetic datasets from scratch using AI-powered generation. Define topics, customize prompts, and generate high-quality reasoning…☆29Updated this week
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"☆14Aug 13, 2025Updated 6 months ago
- TurboAPI: Lightning-Fast ASGI Framework with FastAPI-Compatible Syntax☆49Feb 7, 2026Updated last week
- A proxy for minimax-m2, enabling interleaved thinking, and tool calls.☆38Nov 21, 2025Updated 2 months ago
- Common tools for data processing☆22Dec 8, 2025Updated 2 months ago
- Exploring Applications of GRPO☆251Aug 25, 2025Updated 5 months ago
- A virtual agent for your virtual books📚☆48May 18, 2025Updated 8 months ago
- ☆25May 7, 2025Updated 9 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆41Oct 12, 2025Updated 4 months ago
- Use AI to edit your documents in real-time. Provide feedback and let the AI do all the work.☆29Jul 24, 2024Updated last year
- ☆45Jan 19, 2026Updated 3 weeks ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆50Jan 5, 2026Updated last month
- Instant Perfect Native MacOS Transcription☆53Jul 26, 2025Updated 6 months ago
- Build datasets using natural language☆566Sep 19, 2025Updated 4 months ago
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient☆66Aug 3, 2025Updated 6 months ago
- oda-r is a professional-grade compiler for Declarative Self-improving Python (DSPy), featuring comprehensive error handling, logging, and…☆39Jan 21, 2025Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆151Jan 20, 2025Updated last year
- Collection of specialized agent definitions for Claude Code☆32Feb 2, 2026Updated last week
- ☆12Jun 4, 2023Updated 2 years ago
- A General Quantum Software☆17Dec 11, 2025Updated 2 months ago
- ☆18Jun 25, 2025Updated 7 months ago
- Local text-to-speech in your browser with Piper TTS☆16Aug 13, 2025Updated 6 months ago
- ☆11Feb 26, 2024Updated last year
- This is an example RAG pipeline for ingesting private IP Network Design documentation for use with an LLM☆14Nov 5, 2025Updated 3 months ago
- ☆41Mar 20, 2024Updated last year
- I saw this [Blog Post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java so naturally I tried …☆14Jan 10, 2024Updated 2 years ago
- LLM Building Blocks for Python Course☆15Nov 17, 2025Updated 2 months ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 4 months ago
- ☆11Aug 23, 2024Updated last year
- An attempt to live code a working Retrieval Augmented Generation app with AI coding tools☆17Apr 24, 2025Updated 9 months ago
- a blog starter project☆11Oct 29, 2018Updated 7 years ago