janhq / verifiers-deepresearchLinks
Verifiers for LLM Reinforcement Learning
☆78Updated 2 months ago
Alternatives and similar repositories for verifiers-deepresearch
Users that are interested in verifiers-deepresearch are comparing it to the libraries listed below
Sorting:
- ☆158Updated 7 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆449Updated 2 months ago
- ☆107Updated 2 weeks ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆248Updated last month
- ☆68Updated 5 months ago
- ☆172Updated 8 months ago
- ☆300Updated 3 months ago
- Simple examples using Argilla tools to build AI☆56Updated last year
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆268Updated this week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆82Updated 8 months ago
- An OpenSource Deep Research library with reasoning☆165Updated last week
- Train Large Language Models on MLX.☆216Updated last week
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆138Updated 2 months ago
- An Automatic Prompt Optimization Framework for Large Language Models☆137Updated 3 months ago
- ☆36Updated 9 months ago
- ☆117Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models☆275Updated 4 months ago
- Context Engineering Course with DSPy☆202Updated 3 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆113Updated 7 months ago
- ☆79Updated last month
- A user interface for DSPy☆195Updated last month
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆165Updated 2 months ago
- ☆135Updated 8 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆150Updated 10 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆95Updated 6 months ago
- Codebase for FinePDFs☆145Updated 2 weeks ago
- ☆121Updated last week
- Finetune Llama-3-8b on the MathInstruct dataset☆114Updated last year
- ⚖️ Awesome LLM Judges ⚖️☆133Updated 6 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year