Verifiers for LLM Reinforcement Learning
☆83 Sep 11, 2025 Updated 9 months ago
Alternatives and similar repositories for verifiers-deepresearch
Users who are interested in verifiers-deepresearch are comparing it to the libraries listed below.
- ☆68 May 23, 2025 Updated last year
- ☆36 Oct 26, 2025 Updated 7 months ago
- [ICML 2025] Logits are All We Need to Adapt Closed Models ☆23 May 2, 2025 Updated last year
- Explore training for quantized models ☆26 Jul 12, 2025 Updated 11 months ago
- Exploring Applications of GRPO ☆253 Aug 25, 2025 Updated 9 months ago
- RAG Tool using Haystack, Mistral, and Chainlit. All open source stack on CPU. ☆22 Oct 14, 2023 Updated 2 years ago
- ☆15 Feb 23, 2026 Updated 3 months ago
- Train Large Language Models on MLX. ☆377 Updated this week
- ☆20 Mar 10, 2025 Updated last year
- Tracking adoption of AI code review tools in open-source repos ☆33 Jun 3, 2026 Updated last week
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient ☆67 Aug 3, 2025 Updated 10 months ago
- Terraform ECX Fabric Provider ☆10 Mar 1, 2023 Updated 3 years ago
- A community-maintained registry of Model Context Protocol (MCP) servers with structured installation configurations for easy integration. ☆25 Updated this week
- Hi! My name is Kai G. I'm a knowledge AI, skilled in vector search, and graph RAG. My DB of choice is SurrealDB. ☆74 Jun 4, 2026 Updated last week
- ☆17 Jul 9, 2025 Updated 11 months ago
- Automatically annotates YOLO dataset using Moondream visual model ☆21 Aug 24, 2025 Updated 9 months ago
- Run GEPA on your favorite non-python libraries. ☆36 Jan 22, 2026 Updated 4 months ago
- ☆35 Jan 31, 2026 Updated 4 months ago
- aws lambda bash template, lambda bash shell script wrapped in nodejs ☆10 Sep 4, 2022 Updated 3 years ago
- ☆14 Apr 16, 2025 Updated last year
- Source code for 'MicroPython for the Internet of Things' by Charles Bell ☆15 Nov 3, 2017 Updated 8 years ago
- ☆45 Jan 19, 2026 Updated 4 months ago
- This sample illustrates using data compression with AWS Lambda functions ☆11 Apr 10, 2025 Updated last year
- Luth is a state-of-the-art series of fine-tuned LLMs for French ☆46 Oct 12, 2025 Updated 8 months ago
- Create synthetic datasets from scratch using AI-powered generation. Define topics, customize prompts, and generate high-quality reasoning… ☆31 Mar 18, 2026 Updated 2 months ago
- Code repository for Raspberry Pi for Secret Agents Third Edition, published by Packt ☆13 Jan 14, 2021 Updated 5 years ago
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models" ☆20 Mar 31, 2025 Updated last year
- Claude Code Skill for creating animations with Manim. Claude Code autonomously plans scenes, writes Manim code, renders videos, and refin… ☆70 Jan 26, 2026 Updated 4 months ago
- oda-r is a professional-grade compiler for Declarative Self-improving Python (DSPy), featuring comprehensive error handling, logging, and… ☆40 Jan 21, 2025 Updated last year
- A Comprehensive Library for Memory of LLM-based Agents. ☆111 May 13, 2025 Updated last year
- Feeling confused about super alignment? Here is a reading list ☆43 Jan 9, 2024 Updated 2 years ago
- H.AI cookbook provides code examples and guides to help developers use models developed by H Company. ☆80 Feb 20, 2026 Updated 3 months ago
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning" ☆14 Aug 13, 2025 Updated 10 months ago
- This is a repository where I show how to use Mistral 7B ☆10 Oct 26, 2023 Updated 2 years ago
- A virtual agent for your virtual books📚 ☆50 May 18, 2025 Updated last year
- A comprehensive guide for fine-tuning Large Language Models (LLMs) on Apple Silicon Macs using MLX Framework and llama.cpp. This reposito… ☆23 Oct 12, 2025 Updated 8 months ago
- Real-time guardrails for Claude Code tool calls. ☆70 Feb 4, 2026 Updated 4 months ago
- ☆85 Feb 1, 2024 Updated 2 years ago
- Official code of "The Automated but Risky Game: Modeling and Benchmarking Agent-to-Agent Negotiations and Transactions in Consumer Market… ☆27 Jun 9, 2026 Updated last week