Verifiers for LLM Reinforcement Learning
☆83Sep 11, 2025Updated 5 months ago
Alternatives and similar repositories for verifiers-deepresearch
Users that are interested in verifiers-deepresearch are comparing it to the libraries listed below
Sorting:
- ☆67May 23, 2025Updated 9 months ago
- Terminate AV/EDR processes by exploiting the vulnerable NsecSoft driver☆33Sep 15, 2025Updated 5 months ago
- ☆29Oct 26, 2025Updated 4 months ago
- [ICML 2025] Logits are All We Need to Adapt Closed Models☆21May 2, 2025Updated 10 months ago
- ☆15Apr 10, 2024Updated last year
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"☆14Aug 13, 2025Updated 6 months ago
- Evals that meet you where you are. For AI that's grounded.☆53Feb 6, 2026Updated last month
- Perf monitoring CLI tool for Apple Silicon☆16Jan 1, 2024Updated 2 years ago
- Train Large Language Models on MLX.☆273Feb 27, 2026Updated last week
- TurboAPI: Lightning-Fast ASGI Framework with FastAPI-Compatible Syntax☆49Feb 7, 2026Updated last month
- Common tools for data processing☆22Dec 8, 2025Updated 2 months ago
- Exploring Applications of GRPO☆251Aug 25, 2025Updated 6 months ago
- A virtual agent for your virtual books📚☆48May 18, 2025Updated 9 months ago
- ☆25May 7, 2025Updated 10 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆42Oct 12, 2025Updated 4 months ago
- ☆86Feb 1, 2024Updated 2 years ago
- H.AI cookbook provides code examples and guides to help developers use models developed by H Company.☆66Feb 20, 2026Updated 2 weeks ago
- ☆35Jan 31, 2026Updated last month
- Instant Perfect Native MacOS Transcription☆52Jul 26, 2025Updated 7 months ago
- Build datasets using natural language☆568Sep 19, 2025Updated 5 months ago
- Smart Queue Management System For Syndicate Bank☆13Dec 10, 2022Updated 3 years ago
- oda-r is a professional-grade compiler for Declarative Self-improving Python (DSPy), featuring comprehensive error handling, logging, and…☆39Jan 21, 2025Updated last year
- This is a Login application for Android using Parse server.☆10Nov 26, 2018Updated 7 years ago
- ☆15Dec 23, 2022Updated 3 years ago
- A Claude Code plugin that solves the same problems as community frameworks (GSD, BMAD, Ralph, Agent OS) — but using the tool's native arc…☆28Updated this week
- Solving data for LLMs - Create quality synthetic datasets!☆151Jan 20, 2025Updated last year
- Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX☆174Sep 18, 2025Updated 5 months ago
- ☆41Mar 20, 2024Updated last year
- a blog starter project☆11Oct 29, 2018Updated 7 years ago
- A copy of the latest version of MVSIS☆12Apr 18, 2021Updated 4 years ago
- A Chrome extension that generates binaural beats.☆23Aug 23, 2023Updated 2 years ago
- Download Esign iOS 26 IPA signing tool – install IPAs without jailbreak or PC. Sign apps easily on iPhone, iPad with fast, stable certifi…☆43Jan 26, 2026Updated last month
- Collection of specialized agent definitions for Claude Code☆32Feb 2, 2026Updated last month
- ☆52Jan 20, 2026Updated last month
- ☆10Feb 9, 2024Updated 2 years ago
- 📚 Step to step guide on how to migrate a WordPress website☆11Aug 16, 2017Updated 8 years ago
- ☆12Jun 4, 2023Updated 2 years ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 4 months ago
- Scripts for Digital Design flow control.☆16Oct 30, 2025Updated 4 months ago