obalcells / hallucination_probes
Real-Time Detection of Hallucinated Entities in Long-Form Generation
☆278 Updated 2 months ago
Alternatives and similar repositories for hallucination_probes
Users interested in hallucination_probes are comparing it to the repositories listed below:
- The State Of The Art, intelligence ☆157 Updated 6 months ago
- Verifiers for LLM Reinforcement Learning ☆82 Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆261 Updated this week
- [ICLR2026] Test-Time Scaling with Reflective Generative Model ☆302 Updated 2 weeks ago
- Codebase for FinePDFs ☆176 Updated last month
- build and benchmark deep research ☆232 Updated 2 weeks ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆460 Updated 5 months ago
- ⚖️ Awesome LLM Judges ⚖️ ☆161 Updated 9 months ago
- ☆67 Updated 8 months ago
- ☆87 Updated last year
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast… ☆151 Updated this week
- An alignment auditing agent capable of quickly exploring alignment hypothesis ☆887 Updated this week
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results ☆225 Updated 5 months ago
- Deep research agents using MiniMax M2.1 interleaved thinking ☆197 Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆279 Updated 6 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆496 Updated 5 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆636 Updated last month
- ☆80 Updated 4 months ago
- ☆127 Updated 4 months ago
- ☆308 Updated 3 months ago
- ☆177 Updated 11 months ago
- Evolve your language agent with Agentic Context Engineering (ACE) ☆596 Updated 3 weeks ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code ☆95 Updated last week
- Simple UI for debugging correlations of text embeddings ☆305 Updated 8 months ago
- ☆274 Updated 3 weeks ago
- Simple examples using Argilla tools to build AI ☆57 Updated last year
- Together Open Deep Research ☆358 Updated 9 months ago
- Context Engineering Course with DSPy ☆214 Updated 6 months ago
- Solving data for LLMs - Create quality synthetic datasets! ☆151 Updated last year
- Claude Deep Research config for Claude Code. ☆226 Updated 10 months ago