obalcells / hallucination_probes
Real-Time Detection of Hallucinated Entities in Long-Form Generation
☆273Updated last month
Alternatives and similar repositories for hallucination_probes
Users who are interested in hallucination_probes are comparing it to the repositories listed below
- Verifiers for LLM Reinforcement Learning☆79Updated 3 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆290Updated 2 months ago
- ☆301Updated 4 months ago
- ☆86Updated last year
- Simple examples using Argilla tools to build AI☆57Updated last year
- This repository contains the toolkit for replicating results from our technical report.☆185Updated 3 months ago
- Simple UI for debugging correlations of text embeddings☆306Updated 6 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆246Updated this week
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆451Updated 4 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆214Updated 4 months ago
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆759Updated last week
- ☆173Updated 9 months ago
- ☆68Updated 7 months ago
- The State Of The Art, intelligence☆157Updated 4 months ago
- Together Open Deep Research☆356Updated 8 months ago
- Codebase for FinePDFs☆156Updated last month
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆168Updated 4 months ago
- Evolve your language agent with Agentic Context Engineering (ACE)☆439Updated last month
- ☆159Updated 8 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆84Updated 9 months ago
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆486Updated 4 months ago
- ⚖️ Awesome LLM Judges ⚖️☆146Updated 7 months ago
- Train Large Language Models on MLX.☆236Updated 2 weeks ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆222Updated last month
- An automated tool for discovering insights from research paper corpora☆137Updated last year
- II-Researcher: a new open-source framework designed to aid building search / research agents☆487Updated 4 months ago
- ☆235Updated last month
- ☆91Updated 5 months ago