obalcells / hallucination_probesLinks
Real-Time Detection of Hallucinated Entities in Long-Form Generation
☆269Updated 3 weeks ago
Alternatives and similar repositories for hallucination_probes
Users that are interested in hallucination_probes are comparing it to the libraries listed below
Sorting:
- Verifiers for LLM Reinforcement Learning☆79Updated 2 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆278Updated last month
- ☆301Updated 4 months ago
- The State Of The Art, intelligence☆156Updated 3 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆206Updated 3 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆450Updated 3 months ago
- ☆124Updated this week
- Inference, Fine Tuning and many more recipes with Gemma family of models☆274Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆225Updated this week
- ☆79Updated 2 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆303Updated last week
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆696Updated last week
- ☆158Updated 7 months ago
- Simple UI for debugging correlations of text embeddings☆302Updated 6 months ago
- Together Open Deep Research☆355Updated 7 months ago
- The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"☆285Updated last week
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- ☆68Updated 6 months ago
- Evolve your language agent with Agentic Context Engineering (ACE)☆136Updated 2 weeks ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆481Updated 4 months ago
- Training-Ready RL Environments + Evals☆185Updated this week
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆494Updated last week
- ☆86Updated last year
- ⚖️ Awesome LLM Judges ⚖️☆134Updated 7 months ago
- Letting Claude Code develop his own MCP tools :)☆123Updated 8 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆84Updated 8 months ago
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆289Updated this week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆479Updated 3 months ago
- Deep research agents using MiniMax-M2 interleaved thinking☆139Updated last week
- 🧠 Advanced Claude streaming interface with interleaved thinking, dynamic tool discovery, and MCP integration. Watch Claude think through…☆180Updated 5 months ago