obalcells / hallucination_probesLinks
Real-Time Detection of Hallucinated Entities in Long-Form Generation
☆266Updated last month
Alternatives and similar repositories for hallucination_probes
Users that are interested in hallucination_probes are comparing it to the libraries listed below
Sorting:
- Verifiers for LLM Reinforcement Learning☆78Updated 2 months ago
- ☆300Updated 3 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆248Updated 3 weeks ago
- ☆86Updated last year
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆447Updated 2 months ago
- ⚖️ Awesome LLM Judges ⚖️☆133Updated 6 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆200Updated 2 months ago
- ☆79Updated last month
- The State Of The Art, intelligence☆156Updated 3 months ago
- An alignment auditing agent capable of quickly exploring alignment hypothesis☆652Updated this week
- ☆171Updated 8 months ago
- ☆158Updated 6 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆477Updated 3 months ago
- Together Open Deep Research☆352Updated 7 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆477Updated 2 months ago
- Codebase for FinePDFs☆135Updated last week
- OSS RL environment + evals toolkit☆200Updated this week
- ☆68Updated 5 months ago
- Train Large Language Models on MLX.☆213Updated last week
- craft post-training data recipes☆53Updated last week
- Simple examples using Argilla tools to build AI☆56Updated 11 months ago
- 🧠 Advanced Claude streaming interface with interleaved thinking, dynamic tool discovery, and MCP integration. Watch Claude think through…☆180Updated 5 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆275Updated 3 months ago
- An OpenSource Deep Research library with reasoning☆164Updated last week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆82Updated 7 months ago
- Simple UI for debugging correlations of text embeddings☆299Updated 5 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆488Updated this week
- ☆102Updated last year
- ☆234Updated 4 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆300Updated 4 months ago