obalcells / hallucination_probes
Real-Time Detection of Hallucinated Entities in Long-Form Generation
☆260Updated last week
Alternatives and similar repositories for hallucination_probes
Users who are interested in hallucination_probes are comparing it to the libraries listed below.
- ☆300Updated 2 months ago
- Salesforce Enterprise Deep Research☆147Updated this week
- Simple examples using Argilla tools to build AI☆56Updated 11 months ago
- ☆86Updated last year
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆463Updated 2 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆190Updated 2 months ago
- ☆158Updated 6 months ago
- Verifiers for LLM Reinforcement Learning☆77Updated last month
- ⚖️ Awesome LLM Judges ⚖️☆132Updated 5 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆204Updated last week
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆446Updated 2 months ago
- ☆68Updated 5 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆79Updated 7 months ago
- The State Of The Art, intelligence☆154Updated 2 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆273Updated 3 months ago
- Together Open Deep Research☆352Updated 6 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated 9 months ago
- Train your own SOTA deductive reasoning model☆108Updated 7 months ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!☆127Updated 3 weeks ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆297Updated 4 months ago
- OSS RL environment + evals toolkit☆192Updated this week
- ☆79Updated 3 weeks ago
- An OpenSource Deep Research library with reasoning☆161Updated last month
- ☆232Updated 3 months ago
- Routing on Random Forest (RoRF)☆214Updated last year
- An Automatic Prompt Optimization Framework for Large Language Models☆130Updated 2 months ago
- ☆170Updated 7 months ago
- Prompt design in Python☆63Updated 10 months ago
- ☆104Updated 4 months ago
- An automated tool for discovering insights from research paper corpora☆138Updated last year