YurtsAI / llm-hallucination-evalLinks
Hallucination evaluation for Large Language Models
☆12Updated 2 years ago
Alternatives and similar repositories for llm-hallucination-eval
Users that are interested in llm-hallucination-eval are comparing it to the libraries listed below
Sorting:
- A language agent gym with challenging scientific tasks☆236Updated this week
- LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer☆43Updated last year
- ☆192Updated 5 months ago
- ☆75Updated last year
- A langchain agent that retries☆51Updated 2 years ago
- ChemNLP project☆170Updated last week
- Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automat…☆180Updated last year
- This repo is for the Open Protein Instructions (OPI) project, aiming to build and release a high-quality and comprehensive protein instru…☆10Updated 10 months ago
- ☆30Updated 2 years ago
- Repository for the paper "MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance"☆23Updated 11 months ago
- ☆92Updated 2 years ago
- Medical reasoning using large language models☆92Updated 2 years ago
- ☆104Updated 2 weeks ago
- Pembrolizumab-like Antibody Hallucination using AlphaFold2☆16Updated 2 years ago
- ☆283Updated last year
- This is Clinfo.AI Demo Instruction☆37Updated last year
- GPT-powered solution for extracting and modifying data in tables using natural language commands.☆44Updated 2 years ago
- ☆44Updated last year
- LLM for Drug Editing, ICLR 2024☆156Updated last year
- ☆69Updated 2 years ago
- Product analytics for AI Assistants☆157Updated 8 months ago
- Directly Connecting Python to LLMs via Strongly-Typed Functions, Dataclasses, Interfaces & Generic Types☆402Updated 11 months ago
- BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance.☆57Updated 2 years ago
- Framework enabling modular interchange of language agents, environments, and optimizers☆121Updated last week
- Data from BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology paper☆27Updated last year
- ☆13Updated 2 years ago
- 🤖🌊 aiFlows: The building blocks of your collaborative AI☆272Updated last year
- Chemcrow☆874Updated last year
- ☆57Updated 2 years ago
- ☆32Updated last year