IBM / InspectorRAGet
The repository contains generative AI analytics platform application code.
☆22Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for InspectorRAGet
- ☆41Updated 2 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆27Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper.☆72Updated 2 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆61Updated 4 months ago
- ☆28Updated 8 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- ☆22Updated 2 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆47Updated 10 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆76Updated last month
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆109Updated 3 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆48Updated last month
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆83Updated 2 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆77Updated 8 months ago
- ☆41Updated 2 weeks ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated last month
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆96Updated 7 months ago
- ☆42Updated 4 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆23Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; arXiv preprint arXiv:2403.…☆37Updated 4 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆22Updated 9 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆41Updated 8 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆97Updated last year
- ☆65Updated 2 months ago
- ☆24Updated last year
- Automatic Evals for Instruction-Tuned Models☆71Updated this week
- ☆28Updated 5 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆20Updated 9 months ago
- ☆127Updated 7 months ago