LarsChrWiik / lars_datasets
Lars's datasets
☆12 Updated last year
Alternatives and similar repositories for lars_datasets
Users who are interested in lars_datasets are comparing it to the libraries listed below.
- Compare how Agent systems perform on several benchmarks. ☆103 Updated 6 months ago
- ARAGOG - Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆113 Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆120 Updated 3 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆242 Updated last year
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio… ☆110 Updated 4 months ago
- Complex question answering in LLMs with enhanced reasoning and information-seeking capabilities. ☆204 Updated 2 years ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆151 Updated last year
- Large-language Model Evaluation framework with Elo Leaderboard and A-B testing ☆52 Updated last year
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets ☆197 Updated last year
- ☆22 Updated last year
- ☆52 Updated 2 years ago
- This repository implements the chain of verification paper by Meta AI ☆196 Updated 2 years ago
- WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting. ☆62 Updated last month
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation ☆311 Updated last year
- Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers ☆84 Updated 8 months ago
- ☆125 Updated last year
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning ☆309 Updated last year
- LangChain, Llama2-Chat, and zero- and few-shot prompting are used to generate synthetic datasets for IR and RAG system evaluation ☆39 Updated 2 years ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ☆285 Updated 2 years ago
- Benchmark baseline for retrieval qa applications ☆120 Updated last year
- The official evaluation suite and dynamic data release for MixEval. ☆255 Updated last year
- Implementation of Google's SELF-DISCOVER ☆301 Updated last year
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey. ☆190 Updated 9 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ☆67 Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆69 Updated last year
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding ☆418 Updated 2 years ago
- The Synthetic-Persona-Chat dataset is a synthetically generated persona-based dialogue dataset. It extends the original Persona-Chat data… ☆107 Updated 2 years ago
- ☆279 Updated 2 years ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators" ☆137 Updated 2 years ago
- Evaluation tools for Retrieval-augmented Generation (RAG) methods. ☆170 Updated last year