VILA-Lab / ATLASLinks
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171
☆961Updated last year
Alternatives and similar repositories for ATLAS
Users that are interested in ATLAS are comparing it to the libraries listed below
Sorting:
- ☆878Updated 7 months ago
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding☆386Updated last year
- Open-source tool to visualise your RAG 🔮☆1,134Updated 5 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆830Updated 2 months ago
- Automated Evaluation of RAG Systems☆599Updated 2 months ago
- A comprehensive guide to building RAG-based LLM applications for production.☆1,797Updated 10 months ago
- ☆766Updated last year
- FacTool: Factuality Detection in Generative AI☆874Updated 9 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆423Updated last year
- Efficient Retrieval Augmentation and Generation Framework☆1,558Updated 4 months ago
- ☆1,263Updated last year
- ☆910Updated 5 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆635Updated 2 months ago
- Automatically evaluate your LLMs in Google Colab☆631Updated last year
- Evaluation tool for LLM QA chains☆1,070Updated 2 years ago
- ☆337Updated 11 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆500Updated last year
- Forward-Looking Active REtrieval-augmented generation (FLARE)☆634Updated last year
- Large Language Models for All, 🦙 Cult and More, Stay in touch !☆446Updated 2 years ago
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,060Updated 3 weeks ago
- An LLM-powered advanced RAG pipeline built from scratch☆841Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆587Updated last year
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"☆336Updated last year
- The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.☆766Updated last year
- A joint community effort to create one central leaderboard for LLMs.☆299Updated 9 months ago
- An LLM-based autonomous agent controlling real-world applications via RESTful APIs☆1,368Updated last year
- A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)☆674Updated 5 months ago
- Evaluate your LLM's response with Prometheus and GPT4 💯☆950Updated last month
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,483Updated 3 weeks ago
- ☆1,032Updated 2 years ago