VILA-Lab / ATLAS
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171
☆952Updated 10 months ago
Alternatives and similar repositories for ATLAS:
Users that are interested in ATLAS are comparing it to the libraries listed below
- ☆321Updated 9 months ago
- LangChain-powered web researcher chatbot. Searches for sources on the web and cites them in generated answers.☆540Updated last year
- ☆855Updated 5 months ago
- FacTool: Factuality Detection in Generative AI☆859Updated 7 months ago
- ☆1,461Updated last year
- A database of SDKs, frameworks, libraries, and tools for creating, monitoring, debugging and deploying autonomous AI agents☆903Updated last month
- ☆496Updated 7 months ago
- Open-source tool to visualise your RAG 🔮☆1,119Updated 2 months ago
- A unified evaluation framework for large language models☆2,574Updated last month
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding☆376Updated last year
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆757Updated last month
- A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain☆3,474Updated last year
- An LLM-powered advanced RAG pipeline built from scratch☆831Updated last year
- Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".☆591Updated 3 weeks ago
- A framework for prompt tuning using Intent-based Prompt Calibration☆2,449Updated 4 months ago
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"☆325Updated 10 months ago
- Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']☆458Updated 2 months ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.☆672Updated 5 months ago
- The Tree of Thoughts (ToT) framework for solving complex reasoning tasks using LLMs☆321Updated 7 months ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models☆504Updated 9 months ago
- ☆292Updated last year
- Evaluate your LLM's response with Prometheus and GPT4 💯☆893Updated 2 weeks ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆489Updated last year
- ☆869Updated 3 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,359Updated last week
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆2,012Updated 10 months ago
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,637Updated 6 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆613Updated last week
- Automatically evaluate your LLMs in Google Colab☆605Updated 10 months ago
- ☆1,231Updated 11 months ago