VILA-Lab / ATLASLinks
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171
☆967Updated last year
Alternatives and similar repositories for ATLAS
Users that are interested in ATLAS are comparing it to the libraries listed below
Sorting:
- FacTool: Factuality Detection in Generative AI☆879Updated 10 months ago
- Open-source tool to visualise your RAG 🔮☆1,146Updated 6 months ago
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding☆393Updated last year
- ☆886Updated 8 months ago
- Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".☆622Updated this week
- ☆1,489Updated last year
- prompt2model - Generate Deployable Models from Natural Language Instructions☆2,005Updated 6 months ago
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,087Updated 3 weeks ago
- ☆769Updated 3 weeks ago
- ☆1,283Updated last year
- ☆364Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,934Updated 11 months ago
- A joint community effort to create one central leaderboard for LLMs.☆303Updated 10 months ago
- Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']☆472Updated 6 months ago
- GPT based autonomous agent designed to create personalized newspapers tailored to user preferences.☆1,337Updated last year
- Automatically evaluate your LLMs in Google Colab☆649Updated last year
- The Tree of Thoughts (ToT) framework for solving complex reasoning tasks using LLMs☆345Updated 10 months ago
- A unified evaluation framework for large language models☆2,661Updated last week
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆454Updated 5 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆504Updated last year
- A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)☆695Updated 3 weeks ago
- ☆933Updated 7 months ago
- ☆1,031Updated 2 years ago
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"☆339Updated last year
- Evaluate your LLM's response with Prometheus and GPT4 💯☆963Updated 2 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆646Updated 2 weeks ago
- ☆304Updated last year
- Using Tree-of-Thought Prompting to boost ChatGPT's reasoning☆780Updated last year
- Code for our ACL 2023 Paper "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models".☆676Updated 2 years ago
- A comprehensive guide to building RAG-based LLM applications for production.☆1,805Updated 11 months ago