VILA-Lab / ATLASLinks
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171
☆972Updated last year
Alternatives and similar repositories for ATLAS
Users that are interested in ATLAS are comparing it to the libraries listed below
Sorting:
- FacTool: Factuality Detection in Generative AI☆889Updated last year
- Using Tree-of-Thought Prompting to boost ChatGPT's reasoning☆788Updated last year
- Open-source tool to visualise your RAG 🔮☆1,150Updated 7 months ago
- ☆372Updated last year
- ☆890Updated 10 months ago
- Code for our ACL 2023 Paper "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models".☆686Updated 2 years ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆509Updated last year
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,103Updated last month
- ☆957Updated last month
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding☆403Updated last year
- ☆773Updated 2 months ago
- The Tree of Thoughts (ToT) framework for solving complex reasoning tasks using LLMs☆348Updated last year
- ☆1,500Updated last year
- ☆1,039Updated 2 years ago
- prompt2model - Generate Deployable Models from Natural Language Instructions☆2,009Updated 8 months ago
- A unified evaluation framework for large language models☆2,699Updated 3 weeks ago
- Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']☆475Updated 7 months ago
- ☆1,300Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,993Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆594Updated last year
- 🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.☆658Updated this week
- Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".☆639Updated 3 weeks ago
- Automatically evaluate your LLMs in Google Colab☆656Updated last year
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆958Updated 10 months ago
- GPT based autonomous agent designed to create personalized newspapers tailored to user preferences.☆1,363Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆555Updated last year
- Evaluate your LLM's response with Prometheus and GPT4 💯☆981Updated 4 months ago
- ☆307Updated last year
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"☆344Updated last year
- A tool for evaluating LLMs☆424Updated last year