VILA-Lab / ATLAS
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171
☆965 · Updated last year
Alternatives and similar repositories for ATLAS
Users interested in ATLAS are comparing it to the repositories listed below:
- FacTool: Factuality Detection in Generative AI ☆878 · Updated 10 months ago
- [IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks. ☆1,370 · Updated last year
- Ship RAG based LLM web apps in seconds. ☆995 · Updated last year
- The Tree of Thoughts (ToT) framework for solving complex reasoning tasks using LLMs ☆342 · Updated 10 months ago
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆589 · Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents ☆550 · Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆1,897 · Updated 10 months ago
- ☆1,031 · Updated 2 years ago
- ☆596 · Updated 2 years ago
- Using Tree-of-Thought Prompting to boost ChatGPT's reasoning ☆773 · Updated last year
- ☆1,486 · Updated last year
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆5,191 · Updated 3 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆639 · Updated 3 months ago
- Evaluation tool for LLM QA chains ☆1,073 · Updated 2 years ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w… ☆836 · Updated 2 months ago
- Open-source tool to visualise your RAG 🔮 ☆1,136 · Updated 5 months ago
- A unified evaluation framework for large language models ☆2,641 · Updated 3 weeks ago
- Code for our ACL 2023 Paper "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models". ☆668 · Updated 2 years ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆947 · Updated 8 months ago
- LangChain-powered web researcher chatbot. Searches for sources on the web and cites them in generated answers. ☆543 · Updated last year
- ☆920 · Updated 6 months ago
- Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models". ☆617 · Updated last week
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling ☆1,704 · Updated 11 months ago
- A self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks. ☆1,663 · Updated 9 months ago
- LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform… ☆1,459 · Updated last year
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively. ☆706 · Updated 8 months ago
- Decoupling Reasoning from Observations for Efficient Augmented Language Models ☆903 · Updated last year
- [ICLR 2025 Spotlight] An open-sourced LLM judge for evaluating LLM-generated answers. ☆369 · Updated 4 months ago
- Code for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models". ☆1,132 · Updated last year
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit… ☆1,073 · Updated last year