VILA-Lab / ATLASLinks

A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171

☆968

Alternatives and similar repositories for ATLAS

Users that are interested in ATLAS are comparing it to the libraries listed below

Sorting:

GAIR-NLP / factool
FacTool: Factuality Detection in Generative AI
☆885Updated 11 months ago
gabrielchua / RAGxplorer
Open-source tool to visualise your RAG 🔮
☆1,147Updated 7 months ago
character-ai / prompt-poet
Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.
☆1,098Updated 2 weeks ago
run-llama / finetune-embedding
Fine-Tuning Embedding for RAG with Synthetic Data
☆506Updated last year
dave1010 / tree-of-thought-prompting
Using Tree-of-Thought Prompting to boost ChatGPT's reasoning
☆785Updated last year
langchain-ai / langsmith-cookbook
☆944Updated 2 weeks ago
ray-project / llm-applications
A comprehensive guide to building RAG-based LLM applications for production.
☆1,813Updated last year
keirp / automatic_prompt_engineer
☆1,286Updated last year
Raudaschl / rag-fusion
☆885Updated 9 months ago
langchain-ai / auto-evaluator
☆773Updated last month
BatsResearch / bonito
A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.
☆785Updated 3 weeks ago
run-llama / llama-lab
☆1,491Updated last year
suzgunmirac / meta-prompting
Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding
☆397Updated last year
prometheus-eval / prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
☆978Updated 3 months ago
neulab / prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
☆2,008Updated 7 months ago
ctlllll / LLM-ToolMaker
☆1,036Updated 2 years ago
yuchenlin / LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…
☆956Updated 9 months ago
finic-ai / doctran
☆508Updated 11 months ago
trigaten / The_Prompt_Report
☆368Updated last year
AGI-Edgerunners / Plan-and-Solve-Prompting
Code for our ACL 2023 Paper "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models".
☆681Updated 2 years ago
mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆649Updated last year
google-deepmind / long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".
☆629Updated this week
microsoft / promptbench
A unified evaluation framework for large language models
☆2,679Updated this week
microsoft / sammo
A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)
☆716Updated last month
jieyilong / tree-of-thought-puzzle-solver
The Tree of Thoughts (ToT) framework for solving complex reasoning tasks using LLMs
☆346Updated 11 months ago
gkamradt / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆1,956Updated 11 months ago
MagnivOrg / prompt-layer-library
🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.
☆651Updated this week
run-llama / chat-llamaindex
☆961Updated 11 months ago
IntelLabs / fastRAG
Efficient Retrieval Augmentation and Generation Framework
☆1,625Updated 7 months ago
MikeWangWZHL / Solo-Performance-Prompting
Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"
☆343Updated last year