psunlpgroup / GreaterPromptLinks
GreaterPrompt: A Python Toolkit for Prompt Optimization
☆53Updated 8 months ago
Alternatives and similar repositories for GreaterPrompt
Users that are interested in GreaterPrompt are comparing it to the libraries listed below
Sorting:
- ☆51Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- Retrieval Augmented Generation Generalized Evaluation Dataset☆59Updated 4 months ago
- Complex Function Calling Benchmark.☆149Updated 10 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆28Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆124Updated last month
- ☆81Updated last month
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆113Updated last week
- ☆38Updated last year
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24Updated 3 years ago
- Code and Data for "Language Modeling with Editable External Knowledge"☆36Updated last year
- ☆62Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated 10 months ago
- ☆20Updated 8 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆100Updated 2 years ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆211Updated 5 months ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆155Updated last year
- ☆41Updated 5 months ago
- ☆58Updated last year
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆79Updated this week
- Retrieval-Augmented Generation battle!☆61Updated 4 months ago
- ☆53Updated last year
- Data for the MTEB leaderboard☆39Updated this week
- ☆43Updated 10 months ago
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 9 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆89Updated 4 months ago
- Reasoning by Communicating with Agents☆29Updated 7 months ago
- Evaluating tool-augmented LLMs in conversation settings☆88Updated last year
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆37Updated 4 months ago