argilla-io / awesome-llm-datasets
π©π€π€ A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)
β21Updated last year
Related projects: β
- Voyage AI Official Python Libraryβ37Updated 3 months ago
- Streamlit app for recommending eval functions using prompt diffsβ24Updated 8 months ago
- Evaluation and analysis code for LLM360β75Updated 3 months ago
- Github repo for storing LlamaDatasetsβ27Updated 8 months ago
- LLMs as Collaboratively Edited Knowledge Basesβ40Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β48Updated 2 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ33Updated 6 months ago
- Writing Blog Posts with Generative Feedback Loops!β41Updated 6 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β58Updated 2 weeks ago
- LLM finetuningβ41Updated last year
- Ultra Fast Multi-Modality Vector Databaseβ16Updated 6 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for youβ¦β29Updated 4 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.β24Updated this week
- β20Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper.β65Updated 2 months ago
- Tools to make language models a bit easier to useβ22Updated last week
- β37Updated 9 months ago
- β71Updated 3 months ago
- ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpuβ¦β43Updated last year
- Score LLM pretraining data with classifiersβ56Updated 10 months ago
- β48Updated 11 months ago
- Example Notebook for Synthetic User Research with Persona Prompting and Autonomous Agentsβ26Updated 5 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.β26Updated 7 months ago
- LLM prompt language based on Jinjaβ52Updated 2 weeks ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive argumentsβ49Updated 3 weeks ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documentsβ11Updated last month
- Solve Geometric & Graph Problems with Large Language Modelsβ27Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".β62Updated 2 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems.β48Updated 3 weeks ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β39Updated 2 weeks ago