argilla-io / awesome-llm-datasets
A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)
★ 21 · Updated last year
Related projects
Alternatives and complementary repositories for awesome-llm-datasets
- Streamlit app for recommending eval functions using prompt diffs ★ 25 · Updated 10 months ago
- Github repo for storing LlamaDatasets ★ 30 · Updated 10 months ago
- LLM finetuning ★ 42 · Updated last year
- Explore the use of DSPy for extracting features from PDFs ★ 33 · Updated 8 months ago
- ★ 20 · Updated 9 months ago
- Data preparation code for CrystalCoder 7B LLM ★ 42 · Updated 6 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ★ 23 · Updated 2 weeks ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ★ 20 · Updated 9 months ago
- Open Implementations of LLM Analyses ★ 94 · Updated last month
- Build Agentic workflows with function calling ★ 20 · Updated last week
- Tools for formatting large language model prompts. ★ 12 · Updated 11 months ago
- A collection of pre-built wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and others! ★ 20 · Updated 2 weeks ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ★ 62 · Updated 3 weeks ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing. ★ 27 · Updated last week
- Ultra Fast Multi-Modality Vector Database ★ 17 · Updated 9 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ★ 48 · Updated 4 months ago
- Chat Markup Language conversation library ★ 54 · Updated 10 months ago
- Simple examples using Argilla tools to build AI ★ 42 · Updated last week
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I… ★ 67 · Updated 2 weeks ago
- ★ 20 · Updated last year
- ★ 30 · Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ★ 37 · Updated 7 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ★ 53 · Updated 3 weeks ago
- BH hackathon ★ 14 · Updated 7 months ago
- Mixing Language Models with Self-Verification and Meta-Verification ★ 97 · Updated last year
- Tools to make language models a bit easier to use ★ 30 · Updated last week
- Repo hosting code and materials related to speeding up LLM inference using token merging. ★ 29 · Updated 6 months ago
- ★ 17 · Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ★ 28 · Updated 9 months ago
- LLMs as Collaboratively Edited Knowledge Bases ★ 43 · Updated 9 months ago