argilla-io / awesome-llm-datasets
👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)
☆24 · May 2, 2023 · Updated 2 years ago
Alternatives and similar repositories for awesome-llm-datasets
Users who are interested in awesome-llm-datasets are comparing it to the repositories listed below.
- A Python library aimed at dissecting and augmenting NER training data. ☆61 · May 11, 2023 · Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Mar 12, 2024 · Updated last year
- Translating human input as kubectl commands using LLMs powered by Yacana ☆12 · Feb 4, 2026 · Updated last week
- ☆29 · May 30, 2023 · Updated 2 years ago
- Code for constructing TLDR corpus from Reddit dataset ☆27 · Nov 23, 2021 · Updated 4 years ago
- A YouTube agent built using AutoGen ☆34 · Dec 13, 2024 · Updated last year
- Fact checking baseline combining dense retrieval and textual entailment ☆30 · Aug 10, 2025 · Updated 6 months ago
- An awesome & curated list of best LLMOps tools for developers ☆25 · Jun 21, 2023 · Updated 2 years ago
- Examples for the HEBI Robotics Python API ☆14 · Jan 9, 2026 · Updated last month
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local … ☆13 · Dec 8, 2025 · Updated 2 months ago
- ☆12 · Jun 19, 2025 · Updated 7 months ago
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025) ☆18 · Jan 11, 2026 · Updated last month
- FELICS Framework ☆11 · Dec 5, 2019 · Updated 6 years ago
- Sample code for integrating LightWare rangefinders into various platforms. ☆10 · Oct 22, 2025 · Updated 3 months ago
- An interactive remote control panel for various devices ☆14 · Feb 28, 2022 · Updated 3 years ago
- ECED440 Computer Security ☆11 · Nov 6, 2024 · Updated last year
- Open Source Project for Defi Crypto Portfolio Management ☆10 · Feb 4, 2025 · Updated last year
- Faster access to Tesseract-OCR from Python ☆13 · Jun 8, 2021 · Updated 4 years ago
- Python wrapper for the energy system optimization framework IESopt. ☆18 · Feb 9, 2026 · Updated last week
- Converts Python3 .py files into .exe and makes it so the file can run on any environment without installing python3. ☆11 · Jun 7, 2018 · Updated 7 years ago
- A repository aimed at sharing links to climate-related resources. ☆12 · Feb 4, 2026 · Updated last week
- ☆17 · Updated this week
- Gimp plugins to extract text from images (Bubble/Balloons) ☆12 · Jul 7, 2024 · Updated last year
- Blackbird OSINT tool FrontEnd React Project ☆13 · Mar 6, 2024 · Updated last year
- An Alfred workflow--and a command line utility--to easily find recently modified files. ☆13 · Sep 15, 2023 · Updated 2 years ago
- Kismet website generation & documentation data ☆12 · Feb 7, 2026 · Updated last week
- ☆15 · Oct 24, 2023 · Updated 2 years ago
- ☆12 · Jan 19, 2023 · Updated 3 years ago
- gcnano-binaries ☆11 · Feb 9, 2026 · Updated last week
- Multi-task model for named-entity recognition, relation extraction, entity mention detection and coreference resolution. ☆46 · Jun 26, 2024 · Updated last year
- ☆52 · Mar 9, 2025 · Updated 11 months ago
- Code repository for our paper, "Medical Large Language Models are Vulnerable to Data Poisoning Attacks" (Nature Medicine, 2024). ☆12 · Jan 5, 2025 · Updated last year
- OpenAI's Code Interpreter running locally, as a service via WebSocket ☆10 · Sep 22, 2023 · Updated 2 years ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models ☆11 · Apr 9, 2024 · Updated last year
- GOST-34.11-2012 (Stribog) hash-function ☆11 · May 12, 2015 · Updated 10 years ago
- A virtual city environment for traffic simulation, drone simulation, computer vision, etc. ☆13 · Jan 28, 2026 · Updated 2 weeks ago
- ☆11 · Apr 17, 2023 · Updated 2 years ago
- The official Python library for Openlayer, the Continuous Model Improvement Platform for AI. 📈 ☆16 · Updated this week
- A dark theme for Redoc ☆11 · Nov 24, 2024 · Updated last year