Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance
☆40Sep 11, 2023Updated 2 years ago
Alternatives and similar repositories for llm-data-annotation
Users that are interested in llm-data-annotation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a…☆31Sep 7, 2024Updated last year
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆16Feb 18, 2022Updated 4 years ago
- [TACL] Code for "Red Teaming Language Model Detectors with Language Models"☆24Nov 24, 2023Updated 2 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- ☆24Feb 5, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Mar 11, 2024Updated 2 years ago
- Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).☆15Dec 13, 2024Updated last year
- ☆13Nov 7, 2023Updated 2 years ago
- ☆10Jul 30, 2025Updated 7 months ago
- The official repository for paper "LLMaAA: Making Large Language Models as Active Annotators"☆44Apr 14, 2024Updated last year
- Official code for "Automated Scoring for Reading Comprehension via In-context BERT Tuning" (AIED 2022)☆13May 23, 2022Updated 3 years ago
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆36Jun 8, 2023Updated 2 years ago
- ☆19Mar 6, 2023Updated 3 years ago
- Powerful document clustering models are essential as they can efficiently process large sets of documents. These models can be helpful in…☆17Oct 30, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- JavaScript visualizations of various DELPH-IN structures.☆17Feb 3, 2022Updated 4 years ago
- Tool for the automatic assessment of lexical diversity☆14Sep 6, 2025Updated 6 months ago
- Early Detection of Fake News with Multi-source Weak Social Supervision☆23Jun 12, 2023Updated 2 years ago
- Repo containing documentation and explanation for CSET's harm taxonomy of incidents from AIID.☆19Jun 21, 2024Updated last year
- Pytorch implementation of "Enhancing Chinese Pre-trained Language Model via Heterogeneous Linguistics Graph", ACL 2022☆15Feb 28, 2022Updated 4 years ago
- ☆648Jul 29, 2025Updated 7 months ago
- Python scripts to scrape the iTunes Podcast categories.☆12Nov 30, 2020Updated 5 years ago
- FrugalScore is an approach to learn a fixed, low cost version of any expensive NLG metric, while retaining most of its original performan…☆16Sep 21, 2022Updated 3 years ago
- FDSML Course Project 2020/21☆15May 11, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.☆12Sep 30, 2021Updated 4 years ago
- Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection☆41Dec 21, 2023Updated 2 years ago
- Duke Machine Learning Winter School: Computer Vision 2022☆10Jan 3, 2022Updated 4 years ago
- Framework for controlling demographic biases in NLG (using adversarial prompts)☆21Jun 12, 2023Updated 2 years ago
- ☆13Sep 13, 2015Updated 10 years ago
- Data from paper: "Benign Effects of Automation: New Evidence from Patent Texts"☆12May 31, 2025Updated 9 months ago
- HackerRank, LeetCode, Cracking the Coding Interview Solutions in Python/C++☆11Mar 15, 2026Updated last week
- GPU-accelerated algorithm for subsampling datasets while preserving diversity☆27Jan 12, 2024Updated 2 years ago
- A supplementary material to "The Evolution of Work in the United States"☆12Jun 23, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- LLMEval☆11Feb 12, 2024Updated 2 years ago
- This repo is for the Mis2-KDD 2021 under review paper: Dataset of Propaganda Techniques of the State-Sponsored Information Operation of t…☆19Feb 5, 2022Updated 4 years ago
- ☆19Apr 28, 2021Updated 4 years ago
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- Master the techniques of function-calling and structured data extraction with LLMs. Learn to enhance LLM capabilities, integrate web serv…☆12Jun 29, 2024Updated last year
- A demo project showcasing a domain specific AI powered knowledge base☆17Jan 4, 2024Updated 2 years ago
- ☆14Jun 17, 2024Updated last year