Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance
☆41Sep 11, 2023Updated 2 years ago
Alternatives and similar repositories for llm-data-annotation
Users that are interested in llm-data-annotation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a…☆31Sep 7, 2024Updated last year
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆16Feb 18, 2022Updated 4 years ago
- Convert CoNLL output of a dependency parser into a latex or graphviz tree☆12Mar 26, 2020Updated 6 years ago
- ☆24Feb 5, 2024Updated 2 years ago
- A Multi-Session and Multi-Therapy Benchmark for High-Realism AI Psychological Counselor☆38Jan 13, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆19Jun 4, 2024Updated last year
- ☆13Mar 11, 2024Updated 2 years ago
- Code for HyperSeg and HyperSum☆16Jul 15, 2025Updated 9 months ago
- ☆14Dec 13, 2023Updated 2 years ago
- ☆13Nov 7, 2023Updated 2 years ago
- The official repository for paper "LLMaAA: Making Large Language Models as Active Annotators"☆44Apr 14, 2024Updated 2 years ago
- ☆12Dec 26, 2023Updated 2 years ago
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua…☆37Jun 8, 2023Updated 2 years ago
- Powerful document clustering models are essential as they can efficiently process large sets of documents. These models can be helpful in…☆17Oct 30, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Podcast index database quality dashboard☆15Apr 19, 2026Updated last week
- Early Detection of Fake News with Multi-source Weak Social Supervision☆24Jun 12, 2023Updated 2 years ago
- Official implementation of "MadCLIP: Few-shot Medical Anomaly Detection with CLIP" (MICCAI 2025, Early Accepted).☆28Jul 24, 2025Updated 9 months ago
- ☆12May 10, 2024Updated last year
- ☆18Apr 2, 2021Updated 5 years ago
- FrugalScore is an approach to learn a fixed, low cost version of any expensive NLG metric, while retaining most of its original performan…☆16Sep 21, 2022Updated 3 years ago
- In this notebook, I am updating NLP notebooks, and projects☆10Jun 29, 2023Updated 2 years ago
- Interface LLMs from within MISP to extract TTPs and threat intel from CTI reports☆18Nov 13, 2023Updated 2 years ago
- FDSML Course Project 2020/21☆16May 11, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Framework for controlling demographic biases in NLG (using adversarial prompts)☆21Jun 12, 2023Updated 2 years ago
- Official codes for paper "TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts" (AAAI-25)☆16May 23, 2025Updated 11 months ago
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆18Sep 23, 2023Updated 2 years ago
- Noise Reduction Methods for Distantly Supervised Biomedical Relation Extraction☆11Oct 25, 2017Updated 8 years ago
- 爬取雪球网股票评论☆18Apr 28, 2025Updated last year
- ☆13Jan 10, 2023Updated 3 years ago
- ☆20Aug 29, 2024Updated last year
- Data from paper: "Benign Effects of Automation: New Evidence from Patent Texts"☆12May 31, 2025Updated 10 months ago
- HackerRank, LeetCode, Cracking the Coding Interview Solutions in Python/C++☆11Mar 15, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆22Dec 12, 2024Updated last year
- meta_llama_2finetuned_text_generation_summarization☆21Jul 21, 2023Updated 2 years ago
- LLMEval☆11Feb 12, 2024Updated 2 years ago
- This repo is for the Mis2-KDD 2021 under review paper: Dataset of Propaganda Techniques of the State-Sponsored Information Operation of t…☆19Feb 5, 2022Updated 4 years ago
- Code for the AAAI 2023 Paper "Real or Fake Text?: Investigating Human Ability to Detect Boundaries Between Human-Written and Machine-Gene…☆17Oct 29, 2024Updated last year
- An Open-source Factuality Evaluation Demo for LLMs☆32Feb 23, 2026Updated 2 months ago
- Data and codes for BioBERT-MRC☆11Oct 5, 2021Updated 4 years ago