saran9991 / llm-data-annotation
Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance
☆35Updated last year
Alternatives and similar repositories for llm-data-annotation:
Users that are interested in llm-data-annotation are comparing it to the libraries listed below
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆55Updated 11 months ago
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆22Updated 11 months ago
- ☆70Updated 6 months ago
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…☆95Updated 2 years ago
- Token-level Reference-free Hallucination Detection☆94Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆126Updated last year
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning☆153Updated last year
- A curated list of research papers and resources on Cultural LLM.☆41Updated 6 months ago
- Efficient Attention for Long Sequence Processing☆93Updated last year
- In this implementation, using the Flan T5 large language model, we performed the Text Classification task on the IMDB dataset and obtaine…☆21Updated last year
- ☆89Updated last month
- ☆26Updated 5 months ago
- ☆38Updated last year
- ☆359Updated last year
- Codebase, data and models for the SummaC paper in TACL☆89Updated 2 months ago
- ☆42Updated last year
- Text classification with Foundation Language Model LLaMA☆115Updated 2 years ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆78Updated 2 months ago
- ☆40Updated 11 months ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆62Updated 2 years ago
- ☆61Updated 4 years ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆124Updated last year
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆112Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitter☆108Updated last year
- RARR: Researching and Revising What Language Models Say, Using Language Models☆46Updated last year
- Fine-tuning of Flan-5T LLM for text classification 🤖 focuses on adapting a state-of-the-art language model to enhance its ability to cla…☆38Updated 5 months ago
- ☆23Updated 8 months ago
- ☆39Updated 2 years ago
- Easy multi-task learning with HuggingFace Datasets and Trainer☆55Updated 2 months ago
- Resources for cultural NLP research☆86Updated 2 months ago