saran9991 / llm-data-annotation
Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance
☆34Updated last year
Alternatives and similar repositories for llm-data-annotation:
Users that are interested in llm-data-annotation are comparing it to the libraries listed below
- ☆61Updated 4 years ago
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…☆94Updated 2 years ago
- ☆86Updated 2 weeks ago
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning☆151Updated 11 months ago
- ☆31Updated 4 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆125Updated 11 months ago
- A Framework for Textual Entailment based Zero Shot text classification☆152Updated 11 months ago
- Benchmarking Large Language Models☆89Updated this week
- Multilingual Large Language Models Evaluation Benchmark☆117Updated 6 months ago
- Efficient Attention for Long Sequence Processing☆92Updated last year
- A Dataset for Direct Quotation Extraction and Attribution in News Articles.☆13Updated 3 years ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆121Updated 11 months ago
- Dataset and code for "Explainable Automated Fact-Checking for Public Health Claims" from EMNLP 2020.☆58Updated 3 years ago
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- Codebase, data and models for the SummaC paper in TACL☆87Updated 3 weeks ago
- Long Document Summarization Papers☆141Updated last year
- Token-level Reference-free Hallucination Detection☆94Updated last year
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆80Updated 2 years ago
- Text classification with Foundation Language Model LLaMA☆114Updated last year
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆37Updated 11 months ago
- BARTScore: Evaluating Generated Text as Text Generation☆342Updated 2 years ago
- ☆70Updated 4 months ago
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆21Updated 10 months ago
- ☆38Updated last year
- ☆36Updated last year
- ☆28Updated 2 years ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆55Updated 9 months ago
- ☆39Updated last month
- A text truncation method, useful for instance in long text classification☆23Updated 2 years ago
- A extension of Transformers library to include T5ForSequenceClassification class.☆37Updated last year