saran9991 / llm-data-annotationLinks
Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance
☆40Updated 2 years ago
Alternatives and similar repositories for llm-data-annotation
Users that are interested in llm-data-annotation are comparing it to the libraries listed below
Sorting:
- ☆373Updated 2 years ago
- ☆80Updated last year
- ☆52Updated 4 years ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆237Updated 6 months ago
- Benchmarking Large Language Models☆105Updated 7 months ago
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning☆152Updated last year
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆146Updated last year
- ☆94Updated last year
- Building NER and RE components using HuggingFace Transformers☆51Updated 3 years ago
- Guideline following Large Language Model for Information Extraction☆426Updated last year
- Repository for research in the field of Responsible NLP at Meta.☆205Updated last week
- ☆48Updated 3 years ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆57Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆136Updated last year
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder …☆166Updated 7 months ago
- auto icd coding with prompt☆49Updated last year
- Retrieval-Augmented Generation-based Relation Extraction☆50Updated 3 months ago
- Biomedical Question Answering Datasets.☆123Updated 9 months ago
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…☆99Updated 3 years ago
- Text classification with Foundation Language Model LLaMA☆113Updated 2 years ago
- Multilingual/multidomain question generation datasets, models, and python library for question generation.☆373Updated last year
- ☆60Updated 4 years ago
- 🤖 Long-form question answering in the legal domain. (AAAI 2024)☆43Updated last year
- [ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links☆449Updated 3 years ago
- StereoSet: Measuring stereotypical bias in pretrained language models☆198Updated 3 years ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆225Updated last year
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆118Updated 3 years ago
- We evaluate many models used for biomedical and clinical nlp tasks, and train new models that perform much better.☆163Updated 4 years ago
- Efficient Attention for Long Sequence Processing☆98Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago