saran9991 / llm-data-annotation
Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance
☆35Updated last year
Alternatives and similar repositories for llm-data-annotation:
Users that are interested in llm-data-annotation are comparing it to the libraries listed below
- Efficient Attention for Long Sequence Processing☆93Updated last year
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning☆154Updated last year
- ☆70Updated 6 months ago
- Text classification with Foundation Language Model LLaMA☆115Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM.☆41Updated 6 months ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆39Updated last year
- Benchmarking Large Language Models☆94Updated last week
- Fine-tuning of Flan-5T LLM for text classification 🤖 focuses on adapting a state-of-the-art language model to enhance its ability to cla…☆38Updated 5 months ago
- ☆42Updated last year
- Retrieval-Augmented Generation-based Relation Extraction☆37Updated 2 months ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆55Updated 11 months ago
- Multilingual Large Language Models Evaluation Benchmark☆119Updated 7 months ago
- ☆359Updated last year
- Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)☆80Updated 10 months ago
- SciFive: a text-text transformer model for biomedical literature☆94Updated 10 months ago
- ☆44Updated 2 years ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆120Updated 3 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆126Updated last year
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…☆95Updated 2 years ago
- MEDIQA-Chat Shared Tasks @ ACL-ClinicalNLP 2023☆50Updated last year
- A Python Natural Language Processing Toolkit for Medical Text Generation☆76Updated 5 months ago
- Resources for cultural NLP research☆86Updated 2 months ago
- Long Document Summarization Papers☆145Updated last year
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆104Updated last year
- ☆34Updated 6 months ago
- ☆38Updated last year
- We evaluate many models used for biomedical and clinical nlp tasks, and train new models that perform much better.☆158Updated 3 years ago
- 🔍 A statutory article retrieval dataset in French. (ACL 2022)☆39Updated last year
- ☆90Updated last year