saran9991 / llm-data-annotationLinks
Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance
☆39Updated 2 years ago
Alternatives and similar repositories for llm-data-annotation
Users that are interested in llm-data-annotation are comparing it to the libraries listed below
Sorting:
- ☆370Updated last year
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆147Updated last year
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…☆98Updated 3 years ago
- Text classification with Foundation Language Model LLaMA☆113Updated 2 years ago
- ☆95Updated 9 months ago
- ☆51Updated 4 years ago
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning☆154Updated last year
- [ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links☆449Updated 3 years ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- Benchmarking Large Language Models☆101Updated 5 months ago
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder …☆162Updated 5 months ago
- ☆80Updated last year
- ☆45Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆214Updated last year
- A Python Natural Language Processing Toolkit for Medical Text Generation☆84Updated 6 months ago
- ☆39Updated last year
- We evaluate many models used for biomedical and clinical nlp tasks, and train new models that perform much better.☆162Updated 4 years ago
- Long Document Summarization Papers☆154Updated 2 years ago
- Efficient Attention for Long Sequence Processing☆98Updated last year
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆57Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆144Updated last year
- Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to…☆37Updated 2 years ago
- ☆47Updated 3 years ago
- Data and models for the SciFact verification task.☆244Updated 2 years ago
- Retrieval-Augmented Generation-based Relation Extraction☆48Updated 3 weeks ago
- Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)☆87Updated last year
- A Framework for Textual Entailment based Zero Shot text classification☆153Updated last year
- Guideline following Large Language Model for Information Extraction☆409Updated last year
- a library for named entity recognition developed by UF HOBI NLP lab featuring SOTA algorithms☆154Updated 2 years ago
- Fine-tuning of Flan-5T LLM for text classification 🤖 focuses on adapting a state-of-the-art language model to enhance its ability to cla…☆44Updated last year