saran9991 / llm-data-annotationLinks
Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance
☆37Updated 2 years ago
Alternatives and similar repositories for llm-data-annotation
Users that are interested in llm-data-annotation are comparing it to the libraries listed below
Sorting:
- ☆369Updated last year
- Benchmarking Large Language Models☆99Updated 3 months ago
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning☆155Updated last year
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder …☆162Updated 3 months ago
- ☆79Updated last year
- Zero and Few shot named entity & relationships recognition☆388Updated 3 weeks ago
- ☆46Updated 3 years ago
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in English☆225Updated 2 months ago
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆144Updated last year
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…☆96Updated 3 years ago
- Active Learning for Text Classification in Python☆628Updated last month
- Text classification with Foundation Language Model LLaMA☆114Updated 2 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆264Updated 11 months ago
- Efficient Attention for Long Sequence Processing☆97Updated last year
- Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)☆85Updated last year
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- Building NER and RE components using HuggingFace Transformers☆51Updated 3 years ago
- multimodal document analysis☆167Updated last year
- Guideline following Large Language Model for Information Extraction☆402Updated 11 months ago
- ☆45Updated 2 years ago
- ☆92Updated 8 months ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆192Updated last month
- We evaluate many models used for biomedical and clinical nlp tasks, and train new models that perform much better.☆162Updated 4 years ago
- Bi-encoder entity linking architecture☆50Updated last year
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆138Updated last year
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆56Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆337Updated 2 years ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).☆157Updated 3 weeks ago
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extraction…☆105Updated last year
- [ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links☆446Updated 3 years ago