saran9991 / llm-data-annotation
Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance.
☆34Updated last year
Alternatives and similar repositories for llm-data-annotation:
Users who are interested in llm-data-annotation are comparing it to the libraries listed below.
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆126Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆118Updated 7 months ago
- Efficient Attention for Long Sequence Processing☆92Updated last year
- Token-level Reference-free Hallucination Detection☆94Updated last year
- ☆88Updated last month
- Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency☆35Updated 2 months ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆124Updated last year
- Fine-tuning of Flan-T5 LLM for text classification 🤖 focuses on adapting a state-of-the-art language model to enhance its ability to cla…☆38Updated 5 months ago
- Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.☆13Updated 8 months ago
- Text classification with Foundation Language Model LLaMA☆115Updated 2 years ago
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆39Updated last year
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands.☆55Updated 10 months ago
- 🔍 A statutory article retrieval dataset in French. (ACL 2022)☆39Updated last year
- ☆70Updated 6 months ago
- Codebase, data and models for the SummaC paper in TACL☆89Updated last month
- A curated list of research papers and resources on Cultural LLM.☆41Updated 6 months ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆62Updated 2 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆345Updated 2 years ago
- ☆23Updated 7 months ago
- ☆23Updated last year
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆22Updated 11 months ago
- ☆34Updated 5 months ago
- ☆38Updated last year
- ☆95Updated last year
- ☆39Updated 2 years ago
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…☆95Updated 2 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 2 years ago
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning☆153Updated last year
- [NAACL 2022] Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning.☆57Updated 11 months ago
- Repository for research in the field of Responsible NLP at Meta.☆198Updated 4 months ago