dhfbk / KINDLinks
KIND: an Italian Multi-Domain Dataset for Named Entity Recognition
☆15Updated 2 years ago
Alternatives and similar repositories for KIND
Users that are interested in KIND are comparing it to the libraries listed below
Sorting:
- Automatically detect errors in annotated corpora.☆47Updated 2 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆28Updated last year
- A software for transferring pre-trained English models to foreign languages☆19Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆54Updated 2 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆88Updated 5 months ago
- (NAACL 2024) Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations☆14Updated 6 months ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆50Updated 3 years ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆16Updated 3 weeks ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆196Updated last month
- ☆75Updated 4 years ago
- Semantically Structured Sentence Embeddings☆67Updated last year
- A framework for evaluating Machine Translation models.☆10Updated 5 months ago
- ☆10Updated last year
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆55Updated 2 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆27Updated 4 years ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆14Updated 3 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆71Updated 2 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆92Updated 3 months ago
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 10 months ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 4 years ago
- GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)☆18Updated last year
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆11Updated 6 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆62Updated last year
- Attribute statements generated by LLMs to preceding tokens using attention weights.☆18Updated 6 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Updated 2 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Updated 2 years ago
- https://arxiv.org/abs/2404.10917☆14Updated 7 months ago