chrishokamp / zero-shot-ner-fine-tuningLinks

zero shot NER fine tuning

☆13

Alternatives and similar repositories for zero-shot-ner-fine-tuning

Users that are interested in zero-shot-ner-fine-tuning are comparing it to the libraries listed below

Sorting:

lgessler / microbert
A tiny BERT for low-resource monolingual models
☆31Updated last month
ltgoslo / ltg-bert
LTG-Bert
☆34Updated last year
sophiaalthammer / parm
This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…
☆41Updated 3 years ago
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆57Updated 3 years ago
Babelscape / wikineural
Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…
☆69Updated 2 years ago
Geotrend-research / smaller-transformers
Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.
☆104Updated 3 years ago
ChenghaoMou / embeddings
zero-vocab or low-vocab embeddings
☆18Updated 3 years ago
ArneBinder / pytorch-ie
PyTorch-IE: State-of-the-art Information Extraction in PyTorch
☆77Updated last month
Babelscape / multinerd
Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…
☆45Updated last year
tigerchen52 / GLADIS
GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)
☆18Updated last year
konstantinjdobler / focus
[EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"
☆34Updated 5 months ago
MinishLab / tokenlearn
Pre-train Static Word Embeddings
☆90Updated 2 months ago
CPJKU / wechsel
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
☆85Updated last year
hplt-project / OpusTrainer
Curriculum training
☆18Updated 4 months ago
cisnlp / GlotLID
💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
☆171Updated 5 months ago
PyThaiNLP / MultiEL
Multilingual Entity Linking model by BELA model
☆12Updated 2 years ago
flairNLP / fabricator
[EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.
☆110Updated last year
facebookresearch / BELA
Bi-encoder entity linking architecture
☆51Updated last year
MicrosoftTranslator / NTREX
NTREX -- News Test References for MT Evaluation
☆86Updated last year
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆96Updated 2 years ago
google-research / metricx
☆115Updated 11 months ago
shijie-wu / neural-transducer
This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.
☆76Updated 2 years ago
google-research / xtreme-up
☆52Updated 2 years ago
SkBlaz / rakun2
RaKUn 2.0 - A fast keyword detection algorithm
☆68Updated 3 months ago
stefan-it / xlm-v-experiments
Experiments for XLM-V Transformers Integeration
☆13Updated 2 years ago
laurieburchell / open-lid-dataset
Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)
☆75Updated 7 months ago
satya77 / Transformer_Temporal_Tagger
Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging
☆68Updated 3 years ago
flipz357 / S3BERT
Semantically Structured Sentence Embeddings
☆67Updated last year
dennlinger / TopicalChange
Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.
☆98Updated 2 years ago
bigscience-workshop / multilingual-modeling
BLOOM+1: Adapting BLOOM model to support a new unseen language
☆74Updated last year