jogonba2 / twilbert
Specialization of BERT architecture both for the Spanish language and the Twitter domain
☆13Updated 4 years ago
Alternatives and similar repositories for twilbert:
Users that are interested in twilbert are comparing it to the libraries listed below
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- Ready to use Spanish Word2Vec embeddings created from >18B chars and >3B words☆41Updated 5 years ago
- Unannotated Spanish 3 Billion Words Corpora☆101Updated 2 years ago
- Spanish Billion Word Corpus and Embeddings☆48Updated 2 years ago
- ☆23Updated 3 years ago
- ☆15Updated 6 years ago
- AlBERTo the first italian BERT model for Twitter languange understanding☆72Updated 4 years ago
- Official repository of the Hate Speech Detection Tasks at Evalita☆12Updated 4 years ago
- An easy-to-use library to extract indices from texts.☆29Updated 3 years ago
- Lista de corpus de PLN en español ✨ #Somos600M: Ayuda a desarrollar IA inclusiva que entienda las diferentes variedades de nuestras lengu…☆21Updated last year
- A collection of over 1.5 Million tweets data translated to French, with their sentiment.☆35Updated 7 years ago
- EmoEvent: A Multilingual Emotion Corpus based on different Events☆7Updated 3 years ago
- Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).☆258Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 9 months ago
- ☆64Updated 2 years ago
- ☆39Updated 3 years ago
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- Language Models for Zalando's flair library☆61Updated 5 years ago
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆14Updated 10 months ago
- French Machine Reading for Question Answering☆18Updated 2 years ago
- Data for the HIPE 2022 shared task.☆17Updated last year
- Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API.☆36Updated last year
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆30Updated 6 years ago
- A large (>5k) collection of search questions asked about Coronavirus 🦠☆14Updated 5 years ago
- Code for the paper "Content Analysis of Textbooks via Natural Language Processing".☆58Updated last year
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago
- Code for the CUP Elements on text analysis in Python for social scientists☆137Updated 2 years ago
- A french sequence to sequence pretrained model☆59Updated 2 years ago