stefan-it / italian-bertelectraLinks

🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)

☆18

Alternatives and similar repositories for italian-bertelectra

Users that are interested in italian-bertelectra are comparing it to the libraries listed below

Sorting:

gsarti / it5
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹
☆30Updated last year
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆57Updated 3 years ago
CPJKU / wechsel
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
☆85Updated last year
ccasimiro88 / TranslateAlignRetrieve
Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.
☆59Updated 2 years ago
MilaNLProc / simple-generation
A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.
☆28Updated last year
cardiffnlp / xlm-t
Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
☆158Updated 2 years ago
lgessler / microbert
A tiny BERT for low-resource monolingual models
☆31Updated last month
crux82 / squad-it
A large scale dataset for Question Answering in Italian
☆27Updated 7 years ago
musixmatchresearch / umberto
UmBERTo: an Italian Language Model trained with Whole Word Masking.
☆110Updated 2 years ago
cardiffnlp / timelms
TimeLMs: Diachronic Language Models from Twitter
☆111Updated last year
patrickvonplaten / Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆111Updated 3 years ago
teelinsan / camoscio
Camoscio: An Italian instruction-tuned language model based on LLaMA
☆127Updated last year
ltgoslo / ltg-bert
LTG-Bert
☆34Updated last year
mariosasko / datasets_sql
Execute arbitrary SQL queries on 🤗 Datasets
☆32Updated last year
infinitylogesh / mutate
A library to synthesize text datasets using Large Language Models (LLM)
☆151Updated 2 years ago
anton-l / wav2vec-toolkit
A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
☆31Updated 4 years ago
bminixhofer / gerpt2
German small and large versions of GPT2.
☆20Updated 3 years ago
ZurichNLP / mbr
Minimum Bayes Risk Decoding for Hugging Face Transformers
☆60Updated last year
mainlp / germanic-lrl-corpora
A survey of corpora for Germanic low-resource languages and dialects
☆26Updated 11 months ago
helboukkouri / character-bert
Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"
☆200Updated 2 years ago
sophiaalthammer / parm
This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…
☆41Updated 3 years ago
chrishokamp / zero-shot-ner-fine-tuning
zero shot NER fine tuning
☆13Updated 8 months ago
dbmdz / berts
DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models
☆154Updated 2 years ago
oliverguhr / spelling
This is a neural spelling checker
☆68Updated 2 years ago
UBC-NLP / afrolid
AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.
☆34Updated 8 months ago
masakhane-io / masakhane-ner
☆115Updated last month
patrickvonplaten / notebooks
Some notebooks for NLP
☆207Updated 2 years ago
ikergarcia1996 / MetaVec
A monolingual and cross-lingual meta-embedding generation and evaluation framework
☆79Updated 3 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆96Updated 2 years ago
wietsedv / gpt2-recycle
As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)
☆48Updated 4 years ago