ZurichNLP / swissbertLinks
The multilingual language model for Switzerland
☆27Updated last year
Alternatives and similar repositories for swissbert
Users that are interested in swissbert are comparing it to the libraries listed below
Sorting:
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105Updated 3 years ago
- A tiny BERT for low-resource monolingual models☆31Updated 11 months ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Semantically Structured Sentence Embeddings☆67Updated 11 months ago
- NTREX -- News Test References for MT Evaluation☆85Updated last year
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 9 months ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆73Updated last year
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆97Updated 2 years ago
- ☆169Updated last year
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆88Updated 4 months ago
- Automatically detect errors in annotated corpora.☆47Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆104Updated last year
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations…☆22Updated 2 years ago
- ☆17Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 4 years ago
- Bi-encoder entity linking architecture☆50Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆82Updated last year
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆33Updated last year
- XED multilingual emotion datasets☆62Updated 2 years ago
- ☆35Updated 3 years ago
- OpusFilter - Parallel corpus processing toolkit☆109Updated last month
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆44Updated last year
- ☆104Updated 4 years ago
- An easy-to-use API for analyzing INCEpTION annotation projects.☆17Updated last year
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆102Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆61Updated last year
- Repository for Vajjala & Lucic (2018)☆65Updated last year
- Multilingual Entity Linking model by BELA model☆12Updated 2 years ago