ZurichNLP / swissbert
The multilingual language model for Switzerland
☆25Updated 9 months ago
Related projects
Alternatives and complementary repositories for swissbert
- A survey of corpora for Germanic low-resource languages and dialects☆24Updated 3 months ago
- ☆25Updated last month
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆11Updated 11 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆29Updated last year
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆30Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆52Updated last year
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition☆15Updated last year
- ☆20Updated 4 months ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆66Updated last year
- Codebase, data and models for the Keep it Simple paper at ACL2021☆36Updated last year
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆81Updated last month
- Software for transferring pre-trained English models to foreign languages☆18Updated last year
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- Evaluate language models using multiple choice items☆12Updated last month
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆31Updated 5 months ago
- NTREX -- News Test References for MT Evaluation☆75Updated 5 months ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 2 years ago
- GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)☆13Updated 4 months ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆20Updated 3 weeks ago
- Neural models for detecting and masking personal information from texts☆14Updated last year
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆28Updated 3 weeks ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆96Updated 6 months ago
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆20Updated 2 years ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…☆16Updated 3 years ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆13Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆74Updated last month
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆14Updated 7 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling.☆52Updated 3 months ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆70Updated 8 months ago
- GEMBA — GPT Estimation Metric Based Assessment☆100Updated 3 months ago