Pre-trained Nordic models for BERT
☆175Nov 12, 2021Updated 4 years ago
Alternatives and similar repositories for nordic_bert
Users that are interested in nordic_bert are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The weights for the embedding layer of Scandinavian UMLFiT language models☆32Dec 5, 2019Updated 6 years ago
- Ælæctra was created as part of a Cognitive Science bachelor thesis, in the attempt to enhance the Danish NLP community with a more effici…☆28Oct 31, 2022Updated 3 years ago
- The Danish Gigaword project☆16Jan 25, 2021Updated 5 years ago
- Prompt templating and versioning using jinja2 and litellm 🔥☆16Jan 21, 2024Updated 2 years ago
- An open-source Python package for Danish speech recognition☆36Feb 19, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Simple customizable pipeline tool for anonymizing Danish text.☆11Sep 19, 2024Updated last year
- Natural language understanding benchmarks for Norwegian☆14Aug 29, 2025Updated 6 months ago
- Chatbot Question Dataset Of Questions about the Covid-19 crisis☆11May 7, 2020Updated 5 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Apr 15, 2024Updated last year
- Code and Swedish pre-trained models for BERT☆12Feb 5, 2020Updated 6 years ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15May 3, 2021Updated 4 years ago
- ML Powered Danish Sentiment Model☆14Jun 4, 2024Updated last year
- Norwegian Speech Transformer Models☆19Updated this week
- Docker container for UDPipe (https://github.com/ufal/udpipe) REST server.☆12Jun 23, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Open broad-coverage corpus for Finnish named entity recognition.☆12Aug 22, 2020Updated 5 years ago
- Converting irregularly spaced time series, such as eletronic health records, into dataframes for tabular classification.☆19Jun 17, 2025Updated 9 months ago
- Named entity recognition built on top of BERT and keras-bert.☆14Aug 20, 2020Updated 5 years ago
- ParlaMint: Comparable Parliamentary Corpora☆77Nov 2, 2025Updated 4 months ago
- A modular and extensible Python framework, designed to aid in the creation of high-quality, unbiased datasets to build robust models for …☆20Mar 7, 2026Updated 2 weeks ago
- Fine-tuning of transformers for Sentiment Analysis☆19May 25, 2021Updated 4 years ago
- Contains code used to conduct experiments on dependency parsing with the Tensor-LSTM model developed for our paper "Cross-Lingual Depende…☆13Jan 5, 2017Updated 9 years ago
- Parsing only with Pretraining Networks☆16Jul 25, 2024Updated last year
- ☆15Mar 27, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- SOTA TAG Parser☆15Jan 19, 2019Updated 7 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆157May 24, 2024Updated last year
- This is a simple variational autoencoder (VAE) implemented in torch for R☆11Jan 4, 2022Updated 4 years ago
- Spacy model trained based on Norwegian corpus converted from OBT to Universal dep.☆13Jan 31, 2018Updated 8 years ago
- Python standalone tokenizer☆15Nov 12, 2015Updated 10 years ago
- BERT model trained from scratch on Finnish☆96Aug 26, 2021Updated 4 years ago
- Shared code for training sentence embeddings with Flax / JAX☆28Jul 15, 2021Updated 4 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆47Mar 11, 2026Updated 2 weeks ago
- Primary repository for the NLP course as part of the CogSci masters program at Aarhus University.☆23Sep 16, 2025Updated 6 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆48Dec 23, 2018Updated 7 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆20Updated this week
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68May 12, 2022Updated 3 years ago
- Blazing fast language detection using fastText model☆24Dec 18, 2022Updated 3 years ago
- Code for my blog post on Generating Words from Embeddings☆23Jul 25, 2024Updated last year
- ☆11Dec 3, 2020Updated 5 years ago
- ☆142Oct 15, 2020Updated 5 years ago