Norwegian Speech Transformer Models
☆19 · Oct 17, 2025 · Updated 5 months ago
Alternatives and similar repositories for nostram
Users who are interested in nostram are comparing it to the libraries listed below.
- DHLAB is a library of python modules for accessing text and pictures at the National Library of Norway. ☆25 · Oct 13, 2025 · Updated 5 months ago
- Natural language understanding benchmarks for Norwegian ☆14 · Aug 29, 2025 · Updated 6 months ago
- texrex web page cleaning & ClaraX random walk crawler ☆11 · Dec 13, 2021 · Updated 4 years ago
- ☆17 · Nov 12, 2025 · Updated 4 months ago
- GC4LM: A Colossal (Biased) language model for German ☆13 · May 2, 2021 · Updated 4 years ago
- Neural models for detecting and masking personal information from texts ☆16 · Nov 25, 2022 · Updated 3 years ago
- Norwegian Transformer Model ☆117 · Jan 11, 2026 · Updated 2 months ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard. ☆26 · May 12, 2024 · Updated last year
- A Scandinavian Benchmark for sentence embeddings ☆46 · Dec 5, 2025 · Updated 3 months ago
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk ☆12 · Aug 5, 2019 · Updated 6 years ago
- Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Ho… ☆22 · Sep 2, 2022 · Updated 3 years ago
- clarin-dspace digital repository based on DSpace and LINDAT/CLARIN DSpace ☆28 · Updated this week
- The robust European language model benchmark. ☆164 · Updated this week
- Large-scale language models for Norwegian ☆44 · Feb 25, 2026 · Updated 3 weeks ago
- Command line tool for digging into WARC files ☆51 · Feb 27, 2026 · Updated 3 weeks ago
- ☆29 · Jul 17, 2019 · Updated 6 years ago
- Elastic support for Bokmål/Nynorsk ☆32 · Mar 30, 2017 · Updated 8 years ago
- RDF river plugin for harvesting metadata from Jena TDB, SPARQL endpoints or plain RDF files into Elasticsearch ☆10 · May 20, 2022 · Updated 3 years ago
- ☆10 · Jun 23, 2023 · Updated 2 years ago
- HTRflow is the underlying engine for our HTR-pipeline ☆72 · Mar 12, 2026 · Updated last week
- A Corpus Data Retrieval Index using Lucene for Look-Ups ☆20 · Updated this week
- Conversion scripts for coreference ☆29 · Sep 30, 2024 · Updated last year
- Python wrapper for phonetisaurus grapheme to phoneme tool ☆12 · Mar 11, 2021 · Updated 5 years ago
- Scraping daily reads from Blinkist and converting them into Markdown files. ☆10 · Jul 24, 2018 · Updated 7 years ago
- Converting irregularly spaced time series, such as electronic health records, into dataframes for tabular classification. ☆19 · Jun 17, 2025 · Updated 9 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 · Dec 10, 2025 · Updated 3 months ago
- A pre-commit hook for Pyrefly. ☆23 · Mar 12, 2026 · Updated last week
- The Universal Anaphora Scorer ☆15 · Sep 2, 2024 · Updated last year
- ☆13 · Mar 10, 2025 · Updated last year
- A Python wrapper for the bioRxiv API. ☆10 · Aug 18, 2021 · Updated 4 years ago
- ☆13 · Feb 12, 2023 · Updated 3 years ago
- ☆14 · Apr 24, 2024 · Updated last year
- Code to remove "noise" from hOCR output of Tesseract OCR. ☆14 · Oct 24, 2016 · Updated 9 years ago
- Sequence Labeling Parsing by Learning Across Representations ☆13 · Oct 3, 2019 · Updated 6 years ago
- Distillation of Ensemble Dependency Parsers into a Single Graph-Based Parser ☆11 · Oct 14, 2016 · Updated 9 years ago
- Computer Vision tutorial for DH Summer School Antwerp ☆11 · May 25, 2023 · Updated 2 years ago
- Dependency Parsing as Sequence Labeling with Python3+ and PyTorch1+ and MTL ☆10 · Nov 21, 2019 · Updated 6 years ago
- Pipeline for the production of digital scholarly editions of archival collections ☆14 · Feb 22, 2024 · Updated 2 years ago
- ☆29 · Jun 8, 2025 · Updated 9 months ago