ltgoslo / talk-of-norway
This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every speech is richly annotated with metadata pulled from different sources, and augmented with sentence, token, lemma, part-of-speech and morphological feature annotations.
☆31 Updated last year
Alternatives and similar repositories for talk-of-norway
Users who are interested in talk-of-norway are comparing it to the repositories listed below
- Citation Classification using hybrid neural network model for Wikipedia References ☆28 Updated 2 years ago
- DHLAB is a library of python modules for accessing text and pictures at the National Library of Norway. ☆22 Updated last week
- ☆12 Updated 2 years ago
- Scrape and structure raw data from the Norwegian parliament's API. ☆12 Updated last month
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation ☆14 Updated 7 years ago
- A deep learning architecture for reference mining from literature in the arts and humanities. ☆16 Updated 5 years ago
- A trend viewer written in Python/JavaScript ☆21 Updated 6 months ago
- Special Topics in AI: Artificial Intelligence as an Archival Science ☆17 Updated last year
- A maximum-strength name parser for record linkage. ☆37 Updated last week
- Provide partial dates and retain the date precision through processing ☆13 Updated 2 years ago
- Topic Modeling Workflow in Python ☆16 Updated 2 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance ☆12 Updated 8 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 Updated 6 years ago
- CSV on the web ☆40 Updated 2 months ago
- Documents for the project Libraccess ☆13 Updated 10 years ago
- Service for creating Twitter datasets for research and archiving. ☆26 Updated 2 years ago
- Adding links to full text in Wikipedia references ☆37 Updated last year
- European Parliament Open Data // Twitter ☆20 Updated 2 years ago
- The GitHub repository for the AI for Humanists Project ☆18 Updated 3 weeks ago
- ☆17 Updated 7 months ago
- R-package for text mining with the Corpus Workbench (CWB) as backend ☆50 Updated last month
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 Updated last year
- Bill cosponsorship networks in European parliaments. ☆17 Updated 8 years ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists. ☆17 Updated 2 weeks ago
- Extract networks of entities from journalistic reporting ☆48 Updated last year
- A minimal Akoma Ntoso-based legal informatics toolchain ☆14 Updated last year
- Investigative tool for extracting relevant areas from many documents ☆14 Updated 9 years ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank) ☆69 Updated 8 months ago
- OpenRefine reconciler for Research Organization Registry ☆13 Updated last month
- Data Donation Module: A Django application to setup and manage data donation projects. ☆23 Updated 2 weeks ago