ltgoslo / talk-of-norway
This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every speech is richly annotated with metadata pulled from different sources, and augmented with sentence, token, lemma, part-of-speech and morphological feature annotations.
☆31 Updated last year
Alternatives and similar repositories for talk-of-norway
Users who are interested in talk-of-norway are comparing it to the repositories listed below.
- Topic Modeling Workflow in Python ☆16 Updated 2 years ago
- Scrape and structure raw data from the Norwegian parliament's API. ☆12 Updated 2 months ago
- A trend viewer written in Python/JavaScript ☆21 Updated 7 months ago
- Citation Classification using hybrid neural network model for Wikipedia References ☆29 Updated 2 years ago
- The GitHub repository for the AI for Humanists Project ☆18 Updated 2 weeks ago
- Tools for working with HTRC Feature Extraction files ☆39 Updated 6 months ago
- Download and manipulate HathiTrust wordcount data in the tidyverse ☆9 Updated 3 years ago
- Special Topics in AI: Artificial Intelligence as an Archival Science ☆17 Updated last year
- Python tools for text ☆15 Updated 5 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance ☆12 Updated 8 years ago
- R-package for text mining with the Corpus Workbench (CWB) as backend ☆50 Updated 3 months ago
- Documents for the project Libraccess ☆13 Updated 10 years ago
- Service for creating Twitter datasets for research and archiving. ☆26 Updated 2 years ago
- Python implementation of the Zeta score for contrastive text analysis ☆14 Updated 4 years ago
- An R package for analysis of dramatic texts ☆15 Updated 2 years ago
- A Mashup Interface for Text Analysis Operations ☆13 Updated 6 months ago
- Extract networks of entities from journalistic reporting ☆48 Updated last year
- Scripts that clean up OCR and munge Hathi metadata. ☆76 Updated 7 years ago
- Fast, permanent and flexible patterns for sharing and computing on texts with metadata using Apache Arrow. ☆14 Updated 3 years ago
- Tutorials for Stance Detection: A practical guide ☆22 Updated 2 years ago
- OpenRefine reconciler for Research Organization Registry ☆13 Updated 2 months ago
- A structured list of text corpora, created for use with a corpus downloader. ☆13 Updated 8 years ago
- Practical Approaches to Data Science with Text ☆39 Updated 5 years ago
- ParlaMint: Comparable Parliamentary Corpora ☆62 Updated this week
- A deep learning architecture for reference mining from literature in the arts and humanities. ☆16 Updated 5 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents. ☆31 Updated 7 years ago
- Bill cosponsorship networks in European parliaments. ☆17 Updated 8 years ago
- Amsterdam Content Analysis Toolkit ☆46 Updated 2 years ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 Updated last year
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure" ☆29 Updated 5 years ago