ltgoslo / talk-of-norwayLinks
This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every speech is richly annotated with metadata pulled from different sources, and augmented with sentence, token, lemma, part-of-speech and morphological feature annotations.
☆31Updated last year
Alternatives and similar repositories for talk-of-norway
Users that are interested in talk-of-norway are comparing it to the libraries listed below
Sorting:
- Citation Classification using hybrid neural network model for Wikipedia References☆30Updated 2 years ago
- A deep learning architecture for reference mining from literature in the arts and humanities.☆16Updated 5 years ago
- Extract networks of entities from journalistic reporting☆48Updated 2 years ago
- ☆12Updated 2 years ago
- The GitHub repository for the AI for Humanists Project☆18Updated last month
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- Matcher for affiliations - link raw affiliation to ROR ids, country and RNSR☆25Updated 6 months ago
- Code repository for whatisdigitalhumanities.com☆32Updated 2 years ago
- Special Topics in AI: Artificial Intelligence as an Archival Science☆17Updated last year
- Scrape and structure raw data from the Norwegian parliament's API.☆12Updated 3 months ago
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- Service for creating Twitter datasets for research and archiving.☆26Updated 2 years ago
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources☆16Updated 2 years ago
- ParlaMint: Comparable Parliamentary Corpora☆62Updated this week
- Tools for working with HTRC Feature Extraction files☆39Updated last week
- OpenRefine reconciler for Research Organization Registry☆13Updated 3 months ago
- Download and manipulate HathiTrust wordcount data in the tidyverse☆9Updated 3 years ago
- ☆72Updated 6 months ago
- Detect and visualize text reuse☆118Updated 10 months ago
- The RICardo dataset compiles trade statistics sources of international trade bilateral flows of the 19th century.☆19Updated this week
- Fast, permanent and flexible patterns for sharing and computing on texts with metadata using Apache Arrow.☆14Updated 3 years ago
- Amsterdam Content Analysis Toolkit☆46Updated 3 years ago
- Python package for harvesting records from OAI-PMH provider(s).☆64Updated 2 years ago
- MOVED to https://gitlab.com/crossref/reference_matching_evaluation_framework☆17Updated 6 years ago
- Inspection of tabular (csv, xls-like) files to guess the columns' content☆47Updated 3 weeks ago
- Tutorials for Stance Detection: A practical guide☆23Updated 2 years ago
- Explore your own text collection with a topic model – without prior knowledge.☆63Updated 6 months ago
- Making Patent Citations Uncool Again☆110Updated 2 years ago
- ☆9Updated 9 years ago
- Python implementation of the Zeta score for contrastive text analysis☆14Updated 4 years ago