ltgoslo / talk-of-norwayLinks
This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every speech is richly annotated with metadata pulled from different sources, and augmented with sentence, token, lemma, part-of-speech and morphological feature annotations.
☆31Updated 2 years ago
Alternatives and similar repositories for talk-of-norway
Users that are interested in talk-of-norway are comparing it to the libraries listed below
Sorting:
- Inspect a URL and estimate if it contains a news story☆39Updated this week
- ☆76Updated last week
- Python implementation of the Zeta score for contrastive text analysis☆14Updated 4 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆31Updated 3 years ago
- Extract networks of entities from journalistic reporting☆49Updated 2 years ago
- Platform for journalists to search, analyse, categorise and share unstructured data☆56Updated this week
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- A trend viewer written in Python/JavaScript☆21Updated last year
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 8 years ago
- Detect and align similar passages☆115Updated 3 months ago
- ParlaMint: Comparable Parliamentary Corpora☆72Updated 2 months ago
- ☆12Updated 3 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆157Updated 2 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- The RICardo dataset compiles trade statistics sources of international trade bilateral flows of the 19th century.☆19Updated last month
- Amsterdam Content Analysis Toolkit☆46Updated 3 years ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)☆71Updated last year
- Project on the history of genre.☆24Updated 5 years ago
- System for building, visualizing, and working with LDA topic models☆97Updated this week
- Interpretable data visualizations for understanding how texts differ at the word level☆286Updated 10 months ago
- Python package for harvesting records from OAI-PMH provider(s).☆64Updated 3 years ago
- High-performance text aligner for large collections of texts☆54Updated last month
- Scripts that clean up OCR and munge Hathi metadata.☆77Updated 8 years ago
- Tools for text tokenization and encoding☆84Updated 4 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆112Updated 4 years ago
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.☆34Updated 2 years ago
- Provide partial dates and retain the date precision through processing☆14Updated 5 months ago
- A deep learning architecture for reference mining from literature in the arts and humanities.☆16Updated 6 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆47Updated 3 weeks ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆118Updated last month