Tools to train and explore diachronic word embeddings from Big Historical Data
☆31Apr 18, 2026Updated last month
Alternatives and similar repositories for DiachronicEmb-BigHistData
Users that are interested in DiachronicEmb-BigHistData are comparing it to the libraries listed below.
- ☆17Jan 21, 2025Updated last year
- Neural Language Models for Historical Research☆29Oct 16, 2024Updated last year
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆13Jul 1, 2024Updated last year
- The official Github for the American Stories dataset as in {link}☆134Mar 7, 2024Updated 2 years ago
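- ChineseDiachronicCorpus, a Chinese diachronic corpus spanning more than sixty years, including Tencent diachronic news (2000-2016), the People's Daily diachronic corpus (1946-2003), and the Cankao Xiaoxi (Reference News) diachronic corpus (1957-2002). Built on diachronic circulating corpora, it provides foundational corpus support for computing diachronic language change, language monitoring, and research on sociocultural change…☆23Jan 10, 2021Updated 5 years ago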
- Repository for Quantifying Valence and Arousal in Text with Multilingual Pre-trained Transformers☆43Feb 26, 2023Updated 3 years ago
- MRQAP Implementation in Python☆24Apr 20, 2020Updated 6 years ago
- Simple Python wrapper for querying data with TikTok's research API☆13Dec 25, 2023Updated 2 years ago
- Flexible calculation of moral foundation scores from textual input data based on word embedding methods.☆46Mar 22, 2023Updated 3 years ago
- ☆10Sep 10, 2022Updated 3 years ago
- ☆11Aug 14, 2018Updated 7 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- Framework for the extraction of features from Wikipedia XML dumps.☆10Jun 18, 2021Updated 4 years ago
- Turkish and English Dataset from "Large-Scale Hate Speech Detection with Cross-Domain Transfer"☆31Oct 11, 2023Updated 2 years ago
- ☆13Jun 27, 2023Updated 2 years ago
- Using NLTK to run an analysis on the Harry Potter corpus - different versions of this talk were given at Codeland in NYC and DjangoCon US…☆36Nov 10, 2018Updated 7 years ago
- Web based semantic visualization tool☆12Feb 16, 2017Updated 9 years ago
- Digital Outrage Classifier from the Crockett Lab at Yale. Predicts whether tweets contain moral outrage.☆31Apr 7, 2023Updated 3 years ago
- Programming for Historians☆17Sep 12, 2022Updated 3 years ago
- ☆16Nov 5, 2018Updated 7 years ago
- Frame Semantic Parser based on T5 and FrameNet☆69Sep 13, 2023Updated 2 years ago
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation☆14Feb 11, 2018Updated 8 years ago
- Covid-19 weibo rumor dataset, collected from 2020.1.22 to 2021.4.22☆13Jun 27, 2021Updated 4 years ago
- Sources of materials for the course Data Analysis with Python - Summer 2019☆12Sep 4, 2019Updated 6 years ago
- Machine Learning basics with phishing dataset☆10Apr 19, 2021Updated 5 years ago
- 🖼 A jQuery widget to query heterogeneous interfaces using Comunica SPARQL☆20Updated this week
- Sphinx extension for quizdown.js☆15Sep 21, 2022Updated 3 years ago
- code base for constructing narrative statements from text☆124Jan 13, 2026Updated 4 months ago
- State-of-the-art count-based word embeddings for low-resource languages.☆12Nov 13, 2025Updated 6 months ago
- 🌍 Comunica engine scripts for Web browsers☆16May 14, 2026Updated 2 weeks ago
- VIAF via Python☆13May 20, 2026Updated last week
- Scrapes Google Books Ngram data to create a long word list☆14Feb 24, 2024Updated 2 years ago
- ☆11Jun 3, 2021Updated 4 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- ☆12Jun 3, 2021Updated 4 years ago
- Code for measuring novelty in science using publication text☆37Mar 4, 2025Updated last year
- pydistinto - a Python implementation of different measures of distinctiveness for contrastive text analysis☆11May 15, 2025Updated last year
- BirdSpotter is a python package which provides an influence and bot detection toolkit for twitter.☆19Mar 10, 2021Updated 5 years ago
- Tutorial : Spatial econometrics for cross-sectional data (Columbus crime example)☆10Aug 3, 2022Updated 3 years ago