rhgarcia / tropescraper
A tropes scraper
☆32 Updated last year
Alternatives and similar repositories for tropescraper:
Users who are interested in tropescraper are comparing it to the libraries listed below.
- WordWanderer – take your text for a walk ☆12 Updated 5 years ago
- A maximum-strength name parser for record linkage. ☆36 Updated last week
- ☆55 Updated 2 years ago
- Quote identification, attribution and resolution. ☆12 Updated last year
- an experimental implementation of Burrows' Delta in Python 3 ☆20 Updated 3 years ago
- Word generation based on n-gram models, and a cli utility to generate said models. ☆17 Updated 8 years ago
- Stylometry library for Burrows' Delta method ☆34 Updated 9 months ago
- ☆12 Updated 9 years ago
- 🧮 Python package to construct word embeddings for small data using PMI and SVD ☆17 Updated 4 years ago
- ☆54 Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 Updated 3 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts. ☆37 Updated 5 years ago
- ☆30 Updated 2 years ago
- Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L… ☆22 Updated 2 years ago
- A helper library full of URL-related heuristics. ☆64 Updated 4 months ago
- Generate a SQLite database from Wikipedia & Wikidata dumps. ☆33 Updated 10 months ago
- Python wrapper library for the Datamuse API ☆77 Updated last year
- Scansion tool for Spanish texts ☆11 Updated last year
- Multilingual syllable annotation pipeline component for spacy ☆39 Updated last year
- Inspect a URL and estimate if it contains a news story ☆39 Updated 2 months ago
- Measure the similarity of text corpora for 74 languages ☆13 Updated last year
- The tastiest machine learning project. Can we predict who is speaking for how long during an episode of the syntax.fm podcast? ☆36 Updated 6 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41 Updated 5 years ago
- Python SDK for the TextRazor Text Analytics API ☆20 Updated last year
- Labeled segmentation for the document structure of printed books ☆13 Updated 7 years ago
- Language detection using Spacy and Fasttext ☆55 Updated last year
- Metaphone is a phonetic algorithm, an algorithm published in 1990 for indexing words by their English pronunciation. It fundamentally imp… ☆33 Updated 11 years ago
- The News Landscape Toolkit (NELA) ☆15 Updated 4 years ago
- extract relationships from standardized terms from corpus of interest with deep learning ☆20 Updated 5 years ago