rhgarcia / tropescraper
A tropes scraper
☆33Updated last year
Alternatives and similar repositories for tropescraper:
Users who are interested in tropescraper are comparing it to the libraries listed below.
- Quote identification, attribution and resolution.☆12Updated last year
- ☆55Updated 2 years ago
- Scansion tool for Spanish texts☆12Updated last year
- A Python scraping module that extracts text from articles found in RSS feeds. Uses SQLite as its database.☆19Updated 8 months ago
- Faster, modernized fork of the language identification tool langid.py☆55Updated 4 months ago
- ETL pipeline, graphical explorer and general toolbox for investigations with follow the money data☆15Updated last year
- Poetic processing, for Python.☆40Updated 11 months ago
- The RadioTalk dataset of talk radio transcripts☆57Updated 4 years ago
- Discourse Analysis Tool Suite☆19Updated this week
- Extract networks of entities from journalistic reporting☆48Updated last year
- ISO 639 language codes☆39Updated last month
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Fast syllable estimation library based on pattern matching.☆37Updated 3 weeks ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk☆12Updated 5 years ago
- Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.☆11Updated last year
- Web interface for network analysis.☆21Updated 2 years ago
- A tool for telling stories with maps.☆27Updated 6 months ago
- Libraries, Archives and Museums (LAM)☆82Updated 2 years ago
- ☆12Updated 9 years ago
- A tool for analyzing the word histories of a text.☆34Updated 4 months ago
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.☆32Updated 2 years ago
- Natural Language Inflection in English☆11Updated 3 years ago
- Python-based Wikidata framework for easy dataframe extraction☆43Updated last year
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆107Updated 6 years ago
- image-to-text model for PDF.js☆36Updated 2 weeks ago
- 🧮 Python package to construct word embeddings for small data using PMI and SVD☆17Updated 4 years ago
- Ranking signals for Wikidata☆68Updated 2 weeks ago
- Multilingual syllable annotation pipeline component for spaCy☆39Updated 2 years ago
- Scrollership through 20m PubMed abstracts.☆26Updated last year