nschaetti / SFGram-dataset
SFGram (Science-Fiction Gram) is a dataset of public science-fiction novels, books and movie covers. It is designed to be used by researchers to study the evolution of the science-fiction literature over time and to test machine learning algorithms on authorship attribution and document classification tasks. All the documents are now published o…
☆31Updated 6 years ago
Alternatives and similar repositories for SFGram-dataset:
Users that are interested in SFGram-dataset are comparing it to the libraries listed below
- ☆57Updated 2 years ago
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.☆32Updated 2 years ago
- Finds linguistic patterns effortlessly☆36Updated last year
- Code for the paper "Towards an Argument Mining Pipeline Transforming Texts to Argument Graphs" presented at COMMA 2020☆23Updated last month
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- Weird A.I. Yankovic neural-net based lyrics parody generator☆84Updated 3 years ago
- Discourse Analysis Tool Suite☆20Updated this week
- LegalCrawler: A tool for automated scraping of English legal corpora☆55Updated 2 years ago
- Libraries, Archives and Museums (LAM)☆82Updated 2 years ago
- Homebase of the IPTC EXTRA project about rule-based text categorization☆13Updated 7 years ago
- A deep learning model for extracting references from text☆28Updated last year
- The ScriptBase Corpus☆43Updated 6 years ago
- A collection of open source tools and resources related to Wikibase knowledge graphs☆70Updated last year
- Machine Learning scripts for the identification of human values behind arguments.☆24Updated last year
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆106Updated 6 years ago
- Dataset of personal narratives with Advice Seeking Questions☆15Updated 5 years ago
- Generating Interactive Fiction worlds from story plots☆75Updated 2 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- I.PHI dataset generation☆25Updated last year
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- MultiCite code and data. Models are available on Huggingface.☆31Updated 2 years ago
- Repo for the LREC 2022 paper The Project Dialogism Novel Corpus: A Dataset for Quotation Attribution in Literary Texts.☆13Updated 2 years ago
- Poetic processing, for Python.☆40Updated 11 months ago
- Language models are open knowledge graphs ( non official implementation )☆13Updated 4 years ago
- Frame Semantic Parser based on T5 and FrameNet☆59Updated last year
- GreenLIT: Using GPT-J with Multi-Task Learning to Create New Screenplays☆17Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- an experimental implementation of Burrow's delta in Python 3☆21Updated 3 years ago
- Calculate Krippendorff's Alpha on any DataFrame☆37Updated last year
- VerbNet semantic parser and related utilities☆36Updated 2 years ago