Adrien-Luxey / Da-Fonky-Movie-Script-Parser
This Python script parses HTML movie scripts, such as the ones found on imsdb.com.
☆37Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Da-Fonky-Movie-Script-Parser
- Screenplay parser☆24Updated last year
- ☆13Updated 5 years ago
- 📄Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset☆22Updated last year
- Neural Network for Automatic Negation Detection☆20Updated 8 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Code and data for the AAAI'19 paper "Reverse-Engineering Satire, or 'Paper on Computational Humor Accepted Despite Making Serious Advance…☆12Updated last year
- The ScriptBase Corpus☆42Updated 6 years ago
- Tools for training and evaluating word embeddings based on subtitles. Published as "subs2vec: Word embeddings from subtitles in 55 langua…☆33Updated 4 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- English web corpus with 4M tokens and several annotation types☆25Updated last year
- Cleans Reddit Text Data☆81Updated 4 years ago
- Automated Screenplay Annotation for Extracting Storytelling Knowledge☆40Updated 7 years ago
- Weird A.I. Yankovic neural-net based lyrics parody generator☆84Updated 2 years ago
- Data and some code for the DopeLearning paper☆28Updated 8 years ago
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Updated 5 years ago
- TuRnIng POint Dataset☆46Updated 5 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆77Updated 10 months ago
- bin files☆13Updated 2 months ago
- Preprocessing scripts to read definitions and other information from dictionaries☆22Updated 7 years ago
- 25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural languag…☆85Updated 6 years ago
- An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.☆31Updated 2 months ago
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆30Updated 6 years ago
- ☆32Updated 4 years ago
- An API to access data from The New Yorker Caption Contest☆60Updated last year
- German GPT-2 model☆32Updated 3 years ago
- README WIKI:☆21Updated 3 years ago
- A Mechanical Turk Interface (amti) 🤖☆55Updated 10 months ago
- Generate variations of text through synonym matching☆12Updated 7 years ago
- A web application tagging and retrieval of arguments in text☆30Updated last year