shunk031 / TedScraper
Scraper for TED Talks in Python. Get talk title, transcript, talk topics and so on.
☆15Updated 8 years ago
Alternatives and similar repositories for TedScraper
Users who are interested in TedScraper are comparing it to the libraries listed below.
- Automatically exported from code.google.com/p/guess-language☆54Updated 3 months ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary.☆49Updated 2 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Updated last month
- Command-line corpus tools☆10Updated 8 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆58Updated 4 years ago
- The project lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques☆30Updated 5 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 5 years ago
- ☆40Updated 7 years ago
- Finds linguistic patterns effortlessly☆39Updated 2 years ago
- Interactive visualization of Wiktionary words and etymologies.☆98Updated 2 weeks ago
- A natural language date parser. (Python version of chrono.js)☆25Updated 8 months ago
- tool for collectively summarizing large discussions☆145Updated 3 years ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).☆53Updated last year
- Post-processing OCR errors with seq2seq models☆28Updated 5 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated last year
- Natural language generation language☆56Updated 6 years ago
- clone of https://code.google.com/p/splitta/ so it can be a git submodule☆34Updated 12 years ago
- This repository is for Web Crawling, Information Extraction, and Knowledge Graph build-up.☆33Updated 7 years ago
- Code for Deep-speare: a joint neural model of poetic language, meter and rhyme☆78Updated 3 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 3 years ago
- PDF Extraction Toolkit☆42Updated 5 years ago
- Python tools for interacting with Wikidata☆161Updated 2 years ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆127Updated last year
- A python module for word inflections designed for use with spaCy.☆93Updated 6 years ago
- Lightning Fast Language Prediction 🚀☆167Updated 5 months ago
- Extract a plain text corpus from MediaWiki XML dumps, such as Wikipedia.☆134Updated 7 years ago
- bin files☆13Updated last year
- WordNet Domains, WordNet Affect and SentiWords☆48Updated 10 years ago
- Wikidata embedding☆51Updated last year
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆47Updated 8 years ago