shunk031 / TedScraperLinks
Scraper for TED Talks in Python. Get talk title, transcript, talk topics and so on.
☆15Updated 8 years ago
Alternatives and similar repositories for TedScraper
Users that are interested in TedScraper are comparing it to the libraries listed below
Sorting:
- Automatically exported from code.google.com/p/guess-language☆54Updated last month
- tool for collectively summarizing large discussions☆145Updated 3 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Updated this week
- An LL parser for extracting information from Wiki text, particularly Wiktionary.☆49Updated 2 years ago
- GloVe word vector embedding experiments (similar to Word2Vec)☆67Updated 2 years ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- Finds linguistic patterns effortlessly☆39Updated 2 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆56Updated 4 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆65Updated 2 weeks ago
- convert epub file to txt☆94Updated 5 years ago
- Scraper for downloading the entire ebooks repository of project Gutenberg☆154Updated this week
- Convert ALTO XML to plain text + minimal metadata☆17Updated last year
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera…☆32Updated 6 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆31Updated last month
- 💡✏️️ ⬇️️ JSON to Markdown converter - Generate Markdown from format independent JSON☆78Updated 6 years ago
- clone of https://code.google.com/p/splitta/ so it can be a git submodule☆34Updated 12 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 5 years ago
- Offline bilingual dictionaries made using data from Wiktionary☆62Updated 10 years ago
- ☆40Updated 7 years ago
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/☆186Updated 2 weeks ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 10 months ago
- The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques☆29Updated 5 years ago
- A python module for word inflections designed for use with spaCy.☆93Updated 5 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆38Updated 11 years ago
- A Python module to discover the etymology of words☆151Updated last year
- Extract a plain text corpus from MediaWiki XML dumps, such as Wikipedia.☆134Updated 7 years ago
- All TED talks narratives extracted and cleaned.☆101Updated 7 years ago
- This repository for Web Crawling, Information Extraction, and Knowledge Graph build up.☆33Updated 7 years ago
- ☆14Updated 3 years ago
- Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual suppo…☆47Updated 2 years ago