shunk031 / TedScraper
Scraper for TED Talks in Python. Get talk title, transcript, talk topics and so on.
☆15Updated 7 years ago
Alternatives and similar repositories for TedScraper:
Users that are interested in TedScraper are comparing it to the libraries listed below
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆63Updated 11 months ago
- Command-line corpus tools☆9Updated 7 years ago
- Automatically exported from code.google.com/p/guess-language☆53Updated last year
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- clone of https://code.google.com/p/splitta/ so it can be a git submodule☆34Updated 11 years ago
- The zhong [|] Chinese grammars☆14Updated 3 years ago
- ☆40Updated 7 years ago
- Browser-based annotation tool for Framenet☆16Updated 10 years ago
- A PDFMiner wrapper to ease the text extraction from pdf files.☆25Updated 12 years ago
- A web application for exploring documents topically.☆26Updated 8 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- Simple spaCy-based concept extraction API, involving a dictionary of relevant concepts.☆10Updated 5 years ago
- Simple CORPORA list crawler☆10Updated 8 years ago
- Some convenient natural language tools that build on NLTK.☆85Updated 10 years ago
- ☆21Updated 8 years ago
- Stylometric framework in Python☆17Updated 10 years ago
- stav text annotation visualiser☆34Updated 13 years ago
- API server for NLTK☆23Updated 8 years ago
- CLI tool for importing entities from Wikidata / Wikibase☆23Updated 2 years ago
- WordNet Domains, WordNet Affect and SentiWords☆48Updated 9 years ago
- Code for recon16 hack day☆16Updated 7 years ago
- Simple natural language parsing and semantic grounding☆10Updated 4 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated last year
- Hierarchical phrase-based machine translation system☆32Updated 10 years ago
- Easy language identification of 380 languages☆17Updated 5 years ago
- Recipes for training OpenNMT systems☆14Updated 7 years ago
- Convert ALTO XML to plain text + minimal metadata☆16Updated 6 months ago
- Construct your personal API☆18Updated 2 years ago
- GOPHI: an AMR-to-English Verbalizer☆11Updated 5 years ago
- 📑 Python Package to reconstruct the original continuous text from PDFs with language models☆32Updated last year