copiesofcopies / youtube-transcription
☆73Updated 12 years ago
Alternatives and similar repositories for youtube-transcription:
Users that are interested in youtube-transcription are comparing it to the libraries listed below
- Search 'from' and 'to' strings to learn a text cleaning mapping☆17Updated 9 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- Extract data from an HTML table and store results to a csv file.☆38Updated 9 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- December 14th Python Meetup Files☆37Updated 12 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly☆47Updated 4 years ago
- Python library with common functionality for writing web scrapers☆102Updated 9 years ago
- Graph extraction and NLP analysis for Baleen Corpora☆18Updated 8 years ago
- Inspect a URL and estimate if it contains a news story☆39Updated 5 months ago
- A web application that identifies party in political discourse and an example of operationalized machine learning.☆28Updated 6 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 8 years ago
- The missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!☆39Updated 7 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- ☆13Updated 8 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Measure is scripts and conventions to build KPI dashboards for projects.☆17Updated 4 years ago
- python script to ingest a csv and convert it to the flare.json format used by many D3.js visualizations☆20Updated last year
- Library for bootstrapping statistics☆21Updated 7 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- A (comprehensive) collection of open source tools used by the data community.☆51Updated 9 years ago
- 🍻Uses Google, Yelp, and Foursquare APIs to retrieve and rank bars☆86Updated 7 years ago
- Lightweight, multilingual natural language processing☆63Updated 12 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago
- IPython Notebook + D3☆128Updated 10 years ago
- extract relationships from standardized terms from corpus of interest with deep learning☆20Updated 5 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 7 years ago
- An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractors☆34Updated 10 years ago
- Modularly extensible semantic metadata validator☆84Updated 9 years ago
- Partial result caching for pandas in Python.☆19Updated 6 years ago
- Topic models (just LDA for now) on the Hacker News corpus☆22Updated 9 years ago