copiesofcopies / youtube-transcription
☆72Updated 11 years ago
Alternatives and similar repositories for youtube-transcription:
Users that are interested in youtube-transcription are comparing it to the libraries listed below
- A web application that identifies party in political discourse and an example of operationalized machine learning.☆28Updated 6 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping☆17Updated 9 years ago
- A polite, minimal interface for sending python objects to and from Amazon S3.☆57Updated 8 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 8 years ago
- Multidimensional data explorer and visualization tool.☆55Updated 7 years ago
- Literate data analysis with iPython notebooks and Jekyll.☆92Updated 10 years ago
- Download *ALL* the submissions from Hacker News☆50Updated 10 years ago
- rapid nlp prototyping☆72Updated 2 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Set of scripts to aid in the download of the GDELT data files from www.gdeltproject.org☆11Updated 10 years ago
- Example nteract notebooks with links to execution on mybinder.org☆27Updated 2 years ago
- a set of services that provide NLP facilities☆25Updated 4 years ago
- Scraping Assisted by Learning☆35Updated this week
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.☆78Updated last year
- A web application for exploring documents topically.☆26Updated 8 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated last year
- python script to ingest a csv and convert it to the flare.json format used by many D3.js visualizations☆20Updated 9 months ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 3 months ago
- Deployment of pywb as a CommonCrawl Index Server☆21Updated 7 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆33Updated 9 years ago
- Files for workshop at Center for Research on Inequalities and the Life Course☆33Updated 10 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly☆47Updated 4 years ago
- Using word2vec and t-SNE to compare text sources.☆20Updated 9 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago