ankiano / etl
Extract transform load CLI tool for extracting small and middle data volume from sources (databases, csv files, xls files, gspreadsheets) to target (databases, csv files, xls files, gspreadsheets) in free combination.
☆11Updated this week
Alternatives and similar repositories for etl:
Users that are interested in etl are comparing it to the libraries listed below
- Python, Tor, Stem, Privoxy: with this tools, allow requests new connections via Tor for obtain new IP addresses.☆24Updated 6 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 6 years ago
- extract difference between two html pages☆32Updated 6 years ago
- Extract social media links and account names from websites.☆38Updated 4 years ago
- Scrapy pipeline which allows you to store scrapy items in a solr server.☆19Updated 8 years ago
- ☆14Updated 6 years ago
- A python client for connecting to all the services provided by https://dandelion.eu☆36Updated last year
- API client for Aleph, supports bulk entity and document upload.☆28Updated 6 months ago
- ☆34Updated last year
- Web scraper for generating a graph of media connections via articles, twitter, reddit, and more☆36Updated 7 years ago
- Data notification service: subscribe to keywords and get notified whenever an open data sources mentions that keyword.☆24Updated 11 years ago
- Scrapes sites. Gets news. Eventually events.☆85Updated 9 years ago
- Streaming web crawler with WebSocket API☆44Updated last year
- A Scrapy crawler for http://books.toscrape.com☆27Updated 7 years ago
- Natural Language Processing of Chicago news articles☆52Updated last month
- This is a Python script to generate Sunburst Charts that visualise the structure of English words.☆16Updated 6 years ago
- Scripts for capturing tweets, creating data dictionary, processing & scoring tweet sentiments☆11Updated 9 years ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 3 years ago
- A classifier for detecting soft 404 pages☆57Updated last year
- Simple dashboard for getting currently trending hashtags and topics on Twitter☆25Updated 2 years ago
- Scraping Tweet data for Russian Troll Twitter accounts into Neo4j☆57Updated 7 years ago
- Techniques for Scraping the Web in Python☆26Updated 6 years ago
- Scraper for Facebook's Archive of Ads with Political Content☆36Updated 6 years ago
- Scalable String Similarity Joins in Python☆39Updated 9 months ago
- Predict age and gender from a first name☆60Updated 6 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆17Updated 3 years ago
- Analysis pipeline for quick ML analyses.☆11Updated 6 years ago
- A scrapper to identify whether a person is of interest against key databases.☆22Updated 6 years ago
- Converter for ICIJ Offshore Leaks data into FollowTheMoney format☆12Updated 3 years ago
- How to handle emoji in Python + a quick Python script to count emoji in Tweets as an example. (python 2.7)☆13Updated 9 years ago