A Python based web crawler that crawls all the web pages in a breathe-first approach from the given seed page
☆16Apr 28, 2015Updated 11 years ago
Alternatives and similar repositories for cleoria-web-crawler
Users that are interested in cleoria-web-crawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- All the Hadoop Mapreduce examples in python!☆16May 8, 2015Updated 10 years ago
- This is a program to crawl entire 'Wikipedia' and extract & store information from the pages as required.☆76Nov 16, 2023Updated 2 years ago
- Databases: Concepts, commands, codes, interview questions and more...☆59Apr 2, 2022Updated 4 years ago
- Simple MapReduce implementation in Python, for text file parallel processing☆20Mar 3, 2012Updated 14 years ago
- A Simple Web Crawler from Scratch.☆10Dec 2, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Simple Python configuration utilities.☆18Jul 26, 2022Updated 3 years ago
- Will Search Various Platforms to Confirm An Email Exists.☆10May 29, 2020Updated 5 years ago
- Yelp Restaurant Photo Classification - Kaggle competition☆12Apr 19, 2019Updated 7 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016☆10Nov 25, 2016Updated 9 years ago
- Movielens collaborative filtering with Solr streaming expression☆11Oct 13, 2016Updated 9 years ago
- Sample REST API built with Python using Flask.☆14Sep 29, 2020Updated 5 years ago
- Stores email header and body information in JSON format☆12Mar 10, 2016Updated 10 years ago
- A simple tool in Python that automates Gmail replies by using Selenium☆17Dec 19, 2016Updated 9 years ago
- Application to warmup servers☆10Jul 13, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Researching the forward-backward algorithm☆11Aug 3, 2018Updated 7 years ago
- Machine learning in nim☆12Aug 16, 2014Updated 11 years ago
- A chrome extension that makes it possible to reply to all selected conversations in Gmail™ at once.☆12Dec 16, 2024Updated last year
- Python bindings to babeljs☆11Mar 16, 2018Updated 8 years ago
- General purpose Billing/Customer Management Software which can be used in gym, yoga centers, aerobics centers,etc.☆12Nov 17, 2022Updated 3 years ago
- Provides Movie Recommendations on the MovieLens ml-100k dataset using Collaborative Filtering☆11Nov 14, 2013Updated 12 years ago
- Tornado base application☆16Jan 12, 2010Updated 16 years ago
- Python website creator. Create a website within 20 seconds in python☆16Oct 1, 2020Updated 5 years ago
- IPython notebook manager which seamlessly saves and loads to S3☆19Feb 12, 2015Updated 11 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 💬 Data forensics and recovery utility for Skype chats and history☆12Oct 29, 2021Updated 4 years ago
- Easily collect and store tweets from the Twitter Streaming API☆11Jul 4, 2014Updated 11 years ago
- GitHub news feed on your iPhone☆34Aug 8, 2008Updated 17 years ago
- Sample application for deploying a neural network as a tornado-powered webservice. The thread-blocking calls are run sequentially on a se…☆13Dec 30, 2016Updated 9 years ago
- 客户端工具函数集合库,针对工作中在客户端环境,需要用到的常用需求进行了封装,需要undescore,jQuery的配合只兼容chrome以及IE8以上浏览器,省去多次提取,或者编写函数导致的重复劳动。如提取cookie,以及提取链接参数的字符串,兼容了requireJS与s…☆10Apr 14, 2021Updated 5 years ago
- An Information Extraction Framework with Deep Learning developed at New York University☆15Oct 27, 2016Updated 9 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Oct 12, 2016Updated 9 years ago
- (Theano) Implementations about deep neural network, recurrent neural network, LSTM, and structured learining.☆10Nov 9, 2016Updated 9 years ago
- 编写高质量代码--改善Python程序的91个建议☆12Jun 19, 2015Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A browser extension for GMail that lets you schedule emails to return to your inbox☆18May 31, 2017Updated 8 years ago
- Experimentation with Highway Networks & GradNets☆13Dec 31, 2015Updated 10 years ago
- Leetcode python solution, with analytics in blog☆13Nov 25, 2014Updated 11 years ago
- Cloud Mining automatically builds exploratory faceted search systems.☆52Oct 15, 2013Updated 12 years ago
- Toy codes to kick-start deep learning for NLP !☆12Aug 22, 2016Updated 9 years ago
- Convolutional Neural Network for Language Detection in Tensorflow☆12Apr 22, 2018Updated 8 years ago
- Stupid Experiments in Elasticsearch Image Search☆14Oct 18, 2019Updated 6 years ago