qoda / python-searchengine
A simple search engine that uses Whoosh, MongoDB, a custom HTML scraper, and a simple crawler.
☆35Updated 4 years ago
Related projects
Alternatives and complementary repositories for python-searchengine
- Uses Scrapy to fetch a person's public LinkedIn profile.☆28Updated 12 years ago
- Academic Search Engine using Scrapy, MongoDB, Lucene/Solr, Tika, Struts2, Jquery, Bootstrap, D3, CAS☆98Updated 11 years ago
- Scrape Zhihu content and user social network information☆47Updated 10 years ago
- Taobao crawler prototype, based on gevent☆49Updated 11 years ago
- A scrapy zhihu crawler☆76Updated 6 years ago
- A distributed Sina Weibo Search spider based on Scrapy and Redis.☆143Updated 11 years ago
- A spectrum analysis based music finder☆107Updated 9 years ago
- Quora clone written in Python + Tornado + MongoDB☆106Updated 3 years ago
- Output scrapy statistics to graphite/carbon☆54Updated 11 years ago
- Scrapy project based on dirbot to show how to use Twisted's adbapi to store the scraped data in MySQL.☆117Updated 11 years ago
- Multi-CPU, Multi-Thread. Implemented in Python.☆79Updated 9 years ago
- A flexible web crawler based on Scrapy for fetching most of Ajax or other various types of web pages. Easy to use: To customize a new web…☆45Updated 8 years ago
- A Python web fetcher using phantomjs to mock a browser☆180Updated 7 years ago
- WEIBO_SCRAPY is a Multi-Threading SINA WEIBO data extraction Framework in Python.☆154Updated 7 years ago
- http://pythonhackers.com☆136Updated 9 years ago
- ☆167Updated 6 years ago
- Brownant is a web data extracting framework.☆159Updated 7 years ago
- Python Web Framework☆130Updated 8 years ago
- Unofficial Python API for Hacker News. RESTful API at https://github.com/karan/HNify☆390Updated 5 years ago
- A live chat built with python (flask + gevent + apscheduler) + redis☆321Updated 2 years ago
- trashMailFilter☆51Updated 5 years ago
- Python distributed web scraper and dynamic crawler☆140Updated 7 years ago
- Distributed targeted crawling cluster☆71Updated 7 years ago
- A real-time crawler for searching P2P magnet URLs.☆41Updated 9 years ago
- Crawler code and analysis for various kinds of Douban information, to be added over time☆25Updated 10 years ago
- A magnet link search engine built with Python.☆14Updated 10 years ago
- Renren friend relationships☆184Updated 11 years ago
- A web-based image hosting, viewing and sharing service built on top of Flask.☆110Updated 9 years ago