open source, distributed, restful crawler engine
☆14Feb 3, 2015Updated 11 years ago
Alternatives and similar repositories for ants
Users that are interested in ants are comparing it to the libraries listed below
Sorting:
- Browser-based annotation tool for Framenet☆16Jan 27, 2015Updated 11 years ago
- A simple and fast search engine☆70Jun 21, 2022Updated 3 years ago
- For interacting with nutch via Python☆29Feb 18, 2026Updated 2 weeks ago
- A Dockerized RSS feed fetcher for NLP work, using asyncio☆20Sep 16, 2022Updated 3 years ago
- a framework and language for exploring and analyzing feeds of social media data.☆23Jan 25, 2012Updated 14 years ago
- Fast filtering and animation of large dynamic networks☆39May 24, 2016Updated 9 years ago
- Cloud Mining automatically builds exploratory faceted search systems.☆52Oct 15, 2013Updated 12 years ago
- The goal of this experiment is to take articles and certain metadata and group them by topic.☆11Apr 14, 2016Updated 9 years ago
- Sort-friendly URI Reordering Transform (SURT) python module☆45Sep 11, 2025Updated 5 months ago
- A Django based search engine powered by CouchDB, celery and whoosh.☆48Dec 26, 2015Updated 10 years ago
- An open-source news aggregator☆15Sep 9, 2016Updated 9 years ago
- Bicycle Incident reporting☆13Jul 22, 2022Updated 3 years ago
- A simple maintenance tracking tool for your vehicles.☆12Nov 1, 2025Updated 4 months ago
- Generate a Bitcoin paper wallet offline as a png file, no need for a browser.☆23Mar 27, 2017Updated 8 years ago
- Focused Crawler for VT's CTRNet☆10May 13, 2013Updated 12 years ago
- Green SqlAlchemy extensions for pulsar☆11Nov 24, 2017Updated 8 years ago
- Digitization information system build on top of Fedora repository☆16Jan 15, 2019Updated 7 years ago
- Narwhal is a keyword and KEY NARRATIVE manager that creates language-aware classes. Because Narhwal does not use NLP it avoids complexity…☆12Oct 16, 2018Updated 7 years ago
- ☆12Oct 25, 2015Updated 10 years ago
- Minimal web-based client for NewsBlur.☆20Dec 7, 2014Updated 11 years ago
- Collection of functions and scripts for text retrieval in Python: Document collection preprocessing, Feature Selection, Indexing, Query p…☆43Mar 23, 2013Updated 12 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆48Mar 19, 2018Updated 7 years ago
- Track the keyword positions☆19Oct 26, 2013Updated 12 years ago
- Prospective search for python☆26Dec 4, 2012Updated 13 years ago
- t test☆10Apr 27, 2014Updated 11 years ago
- BlogBridge, the cross platform, open source, blog and rss reader with super powers!☆29Nov 2, 2011Updated 14 years ago
- Micro-framework for publishing linked data☆11Aug 1, 2017Updated 8 years ago
- Drupal Computing is a framework that facilitates distributed computing between Drupal and external programs written in non-PHP languages …☆10Dec 29, 2014Updated 11 years ago
- Loopback web application for administration of Datawake networks☆10May 2, 2017Updated 8 years ago
- Expand tags by rendering local or remote RDF resources, recursively.☆10Dec 8, 2022Updated 3 years ago
- Document management system. Based on bill tracking needs. Simple model for stages, priorities, authors, content (abstract, tags), releate…☆19Sep 16, 2014Updated 11 years ago
- Rapidly develop your API client☆144Nov 10, 2015Updated 10 years ago
- ☆12Apr 7, 2015Updated 10 years ago
- Create an eBook from Pete Keen's Ledger tutorials☆10Jun 10, 2018Updated 7 years ago
- sparql-stream sensor queries☆16Sep 28, 2016Updated 9 years ago
- Stream Processing ToolKit☆18Aug 14, 2015Updated 10 years ago
- Discover, analyze and present data from the web and mobile in meaninful ways☆83Jul 16, 2013Updated 12 years ago
- Distributed Web Crawler, Parser and Search Engine.☆10Jun 16, 2016Updated 9 years ago
- This is the open source code of the City72 platform. Fork this code, then deploy your own City72 site.☆29Sep 3, 2016Updated 9 years ago