mohaps / tldrzrLinks
Algorithmic summarizer for RSS/Atom Feeds, Web Urls and arbitrary text. Codebase for the application deployed at http://tldrzr.herokuapp.com
☆53Updated 9 years ago
Alternatives and similar repositories for tldrzr
Users that are interested in tldrzr are comparing it to the libraries listed below
Sorting:
- A crawler, indexer, and query interface all in Python with distributed processing via Pyro4.☆23Updated 13 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…☆12Updated 11 years ago
- Human-Powered Data Analysis with Mechanical Turk☆300Updated 13 years ago
- XTractor is an algorithmic text extractor from web pages written in Java. It builds upon the "commonly used web design practices" approac…☆43Updated 9 years ago
- A dashboard with various internet-y widgets☆18Updated 8 years ago
- A POC at replicating Facebook Graph Search with Cypher and Neo4j☆101Updated 12 years ago
- Real-Time, Twitter sentiment analyzer engine☆143Updated 11 years ago
- An online sentiment analyzer built with Flask and TextBlob☆15Updated 12 years ago
- Akiva is a simple natural-language-processing, question-answering, artificial intelligence.☆347Updated 12 years ago
- Launch AWS Elastic MapReduce jobs that process Common Crawl data.☆49Updated 8 years ago
- Open Source implementation of Summly☆47Updated 9 years ago
- conceptnet 4 bridge☆72Updated 11 years ago
- We introduce TACIT: An Open-Source Text Analysis, Crawling and Interpretation Tool. TACIT's plugin architecture has three main components…☆109Updated 6 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news☆61Updated 4 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool☆149Updated 4 years ago
- An api to parse a CV, in particular the elements of its publication list☆35Updated 7 years ago
- ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (image…☆95Updated 7 years ago
- General Architecture for Text Engineering☆49Updated 9 years ago
- Web interface to sentiment analyzer.☆151Updated 9 years ago
- Performs multi document summarization. Includes a method to generate summaries: The method uses a sentence importance score calculator ba…☆38Updated 12 years ago
- Train your own Natural Language Processor from a browser 🤖 (Prototype)☆174Updated 2 years ago
- Simple search results with Solr and EmberJS☆58Updated 6 years ago
- Neddick: Open Source Information Discovery Platform☆36Updated 2 years ago
- Collects multimedia content shared through social networks.☆19Updated 10 years ago
- Automatic Document Summarizer using Bipartite HITS, Natural Language Processing (NLP)☆30Updated 13 years ago
- This is the NewsFinder software, designed to automatically crawl the web for news related to artificial intelligence, filter, categorize,…☆64Updated 12 years ago
- Automatic, zero-config web scraping -- written in Java, has no dependency on Java EE or app servers, and the web scraper has a restful/JS…☆156Updated 8 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 7 years ago
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It …☆15Updated 9 years ago
- A tool for calculation semantic similarity between words from a text corpus based on lexico-syntactic patterns.☆27Updated 9 years ago