neopunisher / Open-Text-Summarizer
Automatic text summarization
☆243 Updated 6 years ago
Alternatives and similar repositories for Open-Text-Summarizer
Users who are interested in Open-Text-Summarizer are comparing it to the libraries listed below.
- Aviation grade news article metadata extraction ☆36 Updated 2 years ago
- Extract a plain text corpus from MediaWiki XML dumps, such as Wikipedia. ☆133 Updated 6 years ago
- Index URLs in Common Crawl ☆195 Updated 8 years ago
- A Python library for simple text summarization ☆218 Updated 10 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web ☆198 Updated 7 years ago
- 💫 REST microservices for various spaCy-related tasks ☆241 Updated 3 years ago
- Scraper for downloading the entire ebooks repository of Project Gutenberg ☆152 Updated this week
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi… ☆48 Updated 3 years ago
- Adaptive crawler which uses Reinforcement Learning methods ☆168 Updated 7 years ago
- Tool for collectively summarizing large discussions ☆145 Updated 2 years ago
- A command-line tool for using CommonCrawl Index API at http://index.commoncrawl.org/ ☆202 Updated 7 years ago
- Reduction is a Python script which automatically summarizes a text by extracting the sentences which are deemed to be most important. ☆54 Updated 10 years ago
- A Python library to detect and extract listing data from HTML pages. ☆108 Updated 8 years ago
- Automatic keyword extraction - no alchemy required! ☆169 Updated 9 years ago
- Backend of Common Search. Analyses webpages and sends them to the index. ☆122 Updated 8 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, Hamming distance weighting, and more. ☆98 Updated 5 years ago
- Automatically extracts and normalizes an online article or blog post publication date ☆117 Updated 2 years ago
- A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom-built algorithm to rank words and sentences. ☆278 Updated 3 years ago
- Natural language generation language ☆55 Updated 6 years ago
- Train your own Natural Language Processor from a browser 🤖 (Prototype) ☆174 Updated 2 years ago
- Demonstration of using Python to process the Common Crawl dataset with the mrjob framework ☆166 Updated 3 years ago
- ☆185 Updated 6 years ago
- A scraping command line tool for the modern web ☆260 Updated 9 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements. ☆116 Updated 9 years ago
- Mechanical Turk on your own machine. ☆207 Updated 11 months ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆57 Updated last year
- Python library for reading and writing WARC files ☆243 Updated 3 years ago
- English Dependency Relationship Extractor ☆85 Updated 9 months ago
- Extract countries, regions and cities from a URL or text ☆217 Updated 5 years ago
- The Java Graphical Authorship Attribution Program ☆277 Updated last year