neopunisher / Open-Text-SummarizerLinks
Automatic text summarization
☆243Updated 7 years ago
Alternatives and similar repositories for Open-Text-Summarizer
Users that are interested in Open-Text-Summarizer are comparing it to the libraries listed below
Sorting:
- Index URLs in Common Crawl☆197Updated 8 years ago
- A python library for simple text summarization☆219Updated 10 years ago
- tool for collectively summarizing large discussions☆145Updated 3 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆116Updated 9 years ago
- ☆185Updated 7 years ago
- Train your own Natural Language Processor from a browser 🤖 (Prototype)☆174Updated 2 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web☆199Updated 7 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 5 years ago
- A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools…☆296Updated last week
- Scraper for downloading the entire ebooks repository of project Gutenberg☆155Updated this week
- ThoughtTreasure commonsense knowledge base and architecture for natural language processing☆79Updated 10 years ago
- Demonstration of using Python to process the Common Crawl dataset with the mrjob framework☆168Updated 3 years ago
- A command-line tool for using CommonCrawl Index API at http://index.commoncrawl.org/☆205Updated 7 years ago
- English Dependency Relationship Extractor☆86Updated 2 months ago
- Extract a plain text corpus from MediaWiki XML dumps, such as Wikipedia.☆134Updated 7 years ago
- Aviation grade news article metadata extraction☆36Updated 2 years ago
- TopicDB is a topic maps-based semantic graph store (using SQLite for persistence)☆269Updated 11 months ago
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com…☆134Updated 2 months ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- Uses NLP and wikipedia to try to generate trivia questions☆132Updated 8 years ago
- 💫 REST microservices for various spaCy-related tasks☆241Updated 3 years ago
- Launch AWS Elastic MapReduce jobs that process Common Crawl data.☆49Updated 8 years ago
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N…☆276Updated 3 years ago
- displaCy.js: An open-source NLP visualiser for the modern web☆345Updated 7 years ago
- Natural Language Engine on WikiData☆436Updated 9 years ago
- Automatically extracts and normalizes an online article or blog post publication date☆117Updated 2 years ago
- Automatic Web Article Summarizer☆416Updated 4 years ago
- Backend of Common Search. Analyses webpages and sends them to the index.☆122Updated 8 years ago
- An open source toolkit for mining Wikipedia☆128Updated 7 years ago
- LA-PDFText is a system for extracting accurate text from PDF-based research articles (and an interface to be able to improve performance …☆81Updated 7 years ago