gdamdam / sumo
Tool to extracts the text from a web article urls and get frequency words, entities recognition, automatic summary and more
☆20Updated 6 years ago
Alternatives and similar repositories for sumo:
Users that are interested in sumo are comparing it to the libraries listed below
- Example how to pre-process news articles with textbox and index on Elastic Search☆13Updated 7 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- Pocketsphinx-based Linux Voice Dictation☆25Updated 4 years ago
- ☆15Updated 8 years ago
- Vidscraper is a python library which provides a simple API for fetching video data from various web services and sites.☆62Updated 2 years ago
- The Requests Stampede library is a wrapper around the Requests library that provides request retry logic and backoff delays.☆10Updated 3 years ago
- Tool for running transformations on columns in a SQLite database☆31Updated 3 years ago
- Small library to fetch files over HTTP and resuming their download☆13Updated 3 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…☆12Updated 10 years ago
- Write Like Hemingway☆11Updated 10 years ago
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community.☆9Updated 3 years ago
- Scraping Amazon reviews using headless chrome and selenium☆10Updated 6 years ago
- LLM plugin adding support for the MPT-30B language model☆33Updated last year
- A fast TUI application (with optional webui) to visually navigate and inspect JSON and JSONL data. Easily localize parse errors in large …☆13Updated 5 months ago
- Markdown -> IPython conversion tool☆15Updated 10 years ago
- Maps clauses from a text corpus onto the metrical structure of a poem☆17Updated 9 years ago
- A raspberry pi 64bit image with spacy and neuralcoref pre-installed☆21Updated 5 years ago
- RESTful API around the PETRARCH coding software☆10Updated 3 years ago
- Identify and automatically fix issues in shell scripts☆15Updated last year
- framework for making streamcorpus data☆11Updated 8 years ago
- jq as a service☆35Updated 9 months ago
- Python's missing statistical Swiss Army knife☆15Updated 9 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆44Updated 7 years ago
- Natural Language Q/A app using DRT.☆34Updated 13 years ago
- Plots various graphs for a series of plaintext files in a directory☆19Updated 8 years ago
- Second project for UW LING 572. Automatic text summarization system.☆13Updated 12 years ago
- A simple mail "spool and send" daemon written in Go☆13Updated 10 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- Foundry is an interactive, real-time Javascript interface that allows flash teams to be assembled by anyone and tracked in real time.☆29Updated 7 years ago
- Reduction is a python script which automatically summarizes a text by extracting the sentences which are deemed to be most important.☆55Updated 10 years ago