gdamdam / sumoLinks
Tool to extracts the text from a web article urls and get frequency words, entities recognition, automatic summary and more
☆20Updated 7 years ago
Alternatives and similar repositories for sumo
Users that are interested in sumo are comparing it to the libraries listed below
Sorting:
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆47Updated 8 years ago
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com…☆134Updated 3 months ago
- ☆14Updated 2 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆19Updated 2 years ago
- A company/project name generator for Python. Uses NLTK and diverse techniques derived from existing corporate etymologies and naming agen…☆50Updated 8 years ago
- Write Like Hemingway☆12Updated 11 years ago
- A free dataset of (almost) all publicly available podcasts.☆133Updated 11 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆57Updated 4 years ago
- Python package for converting xml and epubs to text files☆33Updated 5 years ago
- A collection of YouTube videos transcripts : Podcasts (Joe Rogan Experience, Tim Ferris, Jocko podcast, ..), lectures (YaleCourses, MIT l…☆79Updated 9 months ago
- a client side transcriptions text editor to proofread and correct the text before re-alignement back on the server.☆19Updated 7 years ago
- A collection of pre-built speech synthesis settings used to convey emotion☆11Updated 6 years ago
- Self-supervised neural network for music recommendations.☆18Updated 2 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 6 years ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆24Updated 5 years ago
- Python code for building a GPT-3 based technical blog post optimizer.☆85Updated 3 years ago
- Quantified Self: A Personal Data Aggregator and Dashboard for Self-Trackers and Quantified Self Enthusiasts☆19Updated 2 years ago
- GUI text-based speech and music editor for creating radio/audio stories☆80Updated 3 years ago
- generate rules from lists of words☆16Updated 4 years ago
- Maps clauses from a text corpus onto the metrical structure of a poem☆17Updated 10 years ago
- Searching for the occurrence seconds of words/phrases or arbitrary regex patterns within audio files☆102Updated 5 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Updated 5 years ago
- A library that helps you to convert from one subtitle format to another☆19Updated 7 years ago
- WordNet Domains, WordNet Affect and SentiWords☆48Updated 10 years ago
- GPT2Explorer is bringing GPT2 OpenAI langage models playground to run locally on standard windows computers.☆28Updated 3 years ago
- Text-based media editing interface☆16Updated 8 years ago
- Mad (╯°□°)╯'ing☆10Updated 3 years ago
- A transcription text editor with respeak module☆14Updated this week
- Crawl Wikipedia pages and upload TTS to Youtube.☆10Updated 9 months ago
- App to explore latent spaces of music collections☆37Updated last month