gdamdam / sumoLinks
Tool to extracts the text from a web article urls and get frequency words, entities recognition, automatic summary and more
☆20Updated 6 years ago
Alternatives and similar repositories for sumo
Users that are interested in sumo are comparing it to the libraries listed below
Sorting:
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
 - LLM plugin for embeddings using sentence-transformers☆72Updated 6 months ago
 - Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆18Updated 2 years ago
 - Manage, generate convert chapters for podcasts and other media via cli and web☆38Updated 6 months ago
 - Python package for converting xml and epubs to text files☆33Updated 5 years ago
 - Crawl sites for RSS, Atom, and JSON feeds.☆81Updated this week
 - An Alexa skill providing a conversational interface to any public figure (as mimicked by GPT3). The legacy GUI is no longer maintained.☆20Updated last year
 - Google Colab Notebooks for Transcription with Whisper☆24Updated 6 months ago
 - Automatically exported from code.google.com/p/guess-language☆53Updated 2 weeks ago
 - Get an answer to a question from multiple backend engine like Google, wolframalpha or DuckDuckGo☆11Updated 4 years ago
 - Reproducing "Writing with Transformer" demo, using aitextgen/FastAPI in backend, Quill/React in frontend☆27Updated 4 years ago
 - Python wrapper library for the Datamuse API☆80Updated 2 years ago
 - Using large language models to maintain AI_CHANGELOG.md☆13Updated last year
 - WordNet Domains, WordNet Affect and SentiWords☆48Updated 9 years ago
 - A transcription text editor with respeak module☆13Updated last month
 - Create high-quality images programmatically with easily-hackable templates.☆189Updated last year
 - Wikidata's QRank as a SQLite DB.☆28Updated last year
 - Curated list of open source and openly accessible large language models☆26Updated 2 years ago
 - A free dataset of (almost) all publicly available podcasts.☆134Updated 11 years ago
 - LLM plugin for clustering embeddings☆82Updated last year
 - Write Like Hemingway☆12Updated 10 years ago
 - An interface for llama.cpp, ChatGPT, Gemini, and Claude☆27Updated this week
 - Faster, modernized fork of the language identification tool langid.py☆59Updated 11 months ago
 - Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com…☆133Updated last week
 - Cleaning tool for web scraped text☆38Updated 2 years ago
 - Identify and automatically fix issues in shell scripts☆15Updated last year
 - Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆50Updated 3 weeks ago
 - Render tweet into beautiful markdown☆26Updated last month
 - Generate embeddings for images and text using CLIP with LLM☆74Updated last year
 - Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago