cdimascio / essence
Automatically extract the main text content (and more) from an HTML document
☆117Updated 2 years ago
Alternatives and similar repositories for essence
Users who are interested in essence are comparing it to the libraries listed below
- Crux offers a flexible plugin-based API & implementation to extract interesting information from Web pages.☆241Updated last month
- A Kotlin port of Mozilla's Readability. It extracts a website's relevant content and removes all clutter from it.☆154Updated 3 years ago
- Kotlin/Java library and cli tool for scraping posts and media from various sources with neither authorization nor full page rendering (Fa…☆285Updated this week
- Multiplatform Kotlin Hello World (Android/iOS/Java/JavaScript/Native)☆76Updated 10 months ago
- Life and collaboration assistant.☆36Updated this week
- TextRank algorithm implementation in JavaScript☆41Updated 10 years ago
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆67Updated 4 years ago
- A Kotlin/Java API for generating .ts source files.☆47Updated last year
- A dataset of multinational first names and last names☆26Updated 2 years ago
- A Kotlin multi-platform library for graph data structures☆20Updated 2 years ago
- StaticLog - super lightweight static logging for Kotlin, Java and Android☆28Updated 7 years ago
- Java library to extract links (URLs, email addresses) from plain text; fast, small and smart☆209Updated 6 months ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆128Updated last year
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW…☆71Updated last year
- A Directory of Online Newspaper Sources for 70+ Languages☆32Updated 4 years ago
- Port of Andrej Karpathy's llama2.c to Kotlin.☆23Updated last year
- ☆30Updated 6 years ago
- A command-line tool for using CommonCrawl Index API at http://index.commoncrawl.org/☆190Updated 6 years ago
- This is an Android app that helps you share/manage your files on your Android device through a web interface in the browser - Built with Ktor…☆36Updated 2 years ago
- SimpleDNN is a lightweight open-source machine learning library written in Kotlin designed to support relevant neural network architectur…☆99Updated 4 years ago
- NameKrea is an AI Domain Name Generator which uses GPT-2☆48Updated 2 years ago
- Kotlin multiplatform internationalization library (experimental)☆20Updated last year
- Gradle plugin, user guide and discussion forums for Conveyor☆137Updated 2 weeks ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆126Updated 4 months ago
- Kotlin Multiplatform RocksDB library☆38Updated 2 months ago
- An implementation of Go-Links, written in Kotlin☆39Updated 2 months ago
- Cloud crawler functions for scrapeulous☆45Updated 4 years ago
- Sigbla is a framework for working with data in tables, using the Kotlin programming language. It supports various data types, reactive pr…☆26Updated 5 months ago
- A human-friendly alternative to cron. Designed after GAE's schedule for Kotlin and/or Java 8+.☆82Updated 3 years ago
- Kode-First Konfiguration for Kotlin☆22Updated 4 years ago