cdimascio / essenceLinks
Automatically extract the main text content (and more) from an HTML document
☆117Updated 2 years ago
Alternatives and similar repositories for essence
Users that are interested in essence are comparing it to the libraries listed below
Sorting:
- Crux offers a flexible plugin-based API & implementation to extract interesting information from Web pages.☆240Updated 4 months ago
- A Kotlin port of Mozilla‘s Readability. It extracts a website‘s relevant content and removes all clutter from it.☆161Updated 3 years ago
- Kotlin/Java library and cli tool for scraping posts and media from various sources with neither authorization nor full page rendering (Fa…☆295Updated 2 weeks ago
- Life and collaboration assistant.☆35Updated last week
- A Natural Language Date Time Parser that Extract date and time from text with context and parse to the required format☆241Updated 10 months ago
- Java library to extract links (URLs, email addresses) from plain text; fast, small and smart☆210Updated 2 months ago
- A set of reusable Java components that implement functionality common to any web crawler☆244Updated 2 weeks ago
- An implementation of Go-Links, written in Kotlin☆39Updated 4 months ago
- Java client for txtai☆38Updated 2 months ago
- Bindings to Google's Compact Language Detector 3 to JVM Based Languages☆22Updated last year
- Article extraction benchmark: dataset and evaluation scripts☆320Updated last year
- SimpleDNN is a machine learning lightweight open-source library written in Kotlin designed to support relevant neural network architectur…☆100Updated 5 years ago
- A port of the arclabs 'readability' package to Java☆72Updated 12 years ago
- Multiplatform Kotlin Hello World (Android/iOS/Java/JavaScript/Native)☆78Updated last year
- Google Search Results JAVA API via SerpApi☆44Updated 2 months ago
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆291Updated 2 months ago
- A simple Java library for reading RSS and Atom feeds☆177Updated last week
- Gradle plugin, user guide and discussion forums for Conveyor☆143Updated 2 weeks ago
- Treat your Dockerfiles as self-contained, editable scripts☆103Updated 4 years ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 7 years ago
- An IntelliJ plugin that provides some useful utilities to support the daily work with Gradle.☆11Updated this week
- A Kotlin/Java API for generating .ts source files.☆48Updated last year
- A natural language event parser for java and android.☆103Updated 4 years ago
- The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike☆765Updated 4 months ago
- A simple API for programmatically handling maven artifacts and metadata☆78Updated 2 years ago
- This a template project that helps you write Greasemonkey/Tampermonkey/ViolentMonkey scripts with KotlinJs☆19Updated last month
- A web crawling framework written in Kotlin☆131Updated 4 years ago
- NameKrea is an AI Domain Name Generator which uses GPT-2☆50Updated 2 years ago
- A java annotation library for Web scraping.☆28Updated 2 months ago
- Logquacious (lq) is a fast and simple log viewer.☆60Updated 3 years ago