cdimascio / essenceLinks
Automatically extract the main text content (and more) from an HTML document
☆118Updated 3 years ago
Alternatives and similar repositories for essence
Users that are interested in essence are comparing it to the libraries listed below
Sorting:
- Crux offers a flexible plugin-based API & implementation to extract interesting information from Web pages.☆243Updated 5 months ago
- A Natural Language Date Time Parser that Extract date and time from text with context and parse to the required format☆242Updated last year
- Life and collaboration assistant.☆38Updated last week
- A Kotlin port of Mozilla‘s Readability. It extracts a website‘s relevant content and removes all clutter from it.☆163Updated 3 years ago
- Kotlin/Java library and cli tool for scraping posts and media from various sources with neither authorization nor full page rendering (Fa…☆298Updated this week
- SimpleDNN is a machine learning lightweight open-source library written in Kotlin designed to support relevant neural network architectur…☆102Updated 5 years ago
- A set of reusable Java components that implement functionality common to any web crawler☆248Updated last week
- Google Search Results JAVA API via SerpApi☆45Updated 3 months ago
- Hunspell library for Java based on JNA☆63Updated 2 years ago
- Java library to extract links (URLs, email addresses) from plain text; fast, small and smart☆212Updated 3 months ago
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆293Updated 4 months ago
- The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike☆772Updated 6 months ago
- Article extraction benchmark: dataset and evaluation scripts☆327Updated last year
- Bindings to Google's Compact Language Detector 3 to JVM Based Languages☆22Updated last year
- The LAW next generation crawler.☆89Updated 3 years ago
- A Java library to determine probability of objects being similar.☆251Updated 2 months ago
- PDF parser and converter to HTML☆87Updated 11 months ago
- StaticLog - super lightweight static logging for Kotlin, Java and Android☆29Updated 7 years ago
- Java autocomplete library.☆119Updated 5 years ago
- An overview of the AI-as-a-service landscape☆159Updated 7 years ago
- Kotlin client for JetBrains Space HTTP API☆48Updated 8 months ago
- Java client for txtai☆38Updated 3 weeks ago
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆67Updated 2 months ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 7 years ago
- NeuralParser is a very simple to use dependency parser, based on the Latent Syntactic Structure encoding.☆20Updated 5 years ago
- A natural language event parser for java and android.☆103Updated 4 years ago
- An implementation of Go-Links, written in Kotlin☆39Updated 6 months ago
- Kotlin API wrapper for Java's WatchService powered with Channels and Coroutines. a.k.a. KWatchChannel☆49Updated last year
- Boilerplate Removal using Deep Learning☆82Updated 3 years ago
- A web crawling framework written in Kotlin☆131Updated 4 years ago