Implementation of Vision Based Page Segmentation algorithm in Java
☆107 · Oct 25, 2019 · Updated 6 years ago
Alternatives and similar repositories for vips_java
Users that are interested in vips_java are comparing it to the libraries listed below.
- Suite of tools for detecting changes in web pages and their rendering ☆56 · Dec 17, 2023 · Updated 2 years ago
- Web page segmentation and noise removal ☆55 · Feb 4, 2024 · Updated 2 years ago
- Tools for web page segmentation evaluation ☆13 · Nov 6, 2019 · Updated 6 years ago
- code and data used to build a training dataset for dragnet models ☆10 · Nov 29, 2020 · Updated 5 years ago
- A python implementation of DEPTA ☆83 · Jan 14, 2017 · Updated 9 years ago
- A Python library to detect and extract listing data from HTML pages. ☆110 · May 5, 2017 · Updated 9 years ago
- datamining roadrunner ☆13 · Apr 5, 2016 · Updated 10 years ago
- Browser Recorder And Player (BRAP) is a Java based tool that provides a programmatic way to record what users do in a browser (e.g. click… ☆15 · Jan 7, 2015 · Updated 11 years ago
- Training/test data for Dragnet ☆42 · Jan 29, 2015 · Updated 11 years ago
- Structured Data Extractor. An application to extract structured data from web pages. It uses Data Extraction Based on Partial Tree Alignm… ☆50 · Jun 9, 2012 · Updated 14 years ago
- Html article content extractor in Golang. ☆12 · Oct 31, 2022 · Updated 3 years ago
- Automatic Item List Extraction ☆85 · Jun 15, 2016 · Updated 10 years ago
- Projects, including a website project, a Chrome extension project, etc. ☆19 · Oct 2, 2020 · Updated 5 years ago
- Segment a HTML document into structural data ☆12 · Jan 15, 2019 · Updated 7 years ago
- Elasticsearch word segmentation plugin based on Tencent's TexSmart word segmentation SDK ☆15 · Sep 18, 2020 · Updated 5 years ago
- Computes normals for triangulated meshes ☆18 · May 20, 2016 · Updated 10 years ago
- Example Proteus Project ☆11 · May 27, 2020 · Updated 6 years ago
- fibx is a web framework for fibjs that provides middleware installation as well as request receiving and response handling ☆10 · Dec 17, 2017 · Updated 8 years ago
- A Simple Http to Raw Socket Adapter for Android ☆12 · Aug 30, 2015 · Updated 10 years ago
- SVM Entity Relation classification for ace2005 chinese data ☆14 · Jun 25, 2017 · Updated 9 years ago
- Intelligent Web Data Extractor ☆74 · Dec 5, 2022 · Updated 3 years ago
- A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks. ☆15 · Feb 9, 2014 · Updated 12 years ago
- Rules used in Neural Rule Engine. ☆28 · Aug 31, 2018 · Updated 7 years ago
- Dense optical flow toolbox (from C.Liu) ☆19 · Jun 14, 2012 · Updated 14 years ago
- Work in progress transmit from Google Code ☆1,127 · Jan 3, 2018 · Updated 8 years ago
- Your Universal Cellular Automata ☆14 · Aug 31, 2025 · Updated 10 months ago
- Fingerprint Authentication using BiometricPrompt Compat ☆12 · Jun 6, 2019 · Updated 7 years ago
- Fixes to Sublime Text's JavaScript symbol list ☆30 · Oct 15, 2014 · Updated 11 years ago
- Training of boolean logic models of signalling networks using prior knowledge networks and perturbation data. ☆13 · Nov 24, 2025 · Updated 7 months ago
- Source code for the paper "Web2Text: Deep Structured Boilerplate Removal", full paper @ ECIR'18 ☆169 · Oct 28, 2021 · Updated 4 years ago
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks ☆11 · Mar 15, 2022 · Updated 4 years ago
- Programmatically instantiate and modify Firebase instances. ☆19 · Feb 14, 2017 · Updated 9 years ago
- ☆11 · Aug 31, 2015 · Updated 10 years ago
- This repository contains the complete source code that we used to conduct experiments in the paper: Text Window Denoising Autoencoder: Bu… ☆15 · Jun 12, 2013 · Updated 13 years ago
- C++ Version Code for 'Recurrent Scale Approximation for Object Detection in CNN' in ICCV 2017 ☆16 · Jan 29, 2018 · Updated 8 years ago
- Character CNN model for DSL 2016 ☆16 · Jul 17, 2017 · Updated 8 years ago
- Implementation of Self-Governing Neural Networks for speech act classification ☆12 · Nov 5, 2025 · Updated 7 months ago
- Framework for evaluating text extraction algorithms implemented as web services ☆42 · Jun 30, 2012 · Updated 14 years ago
- Pure C natural language identifier with support for 97 languages ☆27 · Sep 26, 2017 · Updated 8 years ago