tpopela/vips_java

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tpopela/vips_java)

tpopela / vips_java

Implementation of Vision Based Page Segmentation algorithm in Java

☆107

Alternatives and similar repositories for vips_java

Users that are interested in vips_java are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

asanoja / web-segmentation-evaluation
View on GitHub
Tools for web page segmentation evaluation
☆13Nov 6, 2019Updated 6 years ago
webis-de / cikm20-web-page-segmentation-revisited-evaluation-framework-and-dataset
View on GitHub
Code for "Web Page Segmentation Revisited: Evaluation Framework and Dataset", accepted as resources paper to CIKM 2020
☆14Jan 13, 2023Updated 3 years ago
miha-stopar / extract-repetitions
View on GitHub
Extract (DOM tree) repetitions from a webpage
☆11Jan 13, 2014Updated 12 years ago
waml-lang / waml
View on GitHub
Web Automation Markup Language (WAML) Specification
☆18Nov 21, 2018Updated 7 years ago
scrapinghub / mdr
View on GitHub
A python library detect and extract listing data from HTML page.
☆110May 5, 2017Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OCR-D / ocrd_segment
View on GitHub
OCR-D-compliant page segmentation
☆67May 6, 2026Updated 2 months ago
wolfbing / roadrunner
View on GitHub
datamining roadrunner
☆13Apr 5, 2016Updated 10 years ago
trec-web / trec-web-2014
View on GitHub
☆16Aug 8, 2014Updated 11 years ago
crawlerclub / ce
View on GitHub
Html article content extractor in Golang.
☆12Oct 31, 2022Updated 3 years ago
IremErturk / dtc-de-capstone-project
View on GitHub
☆11Apr 9, 2022Updated 4 years ago
seagatesoft / webdext
View on GitHub
Intelligent Web Data Extractor
☆74Dec 5, 2022Updated 3 years ago
rubenkruiper / FOBIE
View on GitHub
FOBIE dataset and code for Semi-Open Relation Extraction, applied to Biology for Computer-Aided Biomimetics.
☆35Jun 14, 2020Updated 6 years ago
schmmd / ollie
View on GitHub
Ollie is a open information extractor that uses dependency parses.
☆12Sep 27, 2013Updated 12 years ago
rkrzr / dataset-popular
View on GitHub
A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.
☆15Feb 9, 2014Updated 12 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Da-Capo / Entity-Relation-SVM
View on GitHub
SVM Entity Relation classification for ace2005 chinese data
☆14Jun 25, 2017Updated 9 years ago
mc2-project / muse
View on GitHub
Secure Inference Resilient Against Malicious Clients
☆14May 3, 2022Updated 4 years ago
marekrei / mltagger
View on GitHub
Multi-level tagger
☆24May 4, 2018Updated 8 years ago
omarmhaimdat / whatlang-pyo3
View on GitHub
Python Binding for Rust WhatLang, a language detection library
☆14Jan 5, 2024Updated 2 years ago
iproduct-database / vpm-filter-spark
View on GitHub
Virtual patent marking crawler at iproduct.epfl.ch
☆15Sep 13, 2017Updated 8 years ago
riveSunder / yuca
View on GitHub
Your Universal Cellular Automata
☆14Aug 31, 2025Updated 10 months ago
Ahmad1234567 / provable-data-possession
View on GitHub
Automatically exported from code.google.com/p/provable-data-possession
☆14Oct 19, 2015Updated 10 years ago
wintermute0 / tinybrain
View on GitHub
This repository contains the complete source code that we used to conduct experiments in the paper: Text Window Denoising Autoencoder: Bu…
☆15Jun 12, 2013Updated 13 years ago
HROlive / Secure-and-Private-AI
View on GitHub
This course introduced me to three cutting-edge technologies for privacy-preserving AI: Federated Learning, Differential Privacy, and Enc…
☆11Sep 2, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
VincentNi0107 / BadVLMDriver
View on GitHub
☆10May 8, 2024Updated 2 years ago
tomazk / Text-Extraction-Evaluation
View on GitHub
Framework for evaluating text extraction algorithms implemented as web services
☆42Jun 30, 2012Updated 14 years ago
carted / handling-variable-length-text-tf
View on GitHub
This repository shows how to efficiently process variable-length sequences in TensorFlow.
☆14Apr 26, 2022Updated 4 years ago
glicerico / SGNN
View on GitHub
Implementation of Self-Governing Neural Networks for speech act classification
☆12Jul 14, 2026Updated last week
sasza2 / react-arrows
View on GitHub
☆20Mar 4, 2023Updated 3 years ago
dsroche / la-por
View on GitHub
Linear algebra-based Proof of Retrievability protocol for ensuring data integrity
☆14Mar 28, 2022Updated 4 years ago
CatalinVoss / anchor-baggage
View on GitHub
Experimentation code for the article "Building Topic Models Based on Anchor Words" based on the paper "Learning Topic Models: Going beyon…
☆15May 13, 2014Updated 12 years ago
soummyaah / KGMedNLI
View on GitHub
A repository containing the code for the paper "Incorporating Domain Knowledge into Medical NLI using Knowledge Graphs" EMNLP 2019
☆13Nov 2, 2019Updated 6 years ago
whodewho / FluxEnder
View on GitHub
Ender of Fast-Flux malicious domains.
☆27Nov 2, 2014Updated 11 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zhangsn-19 / PAN
View on GitHub
Code and data for PAN and PAN-phys.
☆14Mar 20, 2023Updated 3 years ago
nlplab / nersuite
View on GitHub
☆26Nov 20, 2018Updated 7 years ago
MohamedHmini / iww
View on GitHub
AI based web-wrapper for web-content-extraction
☆102Feb 6, 2023Updated 3 years ago
chand1012 / sql2gpt
View on GitHub
Python tool to turn SQL Database Schemas into ChatGPT Prompts
☆15Jan 28, 2026Updated 5 months ago
Tongzhenguo / shanghai_unicom_tourist_tagging
View on GitHub
init
☆11Sep 30, 2017Updated 8 years ago
kasnerz / d2t_iterative_editing
View on GitHub
Code for the paper Data-to-Text Generation with Iterative Text Editing
☆14Mar 23, 2021Updated 5 years ago
Strifee / co2_predict
View on GitHub
ARIMA model practicing with C02 Emission dataset Forecasting with python
☆10Dec 28, 2021Updated 4 years ago