Pattern-based table discovery in Open Data CSV files
☆25Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for Pytheas
Users that are interested in Pytheas are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Rule-based spreadsheet data extraction and transformation☆15Feb 20, 2023Updated 3 years ago
- ☆14May 6, 2018Updated 8 years ago
- Named Entity Disambiguation and Linking☆16May 24, 2024Updated 2 years ago
- ☆27Jan 31, 2019Updated 7 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup☆71Jun 9, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of algorithms for semantic table implementation, including the TableMiner+ method☆19Sep 1, 2022Updated 3 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Dec 11, 2020Updated 5 years ago
- A Jupyter notebook extension to centralize and manage data☆15Dec 22, 2022Updated 3 years ago
- Mirror from: https://gitlab.com/ViDA-NYU/auctus/auctus☆44May 12, 2025Updated last year
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Cloud and Kubernetes configuration for deployment for wbstack.com. You'll want to look at the wikibase.cloud deploy repository soon!☆12Feb 9, 2024Updated 2 years ago
- T2K Match is a matching algorithm optimised to match millions of web tables to a central knowledge base.☆21May 5, 2018Updated 8 years ago
- SPRINT: Script-agnostic Structure Recognition in Tables☆16Mar 26, 2025Updated last year
- This repository is for ExcelTableCNN project - open source automatic table detection on Excel sheets with computer vision☆15Jan 31, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 7 months ago
- MediaWiki extension that adds support for local media files to Wikibase via a new data type.☆12Jun 11, 2026Updated last week
- generate shape expressions from CSV☆11Jun 2, 2026Updated 2 weeks ago
- ☆22Jan 3, 2023Updated 3 years ago
- ☆13Sep 7, 2021Updated 4 years ago
- This repository includes all the code and data for the paper ELiDi (End2end Entity Linking and Disambiguation)☆14Jul 18, 2021Updated 4 years ago
- Code repository for Mondrian, a project for multiregion template recognition in spreadsheets.☆14May 25, 2022Updated 4 years ago
- This repository contains the Wikibase configuration of the EU Knowledge Graph☆14May 4, 2026Updated last month
- Adult IPTV offers high-quality streaming of explicit content, including live channels and on-demand videos, tailored for adult entertainm…☆17Aug 26, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A documentation for FAIR GPT, a virtual RDM consultant☆16Oct 10, 2024Updated last year
- Collection of headless JS components for SurfaceUI☆13Sep 24, 2021Updated 4 years ago
- Wikibase extension that allows defining RDF mappings for Wikibase Entities☆16Jun 1, 2026Updated 2 weeks ago
- ☆47Updated this week
- The official repo for the QuickStatements PHP/HTML/JS interface☆53Apr 7, 2026Updated 2 months ago
- Python package to reconcile DataFrames☆24Feb 15, 2023Updated 3 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆96Feb 5, 2026Updated 4 months ago
- Table extraction library☆31Mar 9, 2025Updated last year
- ☆27May 24, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Example SPARQL queries, mostly for working with ZBW data sets☆16Oct 8, 2025Updated 8 months ago
- Crowdsourced data for open domain relation classification from sentences☆20Oct 26, 2018Updated 7 years ago
- Twitter stream and social network crawling tools☆17Nov 17, 2016Updated 9 years ago
- tesseractXplore a tesseract ease of use gui with full control☆28Nov 10, 2021Updated 4 years ago
- 2.6.35 Kernel for Samsung Galaxy S series Phones - "old" base repo, used as snapshot.☆15Oct 19, 2011Updated 14 years ago
- CodeQL and Binary Ninja scripts to accompany the blog post☆11Feb 3, 2023Updated 3 years ago
- The code base for paper: "ReAcTable: Enhancing ReAct for Table Question Answering"☆37Apr 28, 2024Updated 2 years ago