Pattern-based table discovery in Open Data CSV files
☆25 · Dec 8, 2022 · Updated 3 years ago
Alternatives and similar repositories for Pytheas
Users that are interested in Pytheas are comparing it to the libraries listed below.
- Rule-based spreadsheet data extraction and transformation ☆15 · Feb 20, 2023 · Updated 3 years ago
- ☆14 · May 6, 2018 · Updated 7 years ago
- Named Entity Recognition ☆19 · Feb 13, 2026 · Updated 2 months ago
- Named Entity Disambiguation and Linking ☆16 · May 24, 2024 · Updated last year
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method ☆15 · Dec 24, 2023 · Updated 2 years ago
- ☆27 · Jan 31, 2019 · Updated 7 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup ☆70 · Jun 9, 2025 · Updated 10 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German" ☆18 · Dec 11, 2020 · Updated 5 years ago
- A Jupyter notebook extension to centralize and manage data ☆15 · Dec 22, 2022 · Updated 3 years ago
- Mirror from: https://gitlab.com/ViDA-NYU/auctus/auctus ☆44 · May 12, 2025 · Updated 11 months ago
- Check your modified Ground Truth files with visual support! ☆10 · Jan 31, 2024 · Updated 2 years ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification. ☆24 · Feb 2, 2024 · Updated 2 years ago
- GC4LM: A Colossal (Biased) language model for German ☆13 · May 2, 2021 · Updated 4 years ago
- MediaWiki extension that adds support for local media files to Wikibase via a new data type. ☆12 · Mar 26, 2026 · Updated 3 weeks ago
- ☆22 · Jan 3, 2023 · Updated 3 years ago
- ☆13 · Sep 7, 2021 · Updated 4 years ago
- This repository includes all the code and data for the paper ELiDi (End2end Entity Linking and Disambiguation) ☆14 · Jul 18, 2021 · Updated 4 years ago
- ☆46 · Mar 31, 2026 · Updated 2 weeks ago
- The official repo for the QuickStatements PHP/HTML/JS interface ☆52 · Apr 7, 2026 · Updated last week
- Python package to reconcile DataFrames ☆24 · Feb 15, 2023 · Updated 3 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆96 · Feb 5, 2026 · Updated 2 months ago
- PhD thesis: "Knowledge Graph Construction from Heterogeneous Data Sources exploiting Declarative Mapping Rules" ☆14 · Mar 24, 2022 · Updated 4 years ago
- ☆26 · May 24, 2018 · Updated 7 years ago
- Example SPARQL queries, mostly for working with ZBW data sets ☆16 · Oct 8, 2025 · Updated 6 months ago
- Crowdsourced data for open domain relation classification from sentences ☆20 · Oct 26, 2018 · Updated 7 years ago
- The second version of Chronas in beta stage ☆26 · Apr 9, 2026 · Updated last week
- A monolithic index that supports worst-case optimal joins (WCOJ) by providing all collation orders in a single redundancy eliminating dat… ☆16 · Sep 18, 2025 · Updated 7 months ago
- Twitter stream and social network crawling tools ☆17 · Nov 17, 2016 · Updated 9 years ago
- tesseractXplore a tesseract ease of use gui with full control ☆28 · Nov 10, 2021 · Updated 4 years ago
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition. ☆10 · Jul 15, 2020 · Updated 5 years ago
- Matching Tabular Data to Knowledge Graphs ☆20 · Apr 27, 2023 · Updated 2 years ago
- CodeQL and Binary Ninja scripts to accompany the blog post ☆11 · Feb 3, 2023 · Updated 3 years ago
- Extracting Entities with Limited Evidence ☆16 · Dec 26, 2022 · Updated 3 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at… ☆21 · Aug 1, 2024 · Updated last year
- ☆78 · Mar 6, 2023 · Updated 3 years ago
- Code and experiment data for ICDM'19 paper, tabular cell classification using pre-trained cell embeddings. Note that the code and data is… ☆29 · Jul 6, 2023 · Updated 2 years ago
- Forward messages to collaborators in East-oriented style. ☆14 · Oct 18, 2025 · Updated 6 months ago
- Highly concurrent and fast content processing for Mighty Inference Server ☆10 · Feb 6, 2023 · Updated 3 years ago
- Imports Wiktionary's grammatical data into Wikidata ☆18 · Jan 11, 2020 · Updated 6 years ago