ArcheType uses LLMs to automatically assign custom labels to your tabular data
☆19 May 21, 2025 Updated 10 months ago
Alternatives and similar repositories for ArcheType
Users who are interested in ArcheType are comparing it to the libraries listed below.
- Implementation of SANTOS: Relationship-based Semantic Table Union Search. ☆13 Nov 21, 2023 Updated 2 years ago
- MARVIS (Modality Adaptive Reasoning over VISualizations) is an 'everything predictor' powered by VLMs + embeddings ☆14 Feb 20, 2026 Updated last month
- TuneTables is a tabular classifier that implements prompt tuning for frozen prior-fitted networks. ☆23 Mar 31, 2025 Updated 11 months ago
- Annotating Columns with Pre-trained Language Models ☆34 Jun 10, 2022 Updated 3 years ago
- This repository contains code and data for reproducing the experiments of three papers that focus on two subtasks of table annotation: co… ☆12 Mar 5, 2025 Updated last year
- Resources for PVLDB 2023 submission ☆27 Aug 28, 2024 Updated last year
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond… ☆24 May 31, 2022 Updated 3 years ago
- This is a repository for the Geospatial Data Abstraction Library (GDAL) and its applications, examples and discussions in the world of s… ☆10 May 28, 2023 Updated 2 years ago
- Turns any GeoJSON shape into a list of geohashes ☆10 Jan 8, 2023 Updated 3 years ago
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering" ☆15 Oct 2, 2025 Updated 5 months ago
- ☆14 Jul 25, 2021 Updated 4 years ago
- This project aims at predicting correlated column pairs in data tables by analyzing column names via large language models. ☆11 Aug 21, 2023 Updated 2 years ago
- Official implementation of Neuronal Time-Invariant Representations (NeuPRINT), NeurIPS 2023 ☆10 Mar 10, 2026 Updated last week
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula… ☆11 Oct 18, 2022 Updated 3 years ago
- ☆11 Jul 8, 2024 Updated last year
- ☆12 Jul 7, 2024 Updated last year
- source code and data ☆15 Jan 16, 2019 Updated 7 years ago
- MINERS ⛏️: The semantic retrieval benchmark for evaluating multilingual language models. (EMNLP 2024 Findings) ☆14 Oct 3, 2024 Updated last year
- ☆10 Oct 31, 2019 Updated 6 years ago
- The public repository for the Proteomic Data Commons UI and APIs ☆17 Feb 19, 2026 Updated last month
- Fuzzy Categorical Distances ☆14 Mar 31, 2020 Updated 5 years ago
- Behavioral analysis via self-supervised pretraining of transformers ☆23 Feb 27, 2026 Updated 3 weeks ago
- Deploy the marketing analytics application, CRMint ☆15 Feb 24, 2026 Updated 3 weeks ago
- ☆11 Apr 12, 2023 Updated 2 years ago
- This repository holds the annotated spreadsheet files, comprising the DECO dataset. ☆13 Mar 21, 2019 Updated 7 years ago
- Kleis is a python package to label keyphrases in scientific text. ☆15 Sep 9, 2020 Updated 5 years ago
- Glottolog data as CLDF StructureDataset ☆16 Mar 2, 2026 Updated 2 weeks ago
- Complete example code for an article for mkdev ☆12 Apr 22, 2022 Updated 3 years ago
- ☆15 Apr 26, 2025 Updated 10 months ago
- Foilboard: Kite/Wind Surf Hydrofoil Board Simulator ☆16 Dec 25, 2021 Updated 4 years ago
- ☆16 Nov 2, 2020 Updated 5 years ago
- ☆23 May 9, 2024 Updated last year
- ☆28 May 27, 2024 Updated last year
- "Head-to-Tail How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs?" (NAACL 2024) ☆17 Jul 1, 2024 Updated last year
- Framework for agentic coding supporting many popular agent coding tools. ☆36 Updated this week
- ☆61 Aug 17, 2022 Updated 3 years ago
- ☆22 Oct 3, 2023 Updated 2 years ago
- Fast, lean, efficient geohash C library ☆28 Jul 19, 2021 Updated 4 years ago
- The source code of the Sudowoodo paper in ICDE 2023 ☆18 May 24, 2023 Updated 2 years ago