☆62 · Jun 5, 2025 · Updated 9 months ago
Alternatives and similar repositories for raha
Users who are interested in raha are comparing it to the libraries listed below
- ☆12 · Jun 1, 2021 · Updated 4 years ago
- Project overview and links to various resources ☆21 · Nov 6, 2021 · Updated 4 years ago
- ☆10 · Oct 31, 2019 · Updated 6 years ago
- ☆12 · Jul 6, 2023 · Updated 2 years ago
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method ☆15 · Dec 24, 2023 · Updated 2 years ago
- Implementation of TANE for experimental purposes ☆15 · Apr 29, 2022 · Updated 3 years ago
- A Jupyter notebook extension to centralize and manage data ☆15 · Dec 22, 2022 · Updated 3 years ago
- Code to extract functional dependencies (FDs) and conditional functional dependencies (CFDs) from data ☆37 · Mar 24, 2021 · Updated 4 years ago
- Picket is a system that safeguards against data corruptions during both training and deployment of machine learning models over tabular d… ☆14 · Nov 24, 2020 · Updated 5 years ago
- ☆18 · Dec 3, 2015 · Updated 10 years ago
- A Generalized Data Cleaning System ☆51 · Apr 28, 2016 · Updated 9 years ago
- Data-Centric What-If Analysis for Native Machine Learning Pipelines ☆16 · Jun 14, 2023 · Updated 2 years ago
- Code for the paper "MultiEM: Efficient and Effective Unsupervised Multi-Table Entity Matching". ICDE 2024. ☆17 · Nov 5, 2023 · Updated 2 years ago
- Source code for Make it Easy: An Effective End-to-End Entity Alignment Framework. SIGIR 2021. ☆17 · Apr 15, 2021 · Updated 4 years ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond… ☆23 · May 31, 2022 · Updated 3 years ago
- Source code for several Metanome data profiling algorithms ☆59 · May 15, 2023 · Updated 2 years ago
- The design and algorithms used in LeCaR are described in this USENIX HotStorage'18 paper and talk slides: https://www.usenix.org/conferen… ☆28 · Jun 4, 2020 · Updated 5 years ago
- Seagate-tools stores source codes of tools such as PerfPro, PerfLine, etc. These tools are developed and used by the Engineering team to … ☆13 · May 3, 2024 · Updated last year
- Welcome to Snowman App – a Data Matching Benchmark Platform. ☆38 · Feb 9, 2023 · Updated 3 years ago
- The source repository of the Metanome tool ☆189 · Jun 5, 2025 · Updated 9 months ago
- The Llunatic Mapping and Cleaning Chase Engine ☆37 · Jan 12, 2024 · Updated 2 years ago
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat… ☆33 · Aug 23, 2025 · Updated 6 months ago
- Data source of the Energy Transition Model ☆18 · Feb 27, 2026 · Updated last week
- ☆10 · Aug 7, 2025 · Updated 6 months ago
- State-of-the-art neural cardinality estimators for join queries ☆80 · Oct 6, 2020 · Updated 5 years ago
- A new cardinality estimation scheme for join query estimation ☆42 · Dec 6, 2024 · Updated last year
- Symbolic Regression from Scratch with Python ☆13 · Dec 6, 2022 · Updated 3 years ago
- Benchmark AFLOW Data Sets for Machine Learning doi.org/10.1007/s40192-020-00174-4 ☆11 · Aug 29, 2020 · Updated 5 years ago
- ☆11 · Oct 12, 2013 · Updated 12 years ago
- Code used to create NDVI change detection maps from Sentinel-2 imagery on the Google Earth Engine platform. ☆13 · Dec 4, 2019 · Updated 6 years ago
- Designed to help lawyers and legal professionals find precedent fast and prepare for case negotiations by simulating trajectories ☆10 · Oct 16, 2024 · Updated last year
- ☆10 · Sep 23, 2020 · Updated 5 years ago
- Sentiment Analysis of COVID-19 Vaccine Tweets ☆12 · Mar 22, 2021 · Updated 4 years ago
- Some realistic tabular datasets for testing (CSV) ☆21 · Mar 7, 2018 · Updated 7 years ago
- A repository to store articles, links, and other resources the club finds helpful ☆10 · Apr 29, 2019 · Updated 6 years ago
- JVMCI examples for Java Day Tokyo 2017 ☆10 · Sep 30, 2019 · Updated 6 years ago
- VSCode extension for coredumpy ☆14 · Apr 1, 2025 · Updated 11 months ago
- NASA SEES (2021): CNN Mosquito Detection Research ☆12 · Mar 27, 2022 · Updated 3 years ago
- Load MovieLens dataset into Neo4j ☆10 · Oct 25, 2018 · Updated 7 years ago