☆13Feb 25, 2022Updated 4 years ago
Alternatives and similar repositories for DADER
Users that are interested in DADER are comparing it to the libraries listed below
Sorting:
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆23May 31, 2022Updated 3 years ago
- Resources for PVLDB 2023 submission☆24Aug 28, 2024Updated last year
- ☆18Jun 17, 2024Updated last year
- ☆32Apr 15, 2023Updated 2 years ago
- The approach involves the usage of Multi-Criteria Decision Analyses, including Weighted Sum Model (WSM), Weighted Product Model (WPM) and…☆11Oct 22, 2021Updated 4 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆32Apr 5, 2023Updated 2 years ago
- Code to extract functional dependencies (FDs) and conditional functional dependencies (CFDs) from data☆37Mar 24, 2021Updated 4 years ago
- Redefining Video Management with power of SQL☆11Oct 15, 2023Updated 2 years ago
- Code for the paper "CollaborEM: A Self-supervised Entity Matching Framework Using Multi-features Collaboration". TKDE 2021.☆41Jul 12, 2022Updated 3 years ago
- this is my repository for Amazon review helpfulness prediction model☆11Sep 14, 2017Updated 8 years ago
- Repo - Paper "Capturing Semantics for Imputation with Pre-trained Language Models." [ICDE 2021]☆10Mar 13, 2022Updated 3 years ago
- [NeurIPS 2022] disentanglement evaluation robust to model dimension variance.☆10Sep 21, 2022Updated 3 years ago
- [Machine Learning 2023] NaCL: Noise-Robust Cross-Domain Contrastive Learning for Unsupervised Domain Adaptation☆12Jul 8, 2023Updated 2 years ago
- [VLDB'22] Cardinality Estimation of Approximate Substring Queries using Deep Learning.☆10Jul 18, 2025Updated 7 months ago
- Enabling Live Migration for Computational Notebooks.☆14Mar 11, 2024Updated last year
- 本项目对Deepseek-R1-Distill-Qwen-7B进行心理咨询CoT数据的LoRA微调,以进一步提升Deepseek-R1-Distill-Qwen-7B在心理咨询领域的慢思考能力。☆12Mar 11, 2025Updated 11 months ago
- a distributed computation platform for running Python and Bash computation tasks on multiple nodes☆12Mar 19, 2025Updated 11 months ago
- VSCode extension for coredumpy☆14Apr 1, 2025Updated 11 months ago
- Easy plot Italian maps with R and ggplot package☆10May 23, 2016Updated 9 years ago
- AirIndex: Versatile Index Tuning Through Data and Storage☆10Dec 18, 2024Updated last year
- StackStorm packs to automate sequencing center operations☆10Dec 6, 2022Updated 3 years ago
- Generic parse tree, configurable lexer, `lemon` parser generator, wrapped for C++17 and Python 3.☆15Apr 26, 2021Updated 4 years ago
- Make your own music just by waving your arms in front of a webcam.☆12Dec 8, 2020Updated 5 years ago
- The code of our AAAI'20 paper "GraphER: Token-Centric Entity Resolution with Graph Convolutional Neural Networks"☆11Aug 10, 2020Updated 5 years ago
- ☆12Apr 30, 2024Updated last year
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆21Mar 10, 2025Updated 11 months ago
- Master thesis - reproducing state-of-the-art schema matching algorithms☆14Jul 6, 2023Updated 2 years ago
- Compute Shapley-Shorrocks value decompositions☆14Jan 7, 2024Updated 2 years ago
- MCAN☆12Oct 11, 2025Updated 4 months ago
- Scalable data valuation using optimal transport (ICLR 2025)☆13Jul 15, 2025Updated 7 months ago
- Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework☆12May 7, 2025Updated 9 months ago
- This repo holds the code, dataset, and running scripts for fast k-means evaluation☆15May 20, 2022Updated 3 years ago
- Continuous Benchmark of Filtering methods for Entity Resolution☆11Jul 20, 2025Updated 7 months ago
- [WACV 2024 Oral] Rethinking Visibility in Human Pose Estimation: Occluded Pose Reasoning via Transformers☆14Jul 6, 2024Updated last year
- A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search☆21Jul 22, 2025Updated 7 months ago
- ☆11May 11, 2022Updated 3 years ago
- 不到100行代码实现一个Python迷你内网穿透、反向正向代理小工具☆12May 27, 2023Updated 2 years ago
- Towards Efficient Shapley Value Estimation via Cross-contribution Maximization☆14Jul 8, 2022Updated 3 years ago
- A comprehensive open-source cache trace dataset☆22Aug 23, 2025Updated 6 months ago