Source code for several Metanome data profiling algorithms
☆59May 15, 2023Updated 2 years ago
Alternatives and similar repositories for metanome-algorithms
Users that are interested in metanome-algorithms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository provides the implementation of several well-know INDs discovery algorithms☆14Nov 5, 2019Updated 6 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 2 years ago
- FDX, SIGMOD 2020☆20May 3, 2024Updated last year
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆24May 31, 2022Updated 3 years ago
- The BART Project: Benchmarking Algorithms for (data) Repairing and Translation☆42Nov 27, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A Machine Learning System for Data Enrichment.☆533Jul 20, 2023Updated 2 years ago
- Pollock is a benchmark for data loading on character-delimited files.☆25Apr 9, 2025Updated 11 months ago
- ☆62Jun 5, 2025Updated 9 months ago
- TPCH benchmark adapted to Clickhouse SQL syntax☆10Jul 4, 2022Updated 3 years ago
- Code repository for Mondrian, a project for multiregion template recognition in spreadsheets.☆14May 25, 2022Updated 3 years ago
- Implementation of the G-CORE graph query language on Spark☆15Aug 25, 2021Updated 4 years ago
- ☆13May 26, 2017Updated 8 years ago
- S2RDF (SPARQL on Spark for RDF) is a SPARQL query processor for Hadoop based on Spark SQL. It uses the relational interface of Spark for …☆13Apr 21, 2018Updated 7 years ago
- A systematic Benchmarking on the performance of Spark-SQL for processing Vast RDF datasets☆14Jun 29, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A Generalized Data Cleaning System☆51Apr 28, 2016Updated 9 years ago
- 本项目对Deepseek-R1-Distill-Qwen-7B进行心理咨询CoT数据的LoRA微调,以进一步提升Deepseek-R1-Distill-Qwen-7B在心理咨询领域的慢思考能力。☆12Mar 11, 2025Updated last year
- Algorithms for approximate nearest neighbor search with window filters☆45Feb 5, 2024Updated 2 years ago
- modeling the behavior of stock markets: create a market simulator, technical indicator, and a strategy that generates orders☆19Feb 15, 2018Updated 8 years ago
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆21Mar 10, 2025Updated last year
- ☆11Jul 27, 2023Updated 2 years ago
- Udacity Artificial Intelligence Nanodegree - May 2017☆16Nov 1, 2017Updated 8 years ago
- List of papers on cryptography assisted deep learning privacy computation☆18Dec 29, 2025Updated 3 months ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints.☆25Aug 7, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Jun 14, 2023Updated 2 years ago
- Architecture of Streaming Twitter Data into Apache Kafka cluster, performing simple sentiment analysis with afinn module, storing the dat…☆20Jan 3, 2020Updated 6 years ago
- ☆12Jun 1, 2021Updated 4 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Mar 16, 2016Updated 10 years ago
- A Benchmark for Joint Data Cleaning and Machine Learning☆50Jun 18, 2024Updated last year
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 8 years ago
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆307Apr 17, 2024Updated last year
- Reinforcement Learning (RL), allows you to develop smart, quick and self-learning systems in your business surroundings. It is an effecti…☆12Sep 25, 2019Updated 6 years ago
- ScalaIO 2014 Workshop☆25Oct 23, 2014Updated 11 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Mastering PyTorch for Deep Learning, Published by Packt☆14Jan 14, 2021Updated 5 years ago
- ☆19Jun 14, 2024Updated last year
- Python package for performing Entity and Text Matching using Deep Learning.☆615Jun 18, 2024Updated last year
- RDF storage and SPARQL processing on top of Apache Spark.☆20Oct 5, 2022Updated 3 years ago
- Code repository for Large Scale Machine Learning with Spark by Packt☆20Oct 31, 2022Updated 3 years ago
- A collection of metric learning papers.☆20Apr 24, 2023Updated 2 years ago
- An android app which converts text/voice input to American Sign Language(ASL)☆14Sep 8, 2016Updated 9 years ago