Performs unique entity estimation following the method of Chen, Shrivastava, and Steorts (2018).
☆15 · Feb 21, 2019 · Updated 7 years ago
Alternatives and similar repositories for fasthash
Users who are interested in fasthash are comparing it to the libraries listed below.
- A maximum-strength name parser for record linkage. ☆39 · Sep 3, 2025 · Updated 6 months ago
- Extract statistics from Wikipedia Dump files. ☆26 · Aug 2, 2021 · Updated 4 years ago
- Phonetic Spelling Algorithms in R ☆32 · May 12, 2024 · Updated last year
- Scrapy extension that gives you all the scraping monitoring, alerting, scheduling, and data validation you will need straight out of the… ☆38 · Dec 22, 2025 · Updated 2 months ago
- Implement the rquery piped query algebra in R using data.table. Distributed under choice of GPL-2 or GPL-3 license. ☆38 · Aug 20, 2023 · Updated 2 years ago
- ☆16 · Oct 22, 2025 · Updated 4 months ago
- Automate downloading of meteorological/hydrological datasets from IMGW-PIB ☆12 · Aug 11, 2020 · Updated 5 years ago
- SQLiteFlow Support ☆13 · Oct 31, 2022 · Updated 3 years ago
- A package for Bilateral and Multilateral Price Index Calculations ☆11 · Feb 18, 2026 · Updated 2 weeks ago
- Exploratory Data Analysis of Time Series Data and Forecasting using Naïve Approach, Moving Average Method, Simple Exponential Smoothenin… ☆12 · Jul 2, 2018 · Updated 7 years ago
- Introduction to programming using R ☆10 · Mar 14, 2024 · Updated last year
- SQL Server T-SQL scripts to create the data warehouse dimensions or business intelligence tables and load the tables with data. ☆11 · May 24, 2023 · Updated 2 years ago
- Datamallet is a Python library which contains several helper functions and modules for the common tasks in a typical data science workflow… ☆11 · May 19, 2022 · Updated 3 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ ☆13 · May 24, 2024 · Updated last year
- Deploy a Ceramic daemon to AWS ☆13 · Apr 18, 2023 · Updated 2 years ago
- Use an Adafruit PyPortal to show what’s currently playing on WBGO, or other NPR stations. ☆11 · Feb 19, 2021 · Updated 5 years ago
- Simple CORPORA list crawler ☆10 · Dec 2, 2016 · Updated 9 years ago
- Python bindings for the Unitex/GramLab corpus processor ☆10 · Nov 25, 2022 · Updated 3 years ago
- Hackable, local, ai-enabled notes app ☆17 · Feb 15, 2026 · Updated 2 weeks ago
- Contains source code and data used in ISM-4402 (Business Intelligence) ☆12 · Sep 4, 2019 · Updated 6 years ago
- ☆10 · Jul 2, 2019 · Updated 6 years ago
- This work is to modularize the Trusted Authentication work for Tableau Server, save your time for ticket exchanging! ☆10 · May 2, 2024 · Updated last year
- Drawbridge is a lightweight API gateway written in Go ☆12 · Apr 18, 2020 · Updated 5 years ago
- ☆12 · Dec 26, 2023 · Updated 2 years ago
- Spatial Seemingly Unrelated Regressions ☆11 · Apr 22, 2022 · Updated 3 years ago
- Some useful websites for programmers. ☆12 · Dec 28, 2018 · Updated 7 years ago
- Chatbot for voice-enabled conversations ☆10 · May 23, 2025 · Updated 9 months ago
- A JavaScript library for writing and testing brigade.js files for Brigade v1 ☆13 · Jun 1, 2022 · Updated 3 years ago
- A Pirate Weather workflow for Alfred ☆12 · Mar 29, 2023 · Updated 2 years ago
- ☆43 · Apr 20, 2021 · Updated 4 years ago
- Datakit plugin to help manage Github integration on data projects. ☆12 · Dec 6, 2022 · Updated 3 years ago
- Docs, notes and resources that don't fit elsewhere. ☆13 · May 23, 2023 · Updated 2 years ago
- Example repository of using GitHub Actions to post to Bluesky from R ☆10 · Sep 22, 2023 · Updated 2 years ago
- Set of plugins helping to work with imaging data in Airflow. ☆15 · Jul 10, 2024 · Updated last year
- My remote docker workstation ☆10 · Jul 10, 2019 · Updated 6 years ago
- ☆11 · Aug 13, 2018 · Updated 7 years ago
- A Python 3 compatible fork of https://launchpad.net/pymeta ☆18 · Jan 9, 2019 · Updated 7 years ago
- Homebrew Tap for argo ☆18 · Aug 28, 2025 · Updated 6 months ago
- Deploying a simple FastAPI app to Fly.io >> https://fly-fastapi.fly.dev/docs << ☆14 · Oct 2, 2023 · Updated 2 years ago