A set of distinct value estimators that give probabilistic bounds on a sets cardinality
☆22Dec 9, 2019Updated 6 years ago
Alternatives and similar repositories for countmemaybe
Users that are interested in countmemaybe are comparing it to the libraries listed below
Sorting:
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Mar 27, 2024Updated last year
- Tweets annotated with coarse-grained sense labels (supersenses)☆13Jun 13, 2014Updated 11 years ago
- Simple markup to web-friendly presentations that look great on mobile and on the big screen.☆20May 16, 2019Updated 6 years ago
- Pitman-Yor processes in python☆26Apr 18, 2014Updated 11 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Sep 30, 2016Updated 9 years ago
- C++ library for modeling with Pitman-Yor processes☆34Nov 28, 2017Updated 8 years ago
- implementations of a counting bloom, a timing bloom and a scaling timing bloom... made for working with streams!☆42Feb 1, 2017Updated 9 years ago
- RiTaJS: A generative language toolkit for JavaScript☆43Dec 20, 2020Updated 5 years ago
- A streaming cross-cat inference engine☆49Dec 19, 2014Updated 11 years ago
- Cloud Mining automatically builds exploratory faceted search systems.☆52Oct 15, 2013Updated 12 years ago
- The goal of this experiment is to take articles and certain metadata and group them by topic.☆11Apr 14, 2016Updated 9 years ago
- Specification to describe the minimum information standard for online community data. Guidelines for describing data about online communi…☆11Sep 19, 2016Updated 9 years ago
- ☆10May 31, 2015Updated 10 years ago
- Hidden alignment conditional random field for classifying string pairs.☆36Sep 6, 2017Updated 8 years ago
- Green SqlAlchemy extensions for pulsar☆11Nov 24, 2017Updated 8 years ago
- Analysis on stop reasons☆10Jun 17, 2024Updated last year
- A Go library for specialized integer hash maps.☆11Sep 15, 2016Updated 9 years ago
- Bicycle Incident reporting☆13Jul 22, 2022Updated 3 years ago
- Links to all the source code and solutions I reference in my O'Reilly Introduction to Docker video tutorial☆11Dec 10, 2014Updated 11 years ago
- Issue tracker for the Open Targets Platform☆13Jul 8, 2025Updated 7 months ago
- CWTS OpenAlex ETL data pipeline.☆16Oct 29, 2025Updated 4 months ago
- [ICME 2019] Source code and datasets for "Semi-supervised Compatibility Learning Across Categories for Clothing Matching"☆10Apr 26, 2024Updated last year
- rddapp: Regression Discontinuity Design Application☆11Sep 2, 2025Updated 6 months ago
- An open-source news aggregator☆15Sep 9, 2016Updated 9 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- ☆12Oct 25, 2015Updated 10 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- Digitization information system build on top of Fedora repository☆16Jan 15, 2019Updated 7 years ago
- MPC Server for PySpark inpired by the LakeSail☆17Feb 26, 2026Updated last week
- Focused Crawler for VT's CTRNet☆10May 13, 2013Updated 12 years ago
- The Zero Effort Network Library for Python☆66Mar 8, 2018Updated 7 years ago
- Visual SPARQL query tool☆10Feb 26, 2016Updated 10 years ago
- Library to extract text from HTML files☆11Dec 20, 2015Updated 10 years ago
- A collection of various discourse segmenters☆10Jun 30, 2017Updated 8 years ago
- Document management system. Based on bill tracking needs. Simple model for stages, priorities, authors, content (abstract, tags), releate…☆19Sep 16, 2014Updated 11 years ago
- Python based data warehouse solution for the Lambda Architecture.☆14Jun 24, 2015Updated 10 years ago
- CROMER (CROss-document Main Events and entities Recognition), is a tool for cross-document coreference☆12Jan 14, 2015Updated 11 years ago
- Micro-framework for publishing linked data☆11Aug 1, 2017Updated 8 years ago
- pymur is a Python interface to The Lemur Toolkit.☆19Sep 17, 2018Updated 7 years ago