logv / snorkel
UI for interactive data analysis | https://snorkel.logv.org
☆162Updated 11 months ago
Alternatives and similar repositories for snorkel:
Users that are interested in snorkel are comparing it to the libraries listed below
- Interactive visualization framework for Runway models of distributed systems☆188Updated 3 years ago
- Timberlake is a Job Tracker for Hadoop.☆177Updated 5 years ago
- Metrics Query Engine☆171Updated last year
- The Chronix Server implementation that is based on Apache Solr.☆265Updated 5 years ago
- Automated fault monitoring and leader-election system for strongly-consistent, highly-available writes to PostgreSQL (Joyent SDC, Manta).☆227Updated 2 months ago
- type-checked dictionary templating library for python☆91Updated last year
- ScalienDB is a scalable, replicated datastore.☆86Updated 12 years ago
- C network daemon for HyperLogLogs☆449Updated 4 years ago
- Packer + Terraform scripts to experiment with FDB clusters in the cloud☆24Updated 6 years ago
- A general-purpose data analysis engine radically changing the way batch and stream data is processed☆7Updated 6 years ago
- A key/value store for serving static batch data☆175Updated last year
- HyperBitBit☆133Updated 7 years ago
- Event aggregation and indexing system☆53Updated 6 years ago
- ☆172Updated 10 years ago
- counters and logarithmically bucketed histograms for distributed systems☆84Updated 7 years ago
- Tools for working with parquet, impala, and hive☆134Updated 4 years ago
- Notes from VLDB conference☆30Updated 9 years ago
- HyperMinHash: Bringing intersections to HyperLogLog☆302Updated 6 years ago
- hokusai -- sketching streams in real-time☆78Updated 7 years ago
- Consus is a geo-replicated transactional key-value store.☆226Updated 6 years ago
- Time Adaptive Sketches (Ada-Sketches) for Summarizing Data Streams☆37Updated 7 years ago
- Packer + Terraform setup to experiment with FDB clusters in the cloud.☆26Updated 5 years ago
- A consistent-hashing relay for statsd and carbon metrics☆101Updated 4 years ago
- Query engine for TrailDB☆51Updated 6 years ago
- Ringbuffer-backed interactive data pipeline☆124Updated 9 years ago
- A logger for use with daemontools.☆77Updated last year
- Quickly detect already witnessed data.☆157Updated 7 months ago
- S3 backed key/value database for infrequent read access☆170Updated 7 years ago
- Gremlins is a python framework for fault-testing distributed systems☆122Updated 10 years ago
- A Cascading Workflow Visualizer☆83Updated last year