kootenpv / shrynk
Using Machine Learning to learn how to Compress
☆109 · Updated last year
Alternatives and similar repositories for shrynk:
Users who are interested in shrynk are comparing it to the libraries listed below.
- Bidirectionally transformed strings ☆367 · Updated last year
- 🐍 Python library implementing sorted containers with state-of-the-art query performance and compressed memory usage ☆210 · Updated 9 months ago
- Wolfsort is a stable adaptive hybrid radix / merge sort. ☆193 · Updated 6 months ago
- Set up the CTRL text-generating model on Google Compute Engine with just a few console commands. ☆151 · Updated 5 years ago
- Search for similar short strings ☆53 · Updated 4 years ago
- Blazing fast, composable, Pythonic quantile filters. ☆135 · Updated last year
- Python stream processing for humans ☆184 · Updated this week
- Automated Outlier Detection and Treatment Tool ☆101 · Updated 2 years ago
- Interactive Computing for Humans ☆70 · Updated 3 years ago
- DISCoHAsH - Simple, fast, quality hash in 120 lines. 10GB/s serial (depending on hardware). Also in NodeJS ☆219 · Updated last year
- A Python schema-based machine learning library ☆74 · Updated 6 years ago
- Visualization of filters in convolutional neural networks ☆30 · Updated 10 months ago
- ☆19 · Updated 4 years ago
- A web app to create and browse text visualizations for automated customer listening. ☆148 · Updated last year
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search appl… ☆65 · Updated 6 months ago
- Beamsplitter - A new (possibly universal) hash that passes SMHasher. Built mainly with a random 10x64 S-box. Also in NodeJS ☆90 · Updated 3 months ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings ☆76 · Updated 2 years ago
- Python bindings for xorfilter (faster and smaller than bloom and cuckoo filters) ☆112 · Updated 3 months ago
- ☆110 · Updated 3 years ago
- Keyvi - the key value index. It is an in-memory FST-based data structure highly optimized for size and lookup performance. ☆241 · Updated last week
- Command-line tool to remotely execute code in the cloud ☆134 · Updated 2 years ago
- Feature engineering and machine learning: together at last! ☆24 · Updated 4 years ago
- A fast and memory-optimized string library for heavy-text manipulation in Python ☆250 · Updated 4 years ago
- All kinds of survival analysis distributions and methods to optimize how long to wait for them. ☆39 · Updated 3 years ago
- Python bindings for the fast integer compression library FastPFor. ☆57 · Updated last year
- Like a Python list but better. ☆181 · Updated 3 years ago
- Utility function to parallelise pipelines of Python asyncio iterators/generators ☆113 · Updated 4 years ago
- Python bindings for simdjson using libpy ☆63 · Updated 2 years ago
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable. ☆169 · Updated 11 months ago
- Succinct, compact, and compressed data structures for data-intensive applications ☆60 · Updated 4 years ago