mwildehahn / mysql-dump-to-csvLinks
Script to parse a mysql dump and generate CSVs for all of the tables in the dump
☆21Updated 6 years ago
Alternatives and similar repositories for mysql-dump-to-csv
Users that are interested in mysql-dump-to-csv are comparing it to the libraries listed below
Sorting:
- C library for efficient string matching with Aho-Corasick☆21Updated 13 years ago
- Nginx upstream module for Sphinx 2.x☆42Updated 11 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 3 years ago
- iCQA - Intelligent Community Question Answering Framework☆31Updated 8 years ago
- Import GeoNames.org data into a SQLite database for full-text search and autocomplete☆35Updated 6 years ago
- Mirror of https://gerrit.wikimedia.org/g/mediawiki/php/luasandbox See https://www.mediawiki.org/wiki/Developer_access for contributing☆19Updated last year
- Pre-built Scrapy spiders for AutoExtract☆19Updated last year
- A python implementation of DEPTA☆83Updated 8 years ago
- Mad (╯°□°)╯'ing☆10Updated 2 years ago
- Paginating the web☆37Updated 11 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- A high performance indexing and search system for managing big data☆17Updated 6 years ago
- Wikipedia citation tool for Google Books, New York Times, ISBN, DOI and more☆22Updated 8 years ago
- Similarity hashing☆49Updated 13 years ago
- The complete Buddycloud stack in a VM☆23Updated 9 years ago
- code and data used to build a training dataset for dragnet models☆10Updated 4 years ago
- ☆12Updated 8 years ago
- Open Source Implementation of Simhash in Python☆24Updated 7 years ago
- google all pairs similarity search package, with swig bindings☆22Updated 10 years ago
- Content Extraction via Text Density (SIGIR11)☆25Updated 9 years ago
- Finds the Jaro Winkler Distance indicating a distance or similarity score between two strings.☆26Updated 3 months ago
- Extended tsvector type for PostgreSQL☆23Updated 4 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 8 years ago
- An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractors☆35Updated 10 years ago
- A flexible implementation of enhanced suffix arrays in template based C++. Supports single and multi-position wildcard. Fast queries than…☆20Updated 4 years ago
- A fast python implementation of the SimHash algorithm.☆27Updated 3 years ago
- Wrapper for pdftohtml that tries to extract paragraph structure☆50Updated 6 years ago
- pure luajit bloom filter implementation☆10Updated 9 years ago
- Language Detection based on Chromium's Compact Language Detector library☆106Updated 4 years ago
- Tools to analyze web archives☆20Updated 8 years ago