datacoon / undatum
undatum: a command-line tool for data processing. Brings CSV simplicity to JSON lines and BSON
☆48 · Updated 2 months ago
Related projects
Alternatives and complementary repositories for undatum
- Python library and cmd tool to backup API calls ☆14 · Updated 4 months ago
- Python library to read, write and convert data files with formats BSON, JSON, NDJSON, Parquet, ORC, XLS, XLSX and XML ☆16 · Updated 3 months ago
- Extracts tables from .docx files and saves them as .csv or .xls files ☆61 · Updated last year
- NoSQL extract, transform, load (ETL) toolkit with Python ☆12 · Updated 3 weeks ago
- Russian names parsers, gender identification and processing tools ☆130 · Updated 11 months ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom… ☆44 · Updated 4 months ago
- Lazy helper tool to make easier scraping with simple tasks ☆18 · Updated 2 years ago
- Data catalog for everything in your company ☆51 · Updated last year
- Quick and dirty date parsing Python library to parse HTML dates really fast ☆20 · Updated last year
- Binary Python bindings for poppler utils for content extraction ☆42 · Updated 3 years ago
- Awesome list of Russian government open source projects (not only Github) ☆19 · Updated 3 years ago
- Awesome list of the software tools related to opendata: data catalogs, ingestion tools, data prep tools and so on ☆29 · Updated 2 months ago
- data wrangling simplicity, complete audit transparency, and at speed ☆35 · Updated 2 months ago
- Scripts to make specific datasets cleaner and more convenient ☆40 · Updated last year
- Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources ☆16 · Updated 11 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆52 · Updated 3 weeks ago
- Project on text topics evolution over time analysis ☆82 · Updated 2 years ago
- Script for integration with Logs API ☆46 · Updated 2 years ago
- Named-Entity Recognition extension for OpenRefine ☆24 · Updated last year
- Collecting and analysing open data stuff ☆13 · Updated 3 years ago
- Extract networks of entities from journalistic reporting ☆47 · Updated last year
- Building a registry of all domain names of the Russian Federation belonging to government authorities, state institutions, and regional … ☆48 · Updated 2 years ago
- Describe business metrics with YAML, query and visualize in Jupyter with zero SQL ☆21 · Updated 2 years ago
- International Address formatter which considers the standard formatting rules of the country ☆26 · Updated 3 years ago
- CSV on the web ☆37 · Updated last month
- Tasks for volunteers, interns, and anyone interested in working with open and big data, as well as any other tasks related to the topics of … ☆76 · Updated 5 years ago
- NLP project that works with news (NER, context generation, news trend analytics) ☆43 · Updated 2 years ago
- convtools is a specialized Python library for dynamic, declarative data transformations with automatic code generation ☆39 · Updated last month
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀 ☆27 · Updated 2 years ago
- Open data resources in Russian ☆211 · Updated 2 years ago