datacoon / undatum
undatum: a command-line tool for data processing. Brings CSV simplicity to JSON lines and BSON
☆47Updated 6 months ago
Alternatives and similar repositories for undatum:
Users who are interested in undatum are comparing it to the repositories listed below.
- Python library and cmd tool to backup API calls☆15Updated 8 months ago
- Extracts tables from .docx files and saves them as .csv or .xls files☆61Updated last year
- Registry of data portals, catalogs, and data repositories, including a description standard for data catalogs and datasets☆41Updated 2 months ago
- NoSQL extract, transform, load (ETL) toolkit with Python☆12Updated 4 months ago
- Python library to read, write and convert data files with formats BSON, JSON, NDJSON, Parquet, ORC, XLS, XLSX and XML☆16Updated 7 months ago
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom…☆44Updated 8 months ago
- Lazy helper tool that makes scraping easier for simple tasks☆18Updated 2 years ago
- Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources☆17Updated last year
- Awesome list of Russian government open source projects (not only GitHub)☆20Updated 3 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 3 years ago
- Data catalog for everything in your company☆50Updated last year
- data wrangling simplicity, complete audit transparency, and at speed☆34Updated last week
- Creating a registry of all domain names of the Russian Federation belonging to government authorities, state institutions, as well as regional …☆49Updated 2 years ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- Universal parser that converts declarations into a format for submission to Declarator.☆18Updated 4 months ago
- https://habr.com/ru/post/279833/☆38Updated 8 years ago
- ☆16Updated 6 months ago
- City guide generated from Instagram photos☆12Updated 5 years ago
- Parser for road traffic accident statistics from stat.gibdd.ru☆84Updated last year
- Scripts to make specific datasets cleaner and more convenient☆41Updated 2 years ago
- Tasks for volunteers, interns, and anyone interested in working with open and big data, as well as all other tasks related to the topics of …☆76Updated 5 years ago
- A search engine for Open Data☆53Updated 2 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆57Updated 3 months ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆32Updated 2 years ago
- Run Datasette on AWS serverless.☆18Updated 4 years ago
- Data validation as a service. Project retired; go to the current one at frictionsless/repository☆69Updated 2 years ago
- A markdown wiki and dashboarding system for Datasette☆21Updated 3 years ago
- VisiData interface for databases☆66Updated last year
- Awesome list of the software tools related to opendata: data catalogs, ingestion tools, data prep tools and so on☆33Updated 7 months ago
- Script for integration with Logs API☆46Updated 2 years ago