datacoon / undatum
undatum: a command-line tool for data processing. Brings CSV simplicity to NDJSON, BSON, XML and other data files
☆50 Updated 2 weeks ago
Alternatives and similar repositories for undatum
Users that are interested in undatum are comparing it to the libraries listed below
- Extracts tables from .docx files and saves them as .csv or .xls files ☆65 Updated 2 years ago
- Python library and cmd tool to backup API calls ☆18 Updated 2 months ago
- Python library to read, write and convert data files with formats BSON, JSON, NDJSON, Parquet, ORC, XLS, XLSX, XML and many others ☆27 Updated last week
- Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully custom… ☆45 Updated last month
- NoSQL extract, transform, load (ETL) toolkit with Python ☆15 Updated 2 weeks ago
- Registry of data portals, catalogs, data repositories including data catalogs dataset and catalog description standard ☆48 Updated 3 weeks ago
- Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources ☆18 Updated last month
- Awesome list of Russian government open source projects (not only Github) ☆20 Updated 4 years ago
- Awesome list of the software tools related to opendata: data catalogs, ingestion tools, data prep tools and so on ☆35 Updated 3 months ago
- Quick and dirty date parsing Python library to parse HTML dates really fast ☆21 Updated last month
- NLP project that works with news (NER, context generation, news trend analytics) ☆42 Updated 3 years ago
- data wrangling simplicity, complete audit transparency, and at speed ☆35 Updated 4 months ago
- Data catalog for everything in your company ☆50 Updated 2 years ago
- MkDocs plugin to generate semantic reference Markdown pages from a knowledge graph ☆40 Updated last year
- easypy makes python even easier! ☆17 Updated 6 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆66 Updated last week
- Binary Python bindings for poppler utils for content extraction ☆42 Updated 4 years ago
- python functions for applied use of schema.org ☆38 Updated 4 years ago
- https://habr.com/ru/post/279833/ ☆38 Updated 9 years ago
- Opendata resources in Russian / Open data in the Russian language ☆221 Updated 4 years ago
- Arrange the pieces of the world! ☆62 Updated 2 years ago
- Comprehensive markdown-based documentation toolkit ☆175 Updated 2 months ago
- Build a better understanding of your data in PostgreSQL. ☆27 Updated 4 years ago
- A curated list of awesome resources that feature usages of Semantic Web technologies in business cases (applications) ☆21 Updated 6 years ago
- Lazy helper tool to make easier scraping with simple tasks ☆19 Updated 3 years ago
- Tasks for volunteers/interns/anyone interested in working with open and big data, as well as all other tasks related to the topics of … ☆77 Updated 6 years ago
- The framework for building app backends and microservices by specification-first API design approach based on the OpenAPI Specification 3 ☆25 Updated 5 years ago
- Building a registry of all domain names of the Russian Federation belonging to government authorities, state institutions, as well as regional … ☆55 Updated 3 years ago
- Tools for generating CSV and other flat versions of the structured data ☆109 Updated last month
- Code for my smart growbox experiment ☆32 Updated 4 years ago