Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.
☆81Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for databaker
Users that are interested in databaker are comparing it to the libraries listed below
Sorting:
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+☆48Feb 1, 2023Updated 3 years ago
- An easier way to tidying pivoted tables.☆29Jun 8, 2020Updated 5 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆393May 22, 2023Updated 2 years ago
- A DSL to build Lucene text queries in Python.☆38Jan 5, 2017Updated 9 years ago
- Focused Crawler for VT's CTRNet☆10May 13, 2013Updated 12 years ago
- 🦆 SQL for R dataframes, with ducks☆44Jan 11, 2024Updated 2 years ago
- Scraper built with Scrapy.☆18Aug 14, 2024Updated last year
- Spellchecker service based on hunspell for 90 languages☆10Oct 26, 2020Updated 5 years ago
- Track the keyword positions☆19Oct 26, 2013Updated 12 years ago
- Power BI Custom Connector for loading tables directly from Tabular Data Packages (Frictionless Data) into Power BI☆10Jun 16, 2020Updated 5 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- An online sentiment analyzer built with Flask and TextBlob☆15Sep 3, 2013Updated 12 years ago
- A lightweight Python script that fetches data from a Google spreadsheet, transforms to JSON, then optionally commits a data file to a Git…☆10Feb 23, 2023Updated 3 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news☆61Jun 10, 2021Updated 4 years ago
- A platform for collecting, analyzing, and visualizing social media data.☆13Dec 27, 2020Updated 5 years ago
- A Text Comprehension Engine in Python☆15Aug 23, 2015Updated 10 years ago
- Formatting extension for Quarto☆13Sep 22, 2023Updated 2 years ago
- API Server for the official Turkish-to-turkish dictionary by TDK☆12Jan 16, 2016Updated 10 years ago
- Compare coverage across different media sources using the Juicer☆12Apr 1, 2016Updated 9 years ago
- CKAN extension for the IATI Registry☆10Oct 23, 2025Updated 4 months ago
- Manage and load dataprotocols.org Data Packages☆27Sep 17, 2015Updated 10 years ago
- Language checker and hyphenator extension for LibreOffice☆12Jan 27, 2020Updated 6 years ago
- UNIX top-like app for nginx (or Apache, if you wish) access logs.☆14Sep 3, 2020Updated 5 years ago
- SiteMonitor tool for monitoring web scraping☆11Apr 23, 2019Updated 6 years ago
- The API and Data Collection tasks that power NewsLynx☆12Dec 15, 2015Updated 10 years ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable☆13Sep 15, 2023Updated 2 years ago
- A map of NYC rat sightings.☆35Mar 23, 2019Updated 6 years ago
- Events and Situations Ontology☆14Apr 20, 2018Updated 7 years ago
- Convert any image into a Region Adjacency Graph (RAG)☆12Apr 27, 2020Updated 5 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs☆17Nov 13, 2017Updated 8 years ago
- Technical question answering NLP bot☆13Sep 16, 2009Updated 16 years ago
- An SMS survey platform built by CfA Team Philadelphia☆65Dec 10, 2015Updated 10 years ago
- Measure is scripts and conventions to build KPI dashboards for projects.☆16Jul 14, 2020Updated 5 years ago
- Dexter document monitor for MMA☆16May 8, 2024Updated last year
- A ROS1/ROS2 compatible, RDFlib-backed knowledge base for robotic application. Mostly KB-API conformant.☆16Sep 12, 2025Updated 5 months ago
- Learning Social Media Analytics with R, published by Packt☆17Jan 30, 2023Updated 3 years ago
- Links parts of input text to Wikipedia articles☆16Sep 9, 2012Updated 13 years ago
- Advanced Spatial Analysis of Urban Systems at Northeastern University☆20Apr 16, 2021Updated 4 years ago