ahirner / TabulaRazr-OSLinks
Extract tabular data and semantically discover it with ease! (OS)
☆21Updated 9 years ago
Alternatives and similar repositories for TabulaRazr-OS
Users that are interested in TabulaRazr-OS are comparing it to the libraries listed below
Sorting:
- Evaluating the performance and accuracy of ABBYY FineReader's OCR on Senate Financial Disclosure scanned forms☆132Updated 9 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆30Updated 6 years ago
- Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.☆84Updated 9 years ago
- The ultimate twitter streaming data collector☆40Updated 8 years ago
- Supervised learning for novelty detection in text☆78Updated 8 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- Entity Extraction Text Processor☆147Updated last year
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.☆79Updated 2 years ago
- Material for some talks I have given☆62Updated 10 months ago
- Collecting thoughts about data versioning☆108Updated 6 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆390Updated 2 years ago
- A Python implementation of a political forecasting model by Scholz, Calbert & Smith.☆11Updated 9 years ago
- just put my data in a database!☆39Updated 9 years ago
- ☆92Updated 9 years ago
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+☆48Updated 2 years ago
- Agent-based modelling for resource allocation in viral crises to investigate resource allocation and policy interventions with respect to…☆63Updated 5 years ago
- Parser and standardizer for politician, individual and organization names.☆129Updated 8 years ago
- ☆24Updated 10 years ago
- Analyze topics and trends in news with NLP☆49Updated 2 years ago
- We introduce TACIT: An Open-Source Text Analysis, Crawling and Interpretation Tool. TACIT's plugin architecture has three main components…☆107Updated 6 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- A collection of tools for mining government data☆140Updated 9 years ago
- Using ML to extract campaign finance data from messy forms for journalism☆76Updated 3 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago
- A highly configurable Google Cloud Dataflow pipeline that writes data into Google Big Query table from Pub/Sub☆67Updated 7 years ago
- Clicks-Attention-Satisfaction Evaluation Model and Metric☆77Updated 8 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly☆47Updated 4 years ago
- Language Lego☆141Updated 5 years ago
- REST API for Text Summarization and Keywords Extraction☆16Updated 2 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research.☆86Updated 7 years ago