Turn pdf document into simple annotated XML for further processing in a corpus preparation pipeline.
☆13Nov 19, 2019Updated 6 years ago
Alternatives and similar repositories for trickypdf
Users that are interested in trickypdf are comparing it to the libraries listed below
Sorting:
- GermaParl: Corpus of Plenary Protocols of the German Bundestag (TEI Format)☆37Jun 1, 2023Updated 2 years ago
- Knack Toolkit Library☆31Feb 26, 2026Updated last week
- Windows Batch script to install and setup the Splunk Universal Forwarder☆11Feb 24, 2020Updated 6 years ago
- Materials for the 2022 GESIS Training workshop "Tools and Workflows for Reproducible Research in the Quantitative Social Sciences"☆20Nov 19, 2022Updated 3 years ago
- Free cybersecurity training resources☆12Feb 5, 2020Updated 6 years ago
- A plugin to integrate Facebook with MyBB, letting users login and register through Facebook.☆27Aug 7, 2020Updated 5 years ago
- Terraform playbook of a vulnerable Azure deployment☆10Apr 28, 2022Updated 3 years ago
- Github Pages deployment for Ansible Best Practices☆12Jan 12, 2026Updated last month
- Guide for fixing 99-100% of cracking sound issues on Dell XPS 15 9570☆11Nov 1, 2018Updated 7 years ago
- Corpus In A Box: Automated Tools, Tutorials, & Advising☆11Dec 1, 2022Updated 3 years ago
- Create and analyze argument graphs and serialize them via Protobuf☆10Updated this week
- A repository for text_processing tools used by crow☆12Mar 21, 2025Updated 11 months ago
- Some basic CI for Splunk Apps.☆11Jan 8, 2020Updated 6 years ago
- R-package for text mining with the Corpus Workbench (CWB) as backend☆49Mar 26, 2025Updated 11 months ago
- Proof of concept tool used for phishing multi-factor authentication on O365☆14Aug 8, 2018Updated 7 years ago
- A fully programmable, multi-platform, syntax-slick modern language. Let’s finish this strong. 💪☆21Jun 15, 2025Updated 8 months ago
- ULMFiT Method for German Language☆15May 10, 2019Updated 6 years ago
- Eine R/Shiny App um Daten von der Website WG-Gesucht zu scrapen und Statistiken dazu anzuzeigen☆14Jul 4, 2022Updated 3 years ago
- Fan monitor for some Dell laptops☆14Dec 23, 2025Updated 2 months ago
- Measuring Emotion in Parliamentary Debates with Automated Textual Analysis☆18Apr 7, 2022Updated 3 years ago
- GE 2015/17 + EU Ref voter density shiny app☆14Nov 15, 2017Updated 8 years ago
- Example of (micro)services to do conversion from Microsoft Word Docx files to PDF using products on Google Cloud Platform☆20Apr 26, 2019Updated 6 years ago
- A template to write a reproducible paper in R Markdown.☆18Jun 20, 2023Updated 2 years ago
- German parliament (Bundestag and Bundesrat) legislative tracker. Also check the updated crawlers at http://github.com/bundestag☆29Sep 23, 2024Updated last year
- The `hp2xx' program is a versatile tool to convert vector-oriented graphics data given in Hewlett-Packard's HP-GL plotter language into a…☆18Feb 1, 2020Updated 6 years ago
- Pandoc document export plugin for Obsidian (https://obsidian.md)☆19Dec 27, 2022Updated 3 years ago
- Python wrapper for the CWB to extract concordances and score frequency lists☆22Jan 12, 2026Updated last month
- markupy - HTML in Python☆21Jun 2, 2025Updated 9 months ago
- A colorful introduction to some common functions in dplyr, part of the tidyverse.☆18Apr 13, 2022Updated 3 years ago
- Step-by-step guide for vectorizing/parallelizing your code☆20May 11, 2023Updated 2 years ago
- Quick and dirty .net console app for querying mssql servers.☆24Aug 30, 2018Updated 7 years ago
- Word2Vec in pure Python☆18Jun 13, 2018Updated 7 years ago
- A containerized all-in-one solution for CQPWeb☆18Jan 22, 2023Updated 3 years ago
- Repository for CRAN package BatchGetSymbols☆18Feb 2, 2026Updated last month
- Transliterate español (spanish) spelling to andaluz proposals using python☆27Jan 13, 2026Updated last month
- This is a simple svg isometric city animation with GSAP☆21Feb 13, 2018Updated 8 years ago
- Bulk modify Splunk Knowledge Object's owners, permissions, apps, sharing and move them to another app☆26Aug 27, 2022Updated 3 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆24Oct 26, 2022Updated 3 years ago
- ☆30May 14, 2025Updated 9 months ago