Document Filters is an SDK for applications like content indexing, e-discovery, data migration, and feeding data into AI/ML models by extracting data from unstructured sources. It gives the ability to perform deep inspection, data extraction, output manipulation, and conversion for virtually any type of document, in any programming language.
☆25Feb 18, 2026Updated last month
Alternatives and similar repositories for DocumentFilters
Users that are interested in DocumentFilters are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Experimental command suggestion system based on historical usage of commands in certain locations.☆12Feb 18, 2026Updated last month
- ☆13Jan 19, 2026Updated 2 months ago
- micro plugin to open files via fzf☆17Mar 9, 2019Updated 7 years ago
- Custom zsh plugin to create custom plugins☆13May 27, 2021Updated 4 years ago
- A tool for exporting weight data from EufyLife application☆10Nov 16, 2020Updated 5 years ago
- Imports highlights from Shortform.com to Readwise.io.☆17Oct 10, 2021Updated 4 years ago
- Use powerful industry-standard tools to unlock new, actionable insights from your data☆17Oct 25, 2018Updated 7 years ago
- fzf functions described in https://seb.jambor.dev/posts/improving-shell-workflows-with-fzf/☆18Apr 2, 2021Updated 4 years ago
- portable configuration files for unix environment☆17Updated this week
- 💡 ripgrep-powered zsh plugin alias reminder☆24Apr 14, 2019Updated 6 years ago
- Artwork Linux☆17Sep 15, 2020Updated 5 years ago
- some utilities for helping dendron work better with pandoc☆18Jan 10, 2022Updated 4 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- zfzf is a fzf-based file picker for zsh which allows you to quickly navigate the directory hierarchy☆26Mar 5, 2022Updated 4 years ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning☆13Aug 17, 2023Updated 2 years ago
- Custom shell (sh, bash, zsh) plugins☆30Apr 3, 2025Updated 11 months ago
- generate clean readable PDFs from web-articles☆31Jun 9, 2023Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Keep your dotfiles in sync using Git, a plugin for Oh My Zsh☆27Nov 25, 2019Updated 6 years ago
- Obsidian Plugin that adds the the markdown title within your notes to the file explorer☆31Feb 28, 2023Updated 3 years ago
- Export/access your Hypothes.is data: annotations and profile info☆46Jul 15, 2025Updated 8 months ago
- Containerfile for the Vanilla OS Desktop+Nvidia image.☆16Mar 13, 2026Updated last week
- Run greatexpectations.io on ANY SQL Engine using REST API. Supported by FastAPI, Pydantic and SQLAlchemy as best data quality tool☆14Dec 12, 2025Updated 3 months ago
- Runnable examples for Typed Clojure paper☆11Jul 2, 2015Updated 10 years ago
- Via Text Density Simple Web Crawler With Go☆13Mar 19, 2023Updated 3 years ago
- Particle Syntax Website☆16Sep 16, 2024Updated last year
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs☆11Jan 13, 2023Updated 3 years ago
- A Julia library for working with Data Package.☆12Aug 10, 2021Updated 4 years ago
- Dataset from Tip of the Tongue Known-Item Retrieval (2021) paper.☆12Nov 4, 2021Updated 4 years ago
- prevent XSS attacks by sanitizing html (this is different then escaping!)☆22Oct 14, 2023Updated 2 years ago
- Code that drives the public web-based tools for the Media Cloud Online News Archive and Directory.☆11Mar 13, 2026Updated last week
- An Obsidian extension that adds extra features for note links, statistics, and randomizers☆35Nov 7, 2022Updated 3 years ago
- Scala implementations of standard algorithms for Multi-Armed Bandits Problem.☆12May 7, 2016Updated 9 years ago
- Encryption which converts English characters to unicode characters that mimicking their appearance☆12Sep 17, 2017Updated 8 years ago
- How to backdoor Diffie-Hellman, lessons learned from the Socat non-prime prime☆11Jun 29, 2021Updated 4 years ago
- Temporal and Causal Reasoning (dataset)☆10Apr 19, 2022Updated 3 years ago
- The inverted index exchange format as defined as part of the Open-Source IR Replicability Challenge (OSIRRC) initiative☆11Aug 6, 2025Updated 7 months ago
- LLM Oracle is a GPT-4 powered tool for predicting future events. It's like a Magic 8 Ball that is able to perform basic research, calcula…☆19May 27, 2023Updated 2 years ago
- Security research organization dedicated to finding low hanging, critical, vulnerabilities.☆15May 12, 2022Updated 3 years ago