DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications
☆20Dec 9, 2022Updated 3 years ago
Alternatives and similar repositories for docai
Users that are interested in docai are comparing it to the libraries listed below
Sorting:
- ☆12Updated this week
- API client for fetching and comparing passages from legislation☆14Jan 26, 2025Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆13Jan 2, 2021Updated 5 years ago
- An autonomous LLM-based agent that generates code to extract structured information from web pages and extracts it.☆11Oct 30, 2024Updated last year
- UniLM - Unified Language Model Pre-training / Pre-training for NLP and Beyond☆11Mar 27, 2024Updated last year
- Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Jul 12, 2022Updated 3 years ago
- Python and JS tools to generate Printed LaTex formulas and images☆16Oct 26, 2023Updated 2 years ago
- Open Cap eXcel (OCX) - Convert Open Cap Format (OCF) packages into a standardized Excel format.☆38Apr 17, 2024Updated last year
- Download client for legal opinions☆13Jan 26, 2025Updated last year
- Utilities and applications for the FlatGov project by Demand Progress☆16Feb 8, 2023Updated 3 years ago
- A Python module to provide software abstractions to ease accessing hyperknowledge graphs☆11Dec 19, 2024Updated last year
- Google App Scripts that sends a number of emails from the specific number and that tracks the open status of each email☆17Dec 11, 2024Updated last year
- Themed, fully featured PDF viewer for the Atom editor☆12Jan 28, 2026Updated last month
- ☆15Jun 16, 2021Updated 4 years ago
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- A simple library for segmenting legal texts☆17Apr 22, 2023Updated 2 years ago
- ☆16Oct 20, 2025Updated 4 months ago
- This repository contains materials for the Open Legal Data Forum at the Legal Hacker 2019 (September 2019 + Brooklyn, NYC)☆17Dec 8, 2022Updated 3 years ago
- ☆18May 14, 2024Updated last year
- Kelvin Legal Data OS - Public Examples☆19Oct 30, 2023Updated 2 years ago
- scraping and querying documents for LLMs☆24Oct 6, 2025Updated 5 months ago
- NLP Web API for Legal Text☆18Dec 23, 2022Updated 3 years ago
- python module to manipulate text, strings and list of strings☆21May 10, 2022Updated 3 years ago
- Black Dashboard PRO - Premium Django Template | Creative-Tim☆23Apr 18, 2025Updated 10 months ago
- Client library for OpenOCR☆31Dec 3, 2014Updated 11 years ago
- Sample repository (to accompany my blog post) for putting machine learning code into production.☆30Oct 14, 2021Updated 4 years ago
- Generates the files needed for a production ready Django deployment in Docker. Custom user model, PostgreSQL database backend, uWSGI Pyth…☆24May 8, 2019Updated 6 years ago
- LibreOffice scripting using python☆22Apr 2, 2021Updated 4 years ago
- Tool for parsing and converting various span encoding schemes.☆23Jan 13, 2024Updated 2 years ago
- React component for Dagre-D3☆21Nov 13, 2018Updated 7 years ago
- Visual, page-by-page comparison of two PDF files☆21Apr 7, 2014Updated 11 years ago
- Word segmentation using Conditional Random Fields (CRF) for Khmer document☆32Oct 4, 2025Updated 5 months ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35May 24, 2024Updated last year
- Convert JSON Schemas to simple, human-readable Markdown documentation. Repo archived in favor of fork: sbrunner/jsonschema2md2☆27Jul 12, 2023Updated 2 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆33Jun 24, 2023Updated 2 years ago
- state-of-the-art gaze tracking model☆34Oct 15, 2021Updated 4 years ago
- A low-code microservices platform designed for legal engineers. Given a document, Gremlin will apply a series of Python scripts to it and…☆33May 25, 2022Updated 3 years ago
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Jul 22, 2023Updated 2 years ago
- The Big List of Protests - An AI-assisted Protest Flyer parser and event aggregator☆11Jan 24, 2026Updated last month