☆49Jul 4, 2024Updated last year
Alternatives and similar repositories for pdf_paragraphs_extraction
Users that are interested in pdf_paragraphs_extraction are comparing it to the libraries listed below
Sorting:
- ☆64Apr 9, 2024Updated last year
- Multiple GPT agents to have brainstorms and make decisions.☆20Nov 9, 2023Updated 2 years ago
- ☆40Jun 15, 2024Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Jul 20, 2023Updated 2 years ago
- Code and Dataset for our paper: Layout-Aware Single-Image Document Flattening☆23Dec 16, 2024Updated last year
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Jan 9, 2024Updated 2 years ago
- TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)☆31Mar 13, 2024Updated last year
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆29Apr 16, 2023Updated 2 years ago
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆81Oct 14, 2023Updated 2 years ago
- A low cost Arduino based model rocket altimeter that logs data & displays them via a Processing Sketch☆14Jun 24, 2018Updated 7 years ago
- Repository of IPBench☆19Jan 4, 2026Updated 2 months ago
- European Parliament website Python scraper☆12Oct 19, 2016Updated 9 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 3 months ago
- Etsy Python Support for v3+ API levels.☆10Updated this week
- Overview☆11Mar 26, 2021Updated 4 years ago
- ☆12May 15, 2024Updated last year
- A large scale camera-taken table detection and recognition dataset.☆149Jul 21, 2025Updated 7 months ago
- ☆42Feb 7, 2023Updated 3 years ago
- A simple FastAPI integration to protect documentation endpoints with HTTP Basic Authentication.☆13Aug 17, 2025Updated 6 months ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Apr 3, 2024Updated last year
- 浙江大学NLP资源交流共享计划☆33Mar 12, 2022Updated 3 years ago
- An autonomous service implementing a decentralized Impact Evaluator☆13Dec 1, 2025Updated 3 months ago
- This is a very fast parsing script for downloaded TV shows and movies. It will use scene-standard naming conventions (and a lot of nonsta…☆16Oct 30, 2017Updated 8 years ago
- ☆10Aug 30, 2025Updated 6 months ago
- ☆22Dec 23, 2025Updated 2 months ago
- Unleash the fuzz on your C codebase.☆12Updated this week
- This repository is the implementation of our paper: Local Correntropy Matrix Representation for Hyperspectral Image Classification, which…☆10Apr 21, 2022Updated 3 years ago
- Python x ChatGPT script. Generates random Discord Nitro codes and test their validity by sending requests to the Discord server.☆10Mar 19, 2024Updated last year
- Guides to hopefully simplify the process of using ROCm.☆12Sep 26, 2024Updated last year
- 生成训练文本检测数据集☆12Jul 1, 2020Updated 5 years ago
- Abusing Certificate Transparency logs for getting HTTPS websites subdomains.☆11Mar 2, 2019Updated 7 years ago
- Vin Carnival is a non-profit open source virtual reality game made with Unity 3D game engine and GoogleVR.☆10Oct 3, 2018Updated 7 years ago
- UniTable: Towards a Unified Table Foundation Model☆525Jun 4, 2024Updated last year
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆41Apr 7, 2025Updated 11 months ago
- ICDAR 2024 Table OCR Model☆39Feb 25, 2026Updated last week
- convert waymo dataset to rosbag1☆11Dec 20, 2021Updated 4 years ago
- This is a ROS package to connect Waymo open dataset to ROS☆10Jul 5, 2020Updated 5 years ago
- ☆13May 17, 2025Updated 9 months ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago