This experimental tool leverages Google's Gemini 2.5 Flash Preview model to parse complex tables from PDF documents and convert them into clean HTML that preserves the exact layout, structure, and data.
☆14 · May 16, 2025 · Updated 9 months ago
Alternatives and similar repositories for google-gemini-pdf-table-extractor
Users who are interested in google-gemini-pdf-table-extractor are comparing it to the repositories listed below
- economic information you should know ☆29 · Feb 23, 2016 · Updated 10 years ago
- Deduplicates property owners in Massachusetts using the MassGIS standardized assessors' parcel dataset and the OpenCorporates Bulk Data p… ☆13 · Jan 26, 2026 · Updated last month
- Fundamental Accounting Concept Relations validation for International Financial Reporting Standards (IFRS). ☆14 · Sep 20, 2018 · Updated 7 years ago
- Source Code for 'Implementing Machine Learning for Finance' by Tshepo Chris Nokeri ☆34 · May 28, 2021 · Updated 4 years ago
- Python interface to the FDIC's API for publicly available bank data ☆12 · Apr 15, 2023 · Updated 2 years ago
- Search Volume for amazon completion service ☆13 · Feb 5, 2019 · Updated 7 years ago
- ☆12 · Mar 1, 2024 · Updated last year
- ☆19 · Feb 21, 2026 · Updated last week
- A simple Geocoding API Service built with FastAPI. ☆11 · Jan 14, 2021 · Updated 5 years ago
- Interview Cake Python Algorithms Set ☆10 · Mar 21, 2018 · Updated 7 years ago
- Agent-based model of the banking system (NetLogo) ☆11 · Apr 13, 2018 · Updated 7 years ago
- Single-Clich-Proxy-Chains ☆13 · Mar 29, 2021 · Updated 4 years ago
- Burp Suite extension designed to help security professionals search for custom sensitive information in HTTP responses ☆11 · Apr 25, 2023 · Updated 2 years ago
- wordpress xmlrpc + wp-login.php brute force ☆10 · Mar 13, 2021 · Updated 4 years ago
- ☆11 · May 28, 2024 · Updated last year
- Package to parse and analyze trademark data from the United States Patent and Trademark Office ☆14 · Apr 5, 2017 · Updated 8 years ago
- A modification of traditional random forest for time-series forecasting ☆12 · Apr 16, 2024 · Updated last year
- An autonomous LLM-based agent that generates code to extract structured information from web pages and extracts it. ☆11 · Oct 30, 2024 · Updated last year
- ☆13 · Jul 25, 2024 · Updated last year
- ☆11 · Apr 7, 2025 · Updated 10 months ago
- Cookiecutter template for MCP servers with one-click Render.com deployment - Generate production-ready API integration servers in minutes ☆19 · Jul 4, 2025 · Updated 7 months ago
- Wikipedia "people" Images Dataset Downloader ☆11 · Dec 3, 2023 · Updated 2 years ago
- Evaluation and benchmarking of PatentsView disambiguation algorithms ☆15 · Jan 18, 2024 · Updated 2 years ago
- A benchmark of globally-optimal anonymization methods for biomedical data ☆16 · Dec 11, 2014 · Updated 11 years ago
- k3s using metallb in BGP mode, with a service preserving source IP. ☆12 · Apr 9, 2019 · Updated 6 years ago
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI… ☆51 · Dec 30, 2024 · Updated last year
- Collection of business analytics case studies that leverage data science methods to create business value (R and Python) ☆12 · Jul 12, 2019 · Updated 6 years ago
- Tools and analytics for smart derivative contracts. ☆15 · Jan 21, 2026 · Updated last month
- This is a sentiment trading strategy, written in Python, and applying NLP on 10-K's from the SEC EDGAR database. ☆11 · Feb 21, 2022 · Updated 4 years ago
- The benchmark for LLMs designed to tackle one of the most knowledge-intensive tasks in data science: writing feature engineering code, wh… ☆19 · Oct 28, 2024 · Updated last year
- ☆14 · Jan 12, 2021 · Updated 5 years ago
- Learn how to use OpenML for reproducible, collaborative machine learning projects ☆13 · Aug 6, 2023 · Updated 2 years ago
- A US equities trading & settlement calendar command-line tool ☆12 · Mar 30, 2022 · Updated 3 years ago
- Probabilistic Entity Matching in Python ☆13 · Apr 5, 2017 · Updated 8 years ago
- har2pcap converts .har (HTTP Archive Viewer) files into the pcapng file format - which can be analyzed with Wireshark. ☆11 · Jun 19, 2016 · Updated 9 years ago
- Create your assistant in the OpenAI dashboard, generate an API key, and use this code to integrate it into Open WebUI. Remember that it i… ☆16 · Jan 20, 2025 · Updated last year
- ☆10 · Nov 16, 2021 · Updated 4 years ago
- A vulnerability detection and exploitation tool for CVE-2024-7339 ☆16 · Aug 10, 2024 · Updated last year
- Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with B… ☆50 · Oct 13, 2024 · Updated last year