floriancochard / extract-data-from-paper
A tool designed to extract numerical data from scanned historical weather documents.
☆13Updated 6 months ago
Alternatives and similar repositories for extract-data-from-paper
Users who are interested in extract-data-from-paper are comparing it to the libraries listed below.
- Nougat is Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.☆23Updated last year
- An intelligent OCR tool that detects tables and plain text inside PDFs and produces a CSV file and a TXT file from them☆14Updated 6 years ago
- A Python notebook that walks you through how to transcribe audio files into text using the Deepgram API.☆10Updated last year
- Streamlit application to keep GPT3 Experimentation sane☆23Updated 3 years ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- Scripts to load the GDELT data set into MongoDB☆12Updated 2 years ago
- Yet another tool to search through your (exported) ChatGPT conversations☆12Updated 8 months ago
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆17Updated last month
- A swarm of LLM agents that will help you test, document, and productionize your code!☆17Updated this week
- AI_Powered_Dev_Search_Engine☆12Updated last year
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆18Updated last year
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆22Updated 7 months ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 6 years ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆9Updated 6 months ago
- Visual Embeddings with OpenAI and Nomic☆12Updated last year
- Machine Learning-assisted correction of OCR errors in historical corpora☆9Updated 8 months ago
- Transforming textual descriptions into process models using deep learning☆14Updated 6 years ago
- NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to …☆35Updated 3 years ago
- Telegram > OpenAI > Read Later [instapaper/pocket/omnivore]☆17Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated this week
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated last year
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆23Updated 4 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated last month
- Reproducing "Writing with Transformer" demo, using aitextgen/FastAPI in backend, Quill/React in frontend☆28Updated 4 years ago
- ☆11Updated 2 years ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Updated last year
- A computer vision project (image segmentation) that aims to remove text from images using a U-Net model. TensorFlow 2 is used as a M…☆14Updated 2 years ago
- Scripts for reading, extracting, and organizing data from either HTML or PDF documents and prepare them to be converted into embeddings f…☆13Updated 10 months ago
- Convert any image into a Region Adjacency Graph (RAG)☆12Updated 5 years ago