floriancochard / extract-data-from-paper
A tool designed to extract numerical data from scanned historical weather documents.
☆13Updated 4 months ago
Alternatives and similar repositories for extract-data-from-paper:
Users that are interested in extract-data-from-paper are comparing it to the libraries listed below
- An intelligent OCR to detect tables and pure text inside PDFs and obtaing a csv file and a txt from it☆14Updated 6 years ago
- Automated Document Intelligence Workflow☆20Updated 3 months ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- Machine Learning-assisted correction of OCR errors in historical corpora☆9Updated 5 months ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- Custom Named Entity Recognition annotated using NER Annotated by tecoholic and Spacy for training the model☆16Updated 4 years ago
- ☆11Updated last year
- AI_Powered_Dev_Search_Engine☆12Updated last year
- An ongoing series of notebooks aimed at helping fellow NLP enthusiasts think about applying new tools and techniques to practical tasks.☆18Updated 4 years ago
- A web app built with Streamlit that summarizes input text☆13Updated 4 years ago
- Reproducing "Writing with Transformer" demo, using aitextgen/FastAPI in backend, Quill/React in frontend☆28Updated 4 years ago
- Repository for deepdoctection tutorial notebooks☆43Updated 4 months ago
- Text classification automl☆21Updated 3 years ago
- Use Google's state-of-the-art T5 pre-train model to create human-like summarization☆25Updated 3 years ago
- Python wrapper for xpdf☆19Updated 5 years ago
- Nougat is a Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.☆22Updated last year
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Updated 5 years ago
- Automated Continuous Data Quality Measurement☆12Updated last year
- Median is an open-source flashcard application that leverages the power of spaced repetition and artificial intelligence to transform the…☆22Updated 5 months ago
- Document Search Engine Tool☆73Updated 2 years ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆15Updated this week
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆18Updated last year
- Unstract's interface to LLMs, Embeddings and VectorDBs.☆18Updated 8 months ago
- Logical structure analysis for visually structured documents☆88Updated 2 years ago
- 🖍️ Highlight text in documents☆105Updated 3 months ago
- ChatBot App built using LangChain and Lightning AI☆18Updated 2 years ago
- 🛤️ Pathik - High-Performance Web Crawler ⚡☆26Updated 2 weeks ago
- Open-source, knowledge-grounded conversational AI☆13Updated 4 months ago
- Tools for using OpenAI Codex to do various useful things☆48Updated 3 years ago