floriancochard / extract-data-from-paper
A tool designed to extract numerical data from scanned historical weather documents.
☆13Updated 10 months ago
Alternatives and similar repositories for extract-data-from-paper
Users that are interested in extract-data-from-paper are comparing it to the libraries listed below
- Nougat is Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.☆25Updated 2 years ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- generate & query embeddings from VTT files using openai & pinecone on Andrej Karpathy's latest GPT tutorial☆19Updated 2 years ago
- Query, ask and chat with a document-index via transformer models!☆17Updated 2 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆72Updated this week
- Convert any image into a Region Adjacency Graph (RAG)☆12Updated 5 years ago
- Visual Embeddings with OpenAI and Nomic☆13Updated 2 years ago
- LLMs sitting on a council together to decide, by consensus, who among them is the best.☆20Updated 3 months ago
- Example Code to Supplement the Label Studio Blog☆28Updated 2 weeks ago
- A simple library for segmenting legal texts☆17Updated 2 years ago
- Web App Capable of Predicting Next Word Using BERT☆14Updated 2 years ago
- Pandas-LLM☆46Updated 2 years ago
- Streamlit app for recommending eval functions using prompt diffs☆29Updated last year
- Reproducing "Writing with Transformer" demo, using aitextgen/FastAPI in backend, Quill/React in frontend☆27Updated 4 years ago
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆13Updated 4 years ago
- PDF text data extraction web app with OCR for scanned documents☆90Updated last year
- Evaluation framework for document processing models and services.☆46Updated this week
- Search PDFs using Jina, DocArray and Jina Hub☆56Updated 3 years ago
- Phi-2 Fine Tuning to build a mental health GPT.☆11Updated last year
- A Python package to get useful information from documents using TopicRank Algorithm.☆16Updated 2 years ago
- Home to jupyter notebooks for Mindee OSS projects☆17Updated 3 months ago
- 🤖 Quantum-powered excuse generator for developers. Blame bugs on cosmic rays, AI sentience, or Schrödinger’s intern.☆26Updated last month
- Streamlit application to keep GPT3 Experimentation sane☆23Updated 4 years ago
- A chatbot made using the Chatterbot library in Python and locally hosted using Streamlit. The dataset used was collected during ConvAI2 comp…☆15Updated 4 years ago
- A visual tool to interpret and understand PyTorch machine learning models☆17Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated last week
- Chat with Your Data App using Langchain, ChromaDB, Sentence Transformers, and LaMiNi LM Model. This Chatbot is completely powered by Open…☆17Updated 2 years ago
- Logical structure analysis for visually structured documents☆92Updated 3 years ago
- ☆10Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 7 months ago