inuwamobarak / nougat
Nougat is Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.
☆22Updated last year
Alternatives and similar repositories for nougat
Users who are interested in nougat are comparing it to the libraries listed below.
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated 7 months ago
- Demos of ChatGPT's function calling/structured data support.☆24Updated last year
- A streaming Markdown component for Streamlit with LaTeX, Mermaid, table, and code support. A drop-in replacement for st.markdown.☆18Updated 3 months ago
- Embedding models from Jina AI☆60Updated last year
- AI_Powered_Dev_Search_Engine☆12Updated last year
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆50Updated 2 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated 3 weeks ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Updated last year
- ☆32Updated last year
- Takes plain text as input and generates SQL commands using OpenAI's GPT-3☆15Updated 4 years ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆25Updated last year
- ☆18Updated 3 months ago
- Multimodal LLM Application with PyMuPDF4LLM☆36Updated 7 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆29Updated last year
- ☆20Updated last year
- ☆25Updated 3 months ago
- Solve Geometric & Graph Problems with Large Language Models☆29Updated 2 years ago
- Chrome Extension for YouTube. Acts as an assistant for the YouTube video you are watching☆23Updated 2 years ago
- ☆41Updated 11 months ago
- ☆22Updated last month
- The official repository for the Anything But Wrappers: Llama Edition Hackameetup☆22Updated last year
- A function to do all☆36Updated last year
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆38Updated last year
- Exploration: using technology to aid people who lack both the ability to speak and fine motor control.☆20Updated 6 months ago
- ☆48Updated last year
- Small Python package to measure OCR quality and other related metrics.☆21Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- ☆49Updated 10 months ago
- Repository for deepdoctection tutorial notebooks☆45Updated 5 months ago