inuwamobarak / nougatLinks
Nougat is a Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.
☆25Updated 2 years ago
Alternatives and similar repositories for nougat
Users that are interested in nougat are comparing it to the libraries listed below
Sorting:
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 7 months ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Updated last year
- Unstract's interface to LLMs, Embeddings and VectorDBs.☆18Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated last year
- Evaluation framework for document processing models and services.☆53Updated this week
- Embedding models from Jina AI☆65Updated last year
- ☆48Updated last year
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- 👩🏻🔬 ResearchGPT - OpenAI wrapper with document reading capabilities, made with Svelte and FastAPI. [NEEDS MAINTENANCE]☆16Updated last year
- Browser-based Voice Assistant☆44Updated 2 years ago
- ChatData 🔍 📖 brings RAG to real applications with FREE✨ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 milli…☆178Updated last year
- ☆20Updated last year
- create workflows with LLMs☆54Updated last year
- Demos of ChatGPT's function calling/structured data support.☆24Updated last year
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆74Updated this week
- AlphaXIV open-source alternative: Chat with any arXiv paper.☆88Updated 5 months ago
- Solve Geometric & Graph Problems with Large Language Models☆33Updated 2 years ago
- a streaming markdown component for streamlit with LaTeX, Mermaid, Table, code support. A drop-in replacement for st.markdown.☆26Updated 8 months ago
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated last year
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆14Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- hotpdf is a fast PDF parsing library to extract text and find text within PDF documents built on top of pdfminer.six☆196Updated 10 months ago
- Very minimal (and stateless) agent framework☆45Updated 9 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆91Updated 2 months ago
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.☆20Updated last year
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆47Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated last year