conjuncts / gmft
Lightweight, performant, deep table extraction
☆256Updated this week
Related projects:
- ☆150Updated last month
- 🥚 Transform PDF to JSON or Markdown with ease and speed 🐣☆441Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆497Updated 3 weeks ago
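- For document images, combines the results of layout analysis, text recognition, table recognition, and formula recognition to restore the page layout.☆118Updated last week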
- ☆236Updated 2 months ago
- Your first AI prompt engineer☆308Updated this week
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆219Updated 2 weeks ago
- The simplest open-source implementation of perplexity.ai☆243Updated 2 weeks ago
- ☆201Updated 2 months ago
- Incremental Knowledge Graphs Constructor Using Large Language Models☆310Updated this week
- TF-ID: Table/Figure IDentifier for academic papers☆206Updated 2 months ago
- Yet another open source Perplexity☆335Updated last month
- ☆268Updated 3 months ago
- A Python wrapper for the Doc2X API that comes with native PDF processing (to improve PDF recall in RAG). | Python wrapper for the Doc2X API, also bundled with local PDF processing…☆174Updated last week
- Structured information extraction from documents☆187Updated this week
- Analysis of Chinese and English layouts☆94Updated 2 months ago
- Extract structured text from pdfs quickly☆292Updated 3 weeks ago
- ☆246Updated this week
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆39Updated last month
- RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF☆282Updated this week
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆227Updated this week
- ☆87Updated last month
- ☆28Updated 2 months ago
- Full toolkit for running an AI agent service built with LangGraph, FastAPI and Streamlit☆238Updated 2 weeks ago
- High-performance retrieval engine for unstructured data☆778Updated this week
- ☆160Updated 2 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.☆199Updated 2 weeks ago
- Microsoft's GraphRAG + AutoGen + Ollama + Chainlit = Fully Local & Free Multi-Agent RAG Superbot☆400Updated 2 months ago
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic…☆100Updated this week
- ☆389Updated 4 months ago