opengovsg / pdf2mdLinks
A PDF to Markdown converter
β458Updated 2 months ago
Alternatives and similar repositories for pdf2md
Users that are interested in pdf2md are comparing it to the libraries listed below
Sorting:
- π± semantic-chunking β’ semantically create chunks from large document for passing to LLM workflowsβ128Updated 2 months ago
- SemanticFinder - frontend-only live semantic search with transformers.jsβ317Updated 9 months ago
- Web service for web page to Markdown conversionβ295Updated 2 months ago
- Browser based tool to convert PDFs to Markdownβ269Updated 2 weeks ago
- Extract structured text from pdfs quicklyβ648Updated 7 months ago
- A PDF to Markdown converterβ1,515Updated last year
- JavaScript implementation of LiteLLM.β145Updated 9 months ago
- A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenAβ¦β224Updated last year
- LLM Based OCR and Document Parsing for Node.jsβ107Updated last year
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssemblyβ261Updated 8 months ago
- Vectra is a local vector database for Node.js with features similar to pinecone but built using local files.β556Updated 7 months ago
- Promptrix is a prompt layout engine for Large Language Models.β81Updated last year
- HTML to Markdown converter and crawler.β608Updated 2 years ago
- Using GPT-4 Vision and GPT-4 Turbo, take a PDF as input and get a markdown file as output.β98Updated 11 months ago
- Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses Oβ¦β242Updated last year
- Integrate 200+ LLMs with one TypeScript SDK using OpenAI's format.β300Updated 8 months ago
- remark plugin to compile markdown to docx (Microsoft Word, Office Open XML).β109Updated this week
- A TypeScript framework for building MCP servers elegantlyβ183Updated 8 months ago
- Convert Word documents to beautiful Markdown. Via command line or in your browser.β174Updated last month
- Simple package to extract text with coordinates from programmatic PDFsβ226Updated last month
- A template based pptx generator for Node.jsβ148Updated 3 weeks ago
- Fully typed & consistent chat APIs for OpenAI, Anthropic, Groq, and Azure's chat models for browser, edge, and node environments.β169Updated last year
- UI components for your LLM applicationβ99Updated last year
- Parse PDFs into markdown using Vision LLMsβ455Updated 3 months ago
- Simple tool for converting PDF to text using OCRβ98Updated 2 years ago
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx and odt, odp, ods..β259Updated last week
- Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extrβ¦β270Updated last month
- Export any Kindle book you own as text, PDF, EPUB, or as a custom, AI-narrated audiobook. π₯β212Updated last month
- Convert any PDF into a podcast episode!β816Updated 9 months ago
- OCR Benchmarkβ604Updated 2 months ago