opengovsg / pdf2md
A PDF to Markdown converter
☆464 Updated last week
Alternatives and similar repositories for pdf2md
Users who are interested in pdf2md are comparing it to the libraries listed below.
- Convert Word documents to beautiful Markdown. Via command line or in your browser. ☆178 Updated 2 months ago
- SemanticFinder - frontend-only live semantic search with transformers.js ☆320 Updated 10 months ago
- Web service for web page to Markdown conversion ☆300 Updated last week
- Vectra is a local vector database for Node.js with features similar to pinecone but built using local files. ☆571 Updated last week
- 🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows ☆131 Updated 3 months ago
- Demo rendering rich responses from LLMs ☆162 Updated 2 years ago
- Browser based tool to convert PDFs to Markdown ☆294 Updated last month
- Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses O… ☆242 Updated last year
- LLM Based OCR and Document Parsing for Node.js ☆108 Updated last year
- JavaScript implementation of LiteLLM. ☆145 Updated 10 months ago
- Extract structured text from pdfs quickly ☆656 Updated 7 months ago
- Integrate 200+ LLMs with one TypeScript SDK using OpenAI's format. ☆302 Updated 9 months ago
- A template based pptx generator for Node.js ☆151 Updated last month
- Simple tool for converting PDF to text using OCR ☆98 Updated 2 years ago
- A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenA… ☆225 Updated last year
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image … ☆201 Updated 2 weeks ago
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly ☆268 Updated 8 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3! ☆333 Updated last year
- This library exposes PAPA, your Personal Assistant powered by Private AI, which can be used in any browser environment and completely off… ☆55 Updated last month
- Fully typed & consistent chat APIs for OpenAI, Anthropic, Groq, and Azure's chat models for browser, edge, and node environments. ☆171 Updated last year
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based ☆328 Updated 2 years ago
- An open-source, AI-powered text editor inspired by medium.com. ☆175 Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation ☆121 Updated 2 years ago
- A TypeScript framework for building MCP servers elegantly ☆184 Updated 9 months ago
- A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG ☆424 Updated last year
- 📄 Set of modern React components for PDF highlighting ☆107 Updated last year
- A user-friendly, feature-rich UI enhancing interaction with Anthropic's Claude AI, enabling model selection, chat saving, and improved pr… ☆122 Updated 2 years ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection ☆116 Updated last year
- An open-source VSCode extension, the AI coding assistant, integrates with Ollama, HuggingFace, OpenAI, and Anthropic. ☆266 Updated 6 months ago
- Promptrix is a prompt layout engine for Large Language Models. ☆81 Updated last year