opengovsg / pdf2mdLinks
A PDF to Markdown converter
☆409Updated 7 months ago
Alternatives and similar repositories for pdf2md
Users that are interested in pdf2md are comparing it to the libraries listed below
Sorting:
- SemanticFinder - frontend-only live semantic search with transformers.js☆295Updated 5 months ago
- Web service for web page to Markdown conversion☆249Updated 6 months ago
- 🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows☆111Updated last month
- Extract structured text from pdfs quickly☆589Updated 2 months ago
- Convert Word documents to beautiful Markdown. Via command line or in your browser.☆129Updated last year
- A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG☆400Updated last year
- A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenA…☆216Updated last year
- JavaScript implementation of LiteLLM.☆136Updated 5 months ago
- Browser based tool to convert PDFs to Markdown☆219Updated 2 months ago
- A simple vector database built on idb☆99Updated last year
- Vectra is a local vector database for Node.js with features similar to pinecone but built using local files.☆516Updated 3 months ago
- Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses O…☆234Updated 8 months ago
- Making openapi spec swagger documents friendly for GPT and other LLMs.☆65Updated 2 years ago
- Local semantic search. Stupidly simple.☆435Updated last year
- Fully typed & consistent chat APIs for OpenAI, Anthropic, Groq, and Azure's chat models for browser, edge, and node environments.☆169Updated last year
- HTML to Markdown converter and crawler.☆588Updated last year
- Simple tool for converting PDF to text using OCR☆94Updated 2 years ago
- A TypeScript framework for building MCP servers elegantly☆182Updated 4 months ago
- Integrate 200+ LLMs with one TypeScript SDK using OpenAI's format.☆288Updated 4 months ago
- Convert any PDF into a podcast episode!☆796Updated 5 months ago
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion.☆71Updated last year
- Demo rendering rich responses from LLMs☆159Updated last year
- A user-friendly, feature-rich UI enhancing interaction with Anthropic's Claude AI, enabling model selection, chat saving, and improved pr…☆118Updated 2 years ago
- LLM Based OCR and Document Parsing for Node.js☆104Updated 11 months ago
- This project aims to extract text from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging t…☆34Updated 7 months ago
- ☆107Updated last year
- Like Claude Artifacts but lives in a single static HTML page which you can use with any language model of your choosing☆209Updated 6 months ago
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆192Updated last month
- Library to generate vector embeddings in NodeJS☆140Updated 4 months ago
- ☆42Updated 6 months ago