opengovsg / pdf2mdLinks
A PDF to Markdown converter
☆315Updated 4 months ago
Alternatives and similar repositories for pdf2md
Users that are interested in pdf2md are comparing it to the libraries listed below
Sorting:
- Vectra is a local vector database for Node.js with features similar to pinecone but built using local files.☆484Updated 3 weeks ago
- Web service for web page to Markdown conversion☆214Updated 3 months ago
- A simple vector database built on idb☆86Updated last year
- SemanticFinder - frontend-only live semantic search with transformers.js☆273Updated 2 months ago
- HTML to Markdown converter and crawler.☆557Updated last year
- Demo rendering rich responses from LLMs☆156Updated last year
- Extract Markdown + Images from PDF☆45Updated 5 months ago
- A Node.js library to parse text out of any office file. Currently supports docx, pptx, xlsx and odt, odp, ods..☆204Updated 6 months ago
- A PDF to Markdown converter☆1,366Updated 11 months ago
- pdf2html is a module which helps to convert PDF file to HTML pages using Apache Tika. This module also helps to generate thumbnail image …☆171Updated this week
- 🕸️🦀 A WASM vector similarity search written in Rust☆967Updated last year
- Extract structured text from pdfs quickly☆485Updated last week
- A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenA…☆203Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆137Updated 10 months ago
- LLM Based OCR and Document Parsing for Node.js☆103Updated 9 months ago
- 🍱 semantic-chunking ⇢ semantically create chunks from large document for passing to LLM workflows☆96Updated 3 months ago
- A JavaScript library that brings vector search and RAG to your browser!☆121Updated 9 months ago
- An open-source, AI-powered text editor inspired by medium.com.☆160Updated 2 years ago
- Library to generate vector embeddings in NodeJS☆121Updated last month
- JavaScript implementation of LiteLLM.☆126Updated 2 months ago
- 🦜️🔗 This is a very simple re-implementation of LangChain, in ~100 lines of code☆253Updated last year
- Convert Word documents to beautiful Markdown. Via command line or in your browser.☆105Updated 9 months ago
- gpt + aria = ability to read browser contents☆70Updated last year
- Talk to your Obsidian notes!☆123Updated 7 months ago
- Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses O…☆226Updated 5 months ago
- ☆151Updated last year
- Node.js module for rendering pdf pages to images, svgs, html files, text files and json metadata☆99Updated 2 years ago
- Python bindings to PDFium☆582Updated this week
- ☆113Updated 11 months ago
- HTML to Markdown converter☆255Updated 3 months ago