yannbanas / mrkdwn_analysis
mrkdwn_analysis is a Python library for analyzing Markdown files. It extracts and categorizes Markdown elements like headers, sections, links, images, etc. Ideal for data analysis, content generation, and tool-building that requires Markdown parsing.
ā36Updated last month
Alternatives and similar repositories for mrkdwn_analysis
Users that are interested in mrkdwn_analysis are comparing it to the libraries listed below
Sorting:
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.ā51Updated 7 months ago
- Parallel and LAzY Analyzer for PDFs šļøā27Updated this week
- Clean, filter and sample URLs to optimize data collection ā Python & command-line ā Deduplication, spam, content and language filtersā137Updated 4 months ago
- Python library for converting JSON Schemas to Pydantic modelsā15Updated last year
- Hybrid Search (BM25 & Vector) with SQLiteā15Updated 8 months ago
- POC Port of the openai-realtime-console to streamlit.ā47Updated 7 months ago
- A Python library to extract tabular data from PDFsā66Updated last month
- Datasette plugin adding a llm_embed(model_id, text) SQL functionā14Updated last year
- A Python library to chunk/group your texts based on semantic similarity.ā97Updated 10 months ago
- Streamlit PDF viewerā145Updated last week
- A GPT powered CLI tool that answers questions about your dataā99Updated 2 years ago
- A Python client for the Unstructured Platform APIā101Updated this week
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchā24Updated last year
- Python bindings to PDFiumā568Updated this week
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. Iā¦ā92Updated 2 weeks ago
- LLM plugin for embeddings using sentence-transformersā60Updated 2 weeks ago
- Simple package to extract text with coordinates from programmatic PDFsā121Updated last month
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.ā300Updated last month
- Easily deploy Haystack pipelines as REST APIs and MCP Tools.ā81Updated this week
- Tools for building SQLite databases from files and directoriesā12Updated last year
- 𦦠weasel: A small and easy workflow systemā83Updated 10 months ago
- Build reliable, secure, and production-ready AI apps easily.ā72Updated this week
- Python library for the instruction and reliable validation of structured outputs (JSON) of Large Language Models (LLMs) with Ollama and Pā¦ā76Updated 4 months ago
- Repository for deepdoctection tutorial notebooksā44Updated 5 months ago
- Pipeline for converting PDFs to raw text with PaddleOCRā23Updated last year
- A cookiecutter template for building plugins for LLMā24Updated last month
- Run embedding models using ONNXā32Updated last year
- ā113Updated 2 weeks ago
- Markdown to pdf rendererā84Updated 3 weeks ago
- š¬ minimalistic ChatBot Interface in pure pythonā222Updated 9 months ago