A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.
☆51May 16, 2024Updated last year
Alternatives and similar repositories for main_content_extractor
Users that are interested in main_content_extractor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A zero-shot captcha solver.☆16Dec 22, 2023Updated 2 years ago
- An AI-powered GitHub search tool utilising Generative UI☆14Jul 20, 2024Updated last year
- Dataset from Tip of the Tongue Known-Item Retrieval (2021) paper.☆12Nov 4, 2021Updated 4 years ago
- Clober Solidity Library☆10Jun 9, 2025Updated 10 months ago
- A list of delightful MINDSTORMS software and resources☆14Mar 10, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ai trading agent using interactive brokers api☆97Feb 17, 2025Updated last year
- project trying to replicate http://arxiv.org/pdf/1412.5567v2.pdf☆12Mar 22, 2015Updated 11 years ago
- A Simple example how to use FastAPI with Async SQLAlchemy 2.0☆20Apr 27, 2026Updated last week
- Generate standalone HTML from OpenAPI Specification.☆27Jul 13, 2025Updated 9 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- ☆18Apr 29, 2026Updated last week
- Generate your own artisitic Qr Code in 5 mins!☆25Nov 14, 2023Updated 2 years ago
- Simple setup for personal dotfiles☆11Mar 29, 2026Updated last month
- Scrape Airbnb, Booking, Hotels.com from a single JavaScript module. ❗No longer maintained.☆18Apr 18, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Finetuning Whisper ASR model for Belarusian language☆17Feb 16, 2025Updated last year
- This very simple python script takes inputs from your business and outputs articles written bhy claude.☆13Apr 3, 2024Updated 2 years ago
- Python Script for Copywriters to Gather Data from Competing Content and Find Keyword Overlap☆15Apr 23, 2022Updated 4 years ago
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning☆10Sep 19, 2020Updated 5 years ago
- An accurate, extensible, and fast HTML-to-markdown converter.☆23Feb 7, 2026Updated 2 months ago
- A financial disclosure data extraction tool.☆21Aug 2, 2023Updated 2 years ago
- Progressive type annotation without regression! 🚀☆31Oct 16, 2022Updated 3 years ago
- This repo includes Midjourney prompt curation to use Midjourney better.☆13Feb 22, 2023Updated 3 years ago
- The SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by a LLM to guide the agent's …☆22Sep 18, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- An API that allows you to scrape blog posts and articles and get a list of notes or a summary back.☆10Mar 31, 2023Updated 3 years ago
- Original schema.org python-appengine codebase☆19Apr 10, 2022Updated 4 years ago
- ☆10Jan 5, 2024Updated 2 years ago
- Delete your PDF is a set of tools to export information from your PDFs so you can delete them.☆13Sep 11, 2024Updated last year
- A ChatGPT web client that supports multiple users, multiple languages, and multiple database connections for persistent data storage. Pro…☆13May 19, 2023Updated 2 years ago
- SDK to access ZenRows API directly from Python. We handle proxies rotation, headless browsers and CAPTCHAs for you.☆18Jan 22, 2026Updated 3 months ago
- ☆11Feb 24, 2019Updated 7 years ago
- Python library to work with proxy server items loaded from local file or network document.☆18Dec 21, 2022Updated 3 years ago
- agents.md guides agents. codingagents.md helps humans pick the right one.☆37Feb 17, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A website showing several companies' stocks and their market sentiments using Yahooquery and Marketaux API.☆13Jan 18, 2024Updated 2 years ago
- Easily trim 'messages' arrays for use with GPTs☆74Dec 19, 2023Updated 2 years ago
- Legal Matter Standard Specification (LMSS) library for Python☆17Nov 14, 2023Updated 2 years ago
- "밑바닥부터 시작하는 데이터 사 이언스" 예시 코드☆14Feb 5, 2020Updated 6 years ago
- A collection of Ollama model deployments on Google Cloud Run☆28Jun 27, 2024Updated last year
- Tool for decrypting files encrypted by the SynoLocker ransomware☆15Aug 22, 2014Updated 11 years ago
- ☆27Aug 16, 2025Updated 8 months ago