A library to extract the main content from HTML. Developed to supply information to LLMs and to feed data into LangChain and LlamaIndex.
☆51May 16, 2024Updated 2 years ago
Alternatives and similar repositories for main_content_extractor
Users that are interested in main_content_extractor are comparing it to the libraries listed below.
- A zero-shot captcha solver.☆16Dec 22, 2023Updated 2 years ago
- Spell check for Arabic text using Python☆14Mar 22, 2019Updated 7 years ago
- Minimal Chatbot based on Vercel AI Chatbot☆36Jan 8, 2026Updated 5 months ago
- Scripts to finetune the official implementation of OpenAI's Whisper model☆25Apr 14, 2026Updated 2 months ago
- About MIMBCD-UI Project☆14Dec 4, 2025Updated 6 months ago
- MCP wrapper for OpenAI built-in tools☆12Mar 13, 2025Updated last year
- ☆18Apr 6, 2026Updated 2 months ago
- Transformer OCR is an Optical Character Recognition toolkit built for researchers working on OCR for both Vietnamese and English. This…☆10Dec 27, 2021Updated 4 years ago
- Integrate with OpenAI API Codex to shape and evolve code through generative iterations.☆12Mar 31, 2026Updated 2 months ago
- AI trading agent using the Interactive Brokers API☆99Feb 17, 2025Updated last year
- ☆20Sep 13, 2024Updated last year
- Reverse engineering OpenAI plugins through system messages☆17May 12, 2023Updated 3 years ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- A Soul-grounded Minecraft social simulation runtime where Mineflayer actors pursue LifeGoals through evidence-backed action skills and tr…☆23Updated this week
- Generate your own artistic QR code in 5 mins!☆28Nov 14, 2023Updated 2 years ago
- A Model Context Protocol (MCP) server for YouTube clip information extraction.☆19Dec 16, 2025Updated 5 months ago
- ☆10Jun 6, 2024Updated 2 years ago
- SQL parser and converter☆11Jan 5, 2024Updated 2 years ago
- Parsing, processing, and translation of PostgreSQL, MySQL and ADQL queries☆15Aug 18, 2025Updated 9 months ago
- Emoji embeddings trained using their emotional content from their online dictionary meanings.☆18Jan 10, 2022Updated 4 years ago
- 🤖 Auto Content Generator: A .NET 8 API that leverages OpenAI's GPT-4o to automatically generate markdown formatted blog posts, commit th…☆36Jun 3, 2024Updated 2 years ago
- Finish Vue.JS over a cup of coffee☆11Dec 10, 2022Updated 3 years ago
- A curated list of resources dedicated to NLP (papers, blogs, notes, etc.)☆13Nov 30, 2019Updated 6 years ago
- Python script designed to simplify the process of submitting URLs to Google's Indexing API for faster and more efficient website indexing…☆12Sep 12, 2023Updated 2 years ago
- The OWCA dataset is a Polish-translated dataset of instructions for fine-tuning the Alpaca model made by Stanford.☆21May 18, 2023Updated 3 years ago
- An API that allows you to scrape blog posts and articles and get a list of notes or a summary back.☆10Mar 31, 2023Updated 3 years ago
- Original schema.org python-appengine codebase☆19Apr 10, 2022Updated 4 years ago
- ChatGPT-Messenger-Clone is designed by using Next.js 13 with Tailwind CSS. Firebase has been used for Google Authentication and Cloud Fir…☆18Mar 30, 2023Updated 3 years ago
- Generate beautiful, functional UI/UX designs with the power of AI ✨☆18Mar 5, 2025Updated last year
- Delete your PDF is a set of tools to export information from your PDFs so you can delete them.☆13Sep 11, 2024Updated last year
- Handling PWA installation prompt made easier.☆13Feb 12, 2026Updated 4 months ago
- Medical natural language parsing and utility library☆14Dec 10, 2025Updated 6 months ago
- Python libraries for extracting from data sources like Rechtspraak, ECHR, Cellar☆13Jul 2, 2025Updated 11 months ago
- Python library to work with proxy server items loaded from local file or network document.☆18Dec 21, 2022Updated 3 years ago
- agents.md guides agents. codingagents.md helps humans pick the right one.☆40Feb 17, 2026Updated 3 months ago
- This is the ultimate web scraping tool for extracting the most relevant data points from products on Walmart.com! This powerful scraper i…☆22Mar 6, 2023Updated 3 years ago
- Easily trim 'messages' arrays for use with GPTs☆74Dec 19, 2023Updated 2 years ago
- Legal Matter Standard Specification (LMSS) library for Python☆17Nov 14, 2023Updated 2 years ago
- ⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.☆28Mar 19, 2026Updated 2 months ago