A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.
☆51May 16, 2024Updated 2 years ago
Alternatives and similar repositories for main_content_extractor
Users that are interested in main_content_extractor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-hosted, horizontally-scalable Playwright grid. Spin up as many browser workers as you need on your own infrastructure and access the…☆30Apr 6, 2026Updated last month
- Experiments w/ FastAPI and asyncio☆11Feb 8, 2023Updated 3 years ago
- An AI-powered GitHub search tool utilising Generative UI☆14Jul 20, 2024Updated last year
- Remove DIVs, style stuff and normalize HTML preserving structure information☆14Oct 24, 2025Updated 7 months ago
- ☆24Jan 22, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Minimal Chatbot based on Vercel AI Chatbot☆36Jan 8, 2026Updated 4 months ago
- Convert datasets from Hugging Face to FiftyOne for Visualization☆11Mar 15, 2024Updated 2 years ago
- Dataset from Tip of the Tongue Known-Item Retrieval (2021) paper.☆12Nov 4, 2021Updated 4 years ago
- ⛩️ Generate infinite Japanese city names using a simple 3-Layer MLP!☆24Jan 25, 2025Updated last year
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- AI Command Center where specialized agents collaborate to create tasks and work on projects.☆56Updated this week
- ai trading agent using interactive brokers api☆98Feb 17, 2025Updated last year
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- Utilities for working with videos☆13Jul 5, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Simple example how to use FastAPI with Async SQLAlchemy 2.0☆19May 18, 2026Updated last week
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- StrongSort-Pip: Packaged version of StrongSort☆10Sep 3, 2022Updated 3 years ago
- LossHub: Loss Functions Library for Image Classification and Detection☆14Oct 9, 2022Updated 3 years ago
- Simple setup for personal dotfiles☆11Mar 29, 2026Updated last month
- Torchreid-Pip: Packaged version of Torchreid☆13Oct 16, 2022Updated 3 years ago
- Cookie based feature flags implementation for Rails.☆20Apr 19, 2022Updated 4 years ago
- Scrape Airbnb, Booking, Hotels.com from a single JavaScript module. ❗No longer maintained.☆18Apr 18, 2023Updated 3 years ago
- 커피 한 잔 마시며 끝내는 Vue.JS☆11Dec 10, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This very simple python script takes inputs from your business and outputs articles written bhy claude.☆13Apr 3, 2024Updated 2 years ago
- Generate the dnsmasq domains configuration for me☆11Sep 29, 2025Updated 7 months ago
- AI-based web extractor☆12Feb 25, 2023Updated 3 years ago
- The Florence Tool CLI provides a command-line interface for processing images using the Florence-2 model. This tool allows users to apply…☆16Jan 21, 2025Updated last year
- Progressive type annotation without regression! 🚀☆31May 1, 2026Updated 3 weeks ago
- The SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by a LLM to guide the agent's …☆24Sep 18, 2025Updated 8 months ago
- Bu repo SAHI uygulamasını mantığını öğreniyoruz.☆12Mar 11, 2022Updated 4 years ago
- Code for XPERT algorithm from Personalized Retrieval over Millions of Items☆13Sep 14, 2023Updated 2 years ago
- Python script designed to simplify the process of submitting URLs to Google's Indexing API for faster and more efficient website indexing…☆12Sep 12, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Nov 14, 2019Updated 6 years ago
- Serving hugging face guidance behind a server☆13Jun 14, 2023Updated 2 years ago
- Implementation Code for "LLM-based Medical Assistant Personalization with Short- and Long-Term Memory Coordination"☆14May 17, 2026Updated last week
- ☆10Feb 16, 2025Updated last year
- Pyramid Mask Text Detector designed by SenseTime Video Intelligence Research team.☆14Aug 1, 2019Updated 6 years ago
- ☆14Mar 11, 2025Updated last year
- The BXAQ malware is used in China. They force tourists to install this app at the border. The malware downloads a tourist’s text messages…☆10Jul 12, 2019Updated 6 years ago