A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.
☆51May 16, 2024Updated last year
Alternatives and similar repositories for main_content_extractor
Users that are interested in main_content_extractor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-hosted, horizontally-scalable Playwright grid. Spin up as many browser workers as you need on your own infrastructure and access the…☆30Apr 6, 2026Updated last week
- ☆14Jul 8, 2023Updated 2 years ago
- ☆24Jan 22, 2025Updated last year
- Minimal Chatbot based on Vercel AI Chatbot☆36Jan 8, 2026Updated 3 months ago
- Convert datasets from Hugging Face to FiftyOne for Visualization☆11Mar 15, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Celery + Flower + Docker + Nginx + Basic Auth☆21Jul 22, 2022Updated 3 years ago
- Clober Solidity Library☆10Jun 9, 2025Updated 10 months ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- ai trading agent using interactive brokers api☆97Feb 17, 2025Updated last year
- Utilities for working with videos☆13Jul 5, 2025Updated 9 months ago
- A Simple example how to use FastAPI with Async SQLAlchemy 2.0☆20Mar 6, 2026Updated last month
- ☆18Dec 5, 2025Updated 4 months ago
- A curated list of personalized Language model / Large language model (continually updated)☆10Nov 17, 2023Updated 2 years ago
- Constructing community of LLM-based Agent in the minecraft☆17Nov 27, 2025Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- StrongSort-Pip: Packaged version of StrongSort☆10Sep 3, 2022Updated 3 years ago
- Official TypeScript/JavaScript SDK for the Supadata API.☆21Feb 23, 2026Updated last month
- ☆14Apr 26, 2022Updated 3 years ago
- LossHub: Loss Functions Library for Image Classification and Detection☆14Oct 9, 2022Updated 3 years ago
- not financial advice but im using this to scale in slowly☆31Aug 28, 2025Updated 7 months ago
- Emoji embeddings trained using their emotional content from their online dictionary meanings.☆16Jan 10, 2022Updated 4 years ago
- Parsing, processing, and translation of PostgreSQL, MySQL and ADQL queries☆15Aug 18, 2025Updated 7 months ago
- An AI agent to create short stories, using Gemini and Imagen for illustrations. The project is developed in Java 21 with LangChain4j, and…☆12Sep 4, 2025Updated 7 months ago
- Cookie based feature flags implementation for Rails.☆20Apr 19, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 🤖 Auto Content Generator: A .NET 8 API that leverages OpenAI's GPT-4o to automatically generate markdown formatted blog posts, commit th…☆35Jun 3, 2024Updated last year
- Scrape Airbnb, Booking, Hotels.com from a single JavaScript module. ❗No longer maintained.☆18Apr 18, 2023Updated 2 years ago
- 커피 한 잔 마시며 끝내는 Vue.JS☆11Dec 10, 2022Updated 3 years ago
- Python Script for Copywriters to Gather Data from Competing Content and Find Keyword Overlap☆15Apr 23, 2022Updated 3 years ago
- Generate the dnsmasq domains configuration for me☆13Sep 29, 2025Updated 6 months ago
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning☆11Sep 19, 2020Updated 5 years ago
- The Florence Tool CLI provides a command-line interface for processing images using the Florence-2 model. This tool allows users to apply…☆16Jan 21, 2025Updated last year
- An accurate, extensible, and fast HTML-to-markdown converter.☆23Feb 7, 2026Updated 2 months ago
- Progressive type annotation without regression! 🚀☆31Oct 16, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repo includes Midjourney prompt curation to use Midjourney better.☆13Feb 22, 2023Updated 3 years ago
- ☆10Nov 14, 2019Updated 6 years ago
- Serving hugging face guidance behind a server☆13Jun 14, 2023Updated 2 years ago
- An API that allows you to scrape blog posts and articles and get a list of notes or a summary back.☆10Mar 31, 2023Updated 3 years ago
- Use Amazon S3 as a simple json database and serverless API☆24Aug 1, 2023Updated 2 years ago
- A curated list of small language models (SLMs) that are very good a particular task and have seen real enterprise adoption.☆57Oct 17, 2025Updated 5 months ago
- wip : personal finance tracker☆16Jan 15, 2025Updated last year