HawkClaws / main_content_extractorLinks

A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.

☆45

Alternatives and similar repositories for main_content_extractor

Users that are interested in main_content_extractor are comparing it to the libraries listed below

Sorting:

langchain-ai / kork
Natural Language Interfaces Powered by LLMs
☆94Updated last year
gustavoespindola / chunkerizer
Split and analyze text files using langchain and streamlit
☆48Updated last year
Portkey-AI / portkey-python-sdk
Build reliable, secure, and production-ready AI apps easily.
☆80Updated last week
langchain-ai / langgraph-example-pyproject
☆43Updated 11 months ago
masci / banks
LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…
☆113Updated 3 weeks ago
IIMunchII / restllm
REST API for Large Language Models using FastAPI, Redis and LiteLLM
☆14Updated last year
mendableai / firecrawl-py
Crawl and convert any website into clean markdown
☆55Updated last year
zozoheir / tinyllm
Develop, evaluate and monitor LLM applications at scale
☆100Updated 8 months ago
deepdoctection / notebooks
Repository for deepdoctection tutorial notebooks
☆46Updated last month
ammirsm / llamaindex-omakase-rag
This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…
☆146Updated last year
langchain-ai / competitor-analysis-bot
Example LangGraph flow that does "competitor analysis" on the web.
☆32Updated last year
langchain-ai / langchain-elastic
Elasticsearch integration into LangChain
☆58Updated 6 months ago
onepointconsulting / elasticsearch-agent
ElasticSearch agent based on ElasticSearch, LangChain and ChatGPT 4
☆48Updated last year
langchain-ai / langgraph-gen-py
CLI to generate LangGraph stubs from a specification
☆87Updated 4 months ago
hwchase17 / chain-of-verification
☆33Updated last year
zhangzhejian / codeinterpreter-codebox
Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.
☆53Updated last year
Spryngtime / openai-load-balancer
☆88Updated last year
langchain-ai / text-split-explorer
☆265Updated last year
jakecyr / openai-function-calling
Helper functions to generate JSON schema dicts for OpenAI ChatGPT function calling requests.
☆81Updated 5 months ago
addy999 / omniparser-api
Self-hosted version of Microsoft's OmniParser Image-to-text model
☆71Updated 2 months ago
reworkd / perplexity-style-streaming
⚡️ Perplexity.ai style LLM response streaming
☆161Updated last year
mendableai / QA_clustering
Analyzing chat interactions w/ LLMs to improve 🦜🔗 Langchain docs
☆80Updated 2 years ago
onepointconsulting / data-questionnaire-agent
Data Questionnaire Agent Chatbot
☆67Updated 3 months ago
yoheinakajima / jsondr
converts url content into JSON with a simple prefix
☆70Updated last year
agamm / semantic-split
A Python library to chunk/group your texts based on semantic similarity.
☆97Updated last year
gkamradt / SemanticDeduplicator
☆93Updated last year
ngaut / jarvis
☆27Updated last year
topoteretes / awesome-ai-memory
A list of AI memory projects
☆185Updated 6 months ago
parsee-ai / parsee-core
Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular…
☆72Updated 2 weeks ago
hwchase17 / conversational-retrieval-agent
☆61Updated 2 years ago