A web crawler based on LLMs, implemented with Ray and Hugging Face. The embeddings are saved into a vector database for fast clustering and retrieval. Use it for your RAG pipeline.
☆97 · Oct 15, 2023 · Updated 2 years ago
Alternatives and similar repositories for LLMWebCrawler
Users that are interested in LLMWebCrawler are comparing it to the libraries listed below.
- 2024 PyCon Korea tutorial ☆12 · Nov 8, 2024 · Updated last year
- Custom tools for agent based crewAI langchain solutions ☆10 · May 27, 2024 · Updated last year
- cluster text embeddings with DBSCAN and HDBSCAN: parameter sweep, Excel export ☆17 · Feb 21, 2026 · Updated 3 months ago
- ✍️ A browser add-on (Firefox, Chrome, Thunderbird) that allows you to autocorrect common text sequences and convert text characters to a … ☆12 · Apr 28, 2026 · Updated 3 weeks ago
- The home of the Streamlit graph visualization component powered by yFiles for HTML ☆19 · Nov 18, 2025 · Updated 6 months ago
- Example of running LangChain on Digitalocean App Platform ☆12 · Apr 14, 2023 · Updated 3 years ago
- For DJs - Import beatgrids from Traktor/Rekordbox to Engine Prime ☆17 · Feb 2, 2020 · Updated 6 years ago
- scrape the data for certain topics from reddit and create tutorial video based on that ☆10 · Sep 9, 2019 · Updated 6 years ago
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st… ☆12 · Aug 27, 2024 · Updated last year
- ☆20 · May 30, 2024 · Updated last year
- This is a LlamaIndex project bootstrapped with create-llama to act as a full stack UI to accompany Retrieval-Augmented Generation (RAG) B… ☆31 · Feb 23, 2024 · Updated 2 years ago
- scripts for working with traktor files ☆10 · Mar 11, 2018 · Updated 8 years ago
- Prompt + regex lab ☆10 · Nov 22, 2023 · Updated 2 years ago
- Crafting AI companions for authentic and engaging conversations. ☆12 · Apr 14, 2026 · Updated last month
- Unreal Media Player to bring Live and VOD video streaming in HLS and DASH formats into your Unreal apps across multiple platforms. ☆12 · May 13, 2026 · Updated last week
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera… ☆37 · Mar 26, 2024 · Updated 2 years ago
- Automatic OCR of clipboard contents. ☆14 · Aug 12, 2022 · Updated 3 years ago
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification ☆11 · Nov 15, 2023 · Updated 2 years ago
- Upload a document image or PDF, or provide a URL, to convert it into a structured format using SmolDocling. ☆16 · Mar 31, 2025 · Updated last year
- Revolutionize your recruitment process with this cutting-edge proof of concept! Developed from the inspiration of josephkearney91's Linke… ☆37 · Mar 12, 2026 · Updated 2 months ago
- ☆12 · Jan 7, 2025 · Updated last year
- A Trip planner that actually does trip planning ☆12 · Oct 15, 2022 · Updated 3 years ago
- arduino code for controlling cameras via lanc ☆16 · Jul 14, 2013 · Updated 12 years ago
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models. ☆16 · Oct 14, 2023 · Updated 2 years ago
- Engineering mathematics lecture notes ☆19 · Feb 27, 2024 · Updated 2 years ago
- A curated list of awesome things related to the WebMCP W3C standard ☆91 · Mar 25, 2026 · Updated last month
- Evently is a platform for event management. ☆16 · Updated this week
- Extended Few-Shot Learning: Exploiting Existing Resources for Novel Tasks ☆10 · Jul 6, 2021 · Updated 4 years ago
- Base NestJs project with JWT auth, Mailer, CRUD API as a Microservice architecture. ☆14 · Mar 19, 2022 · Updated 4 years ago
- This AI agent analyzes code repositories, detects potential security vulnerabilities, reviews code quality, and suggests fixes based on S… ☆12 · Feb 6, 2025 · Updated last year
- Static site for big five personality tests ☆10 · May 14, 2026 · Updated last week
- Command-line tool to generate SEO metadata and HTML meta tags using AI models ☆20 · Jan 9, 2025 · Updated last year
- A simple NextJS app that streams Langserve (python) streamings on NextJS frontend, using a hook to make it clean on components, and api c… ☆10 · Mar 12, 2024 · Updated 2 years ago
- You can use this tool to export your Telegram user, group, or chat history in JSON format, extract text messages, and it can help you ext… ☆13 · May 22, 2025 · Updated last year
- predicting linear regression with prediction intervals ☆15 · Jul 9, 2016 · Updated 9 years ago
- ES5 - Javascript design pattern examples ☆10 · Mar 28, 2017 · Updated 9 years ago
- Free demo projects ☆12 · Oct 6, 2022 · Updated 3 years ago
- An open source and enterprise-grade implementation of the orchestrator-worker pattern from Anthropic's paper, "How we built our multi-age… ☆29 · Oct 9, 2025 · Updated 7 months ago
- Tidal Cycles Code Files ☆11 · Mar 9, 2025 · Updated last year