watercrawl / self-hostedView external linksLinks
A self-hosted version of WaterCrawl, a powerful web crawling and data extraction platform.
☆13Jul 27, 2025Updated 6 months ago
Alternatives and similar repositories for self-hosted
Users that are interested in self-hosted are comparing it to the libraries listed below
Sorting:
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Jan 31, 2026Updated last week
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆20Oct 13, 2025Updated 4 months ago
- A Gradio web UI for Large Language Models. Supports LoRA/QLoRA finetuning,RAG(Retrieval-augmented generation) and Chat☆37Nov 26, 2023Updated 2 years ago
- Deepractice Role System☆24Updated this week
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆16Feb 25, 2025Updated 11 months ago
- Emotion based music recommender system☆11Mar 26, 2025Updated 10 months ago
- NDIToolbox is an open source extensible signal and image processing application under development by TRI/Austin designed to assist with t…☆10Aug 19, 2018Updated 7 years ago
- [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models☆11Sep 19, 2025Updated 4 months ago
- A jekyll template for easy creation of course websites. Checkout the template here:☆11Aug 1, 2024Updated last year
- ☆11Aug 26, 2024Updated last year
- Code repository for TIDMAD: Time series Dataset for Discovering Dark Matter with AI Denoising.☆14Oct 23, 2025Updated 3 months ago
- A detail Implementation of handling long-term memory in Agentic AI☆35Oct 9, 2025Updated 4 months ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Huggingface Backup - Jupyter, Colab and Python Script☆10Jan 20, 2026Updated 3 weeks ago
- Templates for musical textual inversion for riffusion☆11Apr 14, 2023Updated 2 years ago
- Streamlines the creation of dataset to train a Large Language Model with triplets : instruction-input-output . The default configuration …☆13Apr 17, 2023Updated 2 years ago
- A knowledge graph based forward chain inferencing engine in typescript/node.☆11Jan 23, 2021Updated 5 years ago
- This plugin provides tools to extract text from a document using the Azure AI Document Intelligence service.☆12Jan 17, 2025Updated last year
- Caddy module: dns.providers.gandi☆17Jul 15, 2025Updated 6 months ago
- Write your next novel faster and easier☆14Dec 7, 2025Updated 2 months ago
- Gradio chat interface for FastMLX☆12Sep 22, 2024Updated last year
- A Next.js chat app to use Llama 2 locally using node-llama-cpp☆12Oct 27, 2024Updated last year
- In-browser semantic search demo using EmbeddingGemma and Transformers.js. No server required.☆26Sep 7, 2025Updated 5 months ago
- This project aims to utilize Generative AI for the next marketing strategy in the case of e-commerce customer segmentation.☆12Mar 19, 2024Updated last year
- Rivet plugin to access E2B goodies☆10Feb 6, 2025Updated last year
- ☆10Sep 26, 2025Updated 4 months ago
- Guide to Installing Ragflow on Google Cloud Compute Engine☆13Sep 12, 2024Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.☆11May 26, 2023Updated 2 years ago
- Easy OpenCV Python Object Tracking Application using selectROI☆16Jun 9, 2020Updated 5 years ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- page and script for generating chladni figures☆11Jul 19, 2017Updated 8 years ago
- A powerful MCP memory using a knowledge graph powered by elastic search☆16Oct 28, 2025Updated 3 months ago
- Empowering Tomorrow Together: Your Community-Powered AI Platform☆14Aug 19, 2024Updated last year
- ☆12Feb 3, 2026Updated last week
- Project of Singing Voice Conversion.☆15Oct 27, 2023Updated 2 years ago
- Automate the batch upload and parsing of documents into Dify's knowledge base, reducing manual intervention and wait time.☆15Aug 29, 2024Updated last year
- Example plugin for Rivet, showing how to execute a python script in a node☆11Nov 15, 2023Updated 2 years ago
- Auto-Video maker handling many AI's☆11Mar 18, 2024Updated last year
- ttsmaker is a Text-to-Speech library implemented using the TTSMaker API.☆14Apr 25, 2025Updated 9 months ago