Docs‑focused crawler that converts documentation sites to clean Markdown.
☆45 · Mar 21, 2026 · Updated 2 months ago
Alternatives and similar repositories for docrawl
Users that are interested in docrawl are comparing it to the libraries listed below.
- Ask a directory of files questions. Powered by ChromaDB and ChatGPT ☆14 · Aug 15, 2023 · Updated 2 years ago
- treemind interprets tree models ☆41 · Apr 9, 2026 · Updated last month
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control. ☆28 · Nov 29, 2025 · Updated 5 months ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects. ☆22 · Sep 20, 2025 · Updated 8 months ago
- Groq-powered MAD: The first work to explore Multi-Agent Debate with Large Language Models :D ☆12 · Jul 5, 2024 · Updated last year
- LiteLLM model integration for Pydantic AI framework - access 100+ LLM providers through a unified interface ☆22 · Apr 15, 2026 · Updated last month
- An example of how to use Whisper.cpp bindings for Rust to perform speech-to-text ☆24 · Dec 22, 2023 · Updated 2 years ago
- A diffusers API in Burn (Rust) ☆27 · Mar 17, 2026 · Updated 2 months ago
- Using deep research workflow to generate datasets for finetuning LLMs. ☆40 · Oct 9, 2025 · Updated 7 months ago
- A lightweight selfhosted web file uploader using a gist backend ☆19 · Feb 12, 2025 · Updated last year
- ☆19 · Oct 18, 2025 · Updated 7 months ago
- code for training and using chess embeddings models ☆13 · Jun 9, 2024 · Updated last year
- IaC framework for k8s clusters ☆18 · Jun 19, 2025 · Updated 11 months ago
- A text analysis library for relevance and subtheme detection ☆16 · Mar 20, 2026 · Updated 2 months ago
- An interface that converts any PDF document you give it, offline, into an interview between two people discussing it. ☆17 · Dec 8, 2024 · Updated last year
- This is a training method to produce a split brain model ☆14 · Mar 7, 2025 · Updated last year
- IngestRSS is an AWS-based RSS feed processing system that automatically fetches, processes, and stores articles from specified RSS feeds.… ☆17 · Dec 22, 2024 · Updated last year
- A minimalist implementation of the ViT (Vision Transformer) model, using tinygrad ☆17 · Sep 2, 2024 · Updated last year
- MQTT broker ☆11 · Apr 2, 2026 · Updated last month
- An automated data pipeline scaling RL to pretraining levels ☆77 · Oct 11, 2025 · Updated 7 months ago
- Upload SQLite database files to Datasette ☆14 · Nov 10, 2025 · Updated 6 months ago
- a docker isolated proxmox vma file extractor - get full access to your files ☆51 · Apr 27, 2022 · Updated 4 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025] ☆29 · Feb 20, 2026 · Updated 3 months ago
- Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable wi… ☆26 · Jun 7, 2025 · Updated 11 months ago
- Terraform Module for managing Oracle Cloud Infrastructure Identity and Access Management (IAM) resources ☆15 · May 30, 2025 · Updated 11 months ago
- Real-world AI engineering dataset creation, SFT fine-tuning, and GRPO alignment ETL pipeline. ☆34 · Aug 27, 2025 · Updated 8 months ago
- Datasette plugin adding a llm_embed(model_id, text) SQL function ☆18 · Mar 17, 2024 · Updated 2 years ago
- Datasette plugin providing a UI for executing SQL writes against the database ☆12 · Nov 11, 2025 · Updated 6 months ago
- An interactive web app to visualize and explore data structures and algorithms. Users can perform operations like insertion, deletion, an… ☆22 · Apr 14, 2025 · Updated last year
- Datasette plugin for working with Apple's binary plist format ☆14 · Feb 17, 2023 · Updated 3 years ago
- Persistent caching for Python functions ☆18 · Dec 10, 2025 · Updated 5 months ago
- Datasette plugin that adds a .atom output format ☆14 · Apr 8, 2026 · Updated last month
- ☆11 · Mar 11, 2023 · Updated 3 years ago
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models ☆19 · Apr 1, 2025 · Updated last year
- A Model Context Protocol (MCP) server that provides vision capabilities to analyze image and video ☆50 · Apr 6, 2026 · Updated last month
- Implicit Data Markup ☆13 · Jan 15, 2025 · Updated last year
- A collection of experimental Retrieval Augmented Generation (RAG) Techniques to elevate your pipelines, all with code and intuitive expla… ☆36 · Jul 21, 2025 · Updated 10 months ago
- world's stupidest moe llm in 103M parameters ☆20 · Jul 18, 2025 · Updated 10 months ago
- ☆13 · Feb 24, 2026 · Updated 2 months ago