Docs‑focused crawler that converts documentation sites to clean Markdown.
☆40Feb 9, 2026Updated last month
Alternatives and similar repositories for docrawl
Users that are interested in docrawl are comparing it to the libraries listed below
Sorting:
- treemind interprets tree models☆41Jul 23, 2025Updated 7 months ago
- Rewrite of Andrej Karpathy makemore character level language model.☆22Jan 30, 2025Updated last year
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- A diffusers API in Burn (Rust)☆27Updated this week
- Using deep research workflow to generate datasets for finetuning LLMs.☆39Oct 9, 2025Updated 5 months ago
- The rag pipeline for optimizing dynamic data editing.☆21Oct 30, 2025Updated 4 months ago
- A Kotlin framework for writing cleaner backend APIs through intelligent data composition☆24Sep 26, 2025Updated 5 months ago
- code for training and using chess embeddings models☆13Jun 9, 2024Updated last year
- Yupiik Fusion is a modern, high-performance web framework built on top of GraalVM. The framework is designed to provide a streamlined and…☆15Mar 15, 2026Updated last week
- ☆16Jun 4, 2025Updated 9 months ago
- A text analysis library for relevance and subtheme detection☆16Updated this week
- Pure Java Protobuf tools☆30Updated this week
- ☆17Dec 16, 2024Updated last year
- a git command to display an overview of the status of many git projects at once☆40Nov 1, 2025Updated 4 months ago
- IngestRSS is an AWS-based RSS feed processing system that automatically fetches, processes, and stores articles from specified RSS feeds.…☆16Dec 22, 2024Updated last year
- An automated data pipeline scaling RL to pretraining levels☆74Oct 11, 2025Updated 5 months ago
- Upload SQLite database files to Datasette☆14Nov 10, 2025Updated 4 months ago
- appengine-awt is a pure java implementation of the java.awt and javax.imageio packages for use in the Google AppEngine environment.☆16Mar 12, 2013Updated 13 years ago
- A self-hosted AI agent daemon built in Rust. Runs on your machine, talks through Telegram, Discord, Slack, Email and Matrix. Fully local …☆49Mar 8, 2026Updated 2 weeks ago
- ☆21Dec 22, 2024Updated last year
- a lightweight groovy orm☆10Sep 7, 2015Updated 10 years ago
- Real-world AI engineering dataset creation, SFT fine-tuning, and GRPO alignment ETL pipeline.☆33Aug 27, 2025Updated 6 months ago
- A Maven plugin for compiling coffeescript into javascript.☆35May 25, 2012Updated 13 years ago
- Add your configs for tmux☆18Apr 3, 2022Updated 3 years ago
- Datasette plugin providing a UI for executing SQL writes against the database☆12Nov 11, 2025Updated 4 months ago
- LLM Context Manager for inference optimization☆25Jul 28, 2025Updated 7 months ago
- An interactive web app to visualize and explore data structures and algorithms. Users can perform operations like insertion, deletion, an…☆20Apr 14, 2025Updated 11 months ago
- Datasette plugin that adds a .atom output format☆14Nov 2, 2025Updated 4 months ago
- rudradb-opin-examples is for example implementations of the pip install rudradb-opin☆29Mar 3, 2026Updated 2 weeks ago
- ☆11Mar 11, 2023Updated 3 years ago
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models☆19Apr 1, 2025Updated 11 months ago
- MATLAB/Octave generator of Hamming ECC coding. Output format is Verilog HDL.☆12Dec 27, 2022Updated 3 years ago
- Implicit Data Markup☆13Jan 15, 2025Updated last year
- A collection of experimental Retrieval Augmented Generation (RAG) Techniques to elevate your pipelines, all with code and intuitive expla…☆34Jul 21, 2025Updated 8 months ago
- world's stupidest moe llm in 103M parameters☆20Jul 18, 2025Updated 8 months ago
- ☆13Feb 24, 2026Updated 3 weeks ago
- A View Model framework written in rust, inspired by Next.js.☆10May 29, 2023Updated 2 years ago
- A distributed execution framework built upon lunatic.☆16Jan 19, 2024Updated 2 years ago
- Let Claude Code answer your Microsoft Teams messages while you do literally anything else☆72Updated this week