KLGR123 / html_chunkingView external linksLinks
Token-aware HTML chunking that preserves structure and attributes, with optional cleaning and attribute length control.
☆15Aug 12, 2025Updated 6 months ago
Alternatives and similar repositories for html_chunking
Users that are interested in html_chunking are comparing it to the libraries listed below
Sorting:
- User-friendly viewer for Parquet files☆10Jan 10, 2026Updated last month
- fine-tuning tutorial☆17Dec 13, 2025Updated 2 months ago
- DOS Program Development☆12Nov 9, 2022Updated 3 years ago
- A multifunctional car using Openmv4 and Arduino, including Machine vision, Socket communication,WiFi graph transmission☆12Nov 25, 2020Updated 5 years ago
- A modern online auction platform for trading second-hand goods.☆12Feb 10, 2022Updated 4 years ago
- 该仓库是 BUPT 智能系统实 验室的法律大模型项目,基于 ChatGLM 等开源大模型进行实现。☆11Nov 28, 2023Updated 2 years ago
- ☆14Dec 12, 2022Updated 3 years ago
- ☆11Dec 6, 2023Updated 2 years ago
- LightGBM for handling label-imbalanced data with focal and weighted loss functions in binary and multiclass classification☆21Jan 29, 2026Updated 2 weeks ago
- ☆10Jan 9, 2024Updated 2 years ago
- Redis distributed lock implementation for Python based on Pub/Sub messaging☆11Nov 15, 2025Updated 2 months ago
- Less-Resilient MapReduce for Go☆10Feb 15, 2023Updated 2 years ago
- On November 21, 1972, Kim Doo-han collapsed due to high blood pressure, an orange disease.☆11Jun 27, 2022Updated 3 years ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Fine Tune Multimodal LLM "Idefics 2" using QLoRA.☆11Apr 20, 2024Updated last year
- Array quantization and compression☆14Dec 8, 2023Updated 2 years ago
- Today I learned / 오늘의 학습 기록소 (근본있는 개발자가 되자)☆12Jun 6, 2020Updated 5 years ago
- A parameter-efficient compression model architecture for a variety of NLP tasks at BERT level performance at a fraction of the computatio…☆10Jan 25, 2026Updated 2 weeks ago
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆24Aug 2, 2025Updated 6 months ago
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.☆13Jul 12, 2025Updated 7 months ago
- Collection of shortest path algorithms (Dijkstra, A*, Bellman-Ford, All pair SP, DFS, BFS, and own) that converge to the most cost-effecti…☆10May 5, 2019Updated 6 years ago
- SDK bindings for Huidu receiver cards☆11Dec 14, 2022Updated 3 years ago
- Server side for dymension Rollapps☆15Jan 17, 2023Updated 3 years ago
- ktor-koin-sample-app☆12Oct 25, 2020Updated 5 years ago
- Claude Code for Kimi K2☆14Sep 15, 2025Updated 4 months ago
- uvx is now uvenv☆15Dec 4, 2024Updated last year
- Hunt Town is a web3 co-building community where builders come together to contribute to the expansion of web3 culture and products.☆14Jan 15, 2026Updated 3 weeks ago
- ☆11Sep 16, 2025Updated 4 months ago
- 📒 마크다운 문서속에서 터미널을 사용할 수 있다!☆10Dec 22, 2019Updated 6 years ago
- 코로나19 발생현황 변동 및 새 공지사항 푸시알림 서비스(질병관리본부 코로나19 홈페이지 데이터 이용)☆12Jan 5, 2023Updated 3 years ago
- 🛰️ Assets for Station☆13Aug 18, 2024Updated last year
- Clober Solidity Library☆10Jun 9, 2025Updated 8 months ago
- 비즈엠 개발 서버에서 전화번호 인증을 쉽게 할 수 있는 웹사이트입니다.☆10Feb 27, 2023Updated 2 years ago
- Circuit playground for learning Zero-Knowledge Proofs in circom.☆11Feb 7, 2024Updated 2 years ago
- Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!☆10Aug 29, 2018Updated 7 years ago
- [연봉 4000만원 이상] 지도 기반 소셜 네트워크 서비스☆10Jan 3, 2024Updated 2 years ago
- msx game development library ubox example☆11Apr 26, 2023Updated 2 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- This project auto-instruments containerized workloads in Kubernetes with New Relic agents.☆12Updated this week