ChunkNorris is a black belt in document chunking to feed your LLMs and RAG apps
☆23 · Feb 10, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for chunknorris
Users that are interested in chunknorris are comparing it to the libraries listed below
- Knowledge graph-based retrieval augmented generation demonstrator for EGC 2024 ☆15 · Jan 10, 2024 · Updated 2 years ago
- A terminal UI to monitor and query Elasticsearch. ☆12 · May 10, 2024 · Updated last year
- A curated list of awesome online courses about Large Language Models (LLMs) ☆259 · Oct 8, 2024 · Updated last year
- Linear Attention for Efficient Bidirectional Sequence Modeling ☆15 · May 13, 2025 · Updated 9 months ago
- ☆10 · Oct 2, 2024 · Updated last year
- Semantic Ranking Solution for Azure Database for PostgreSQL ☆14 · Apr 29, 2025 · Updated 10 months ago
- decontamination ☆26 · Dec 3, 2025 · Updated 3 months ago
- OpenSSH Vulnerabilities Scanner: Bulk Scanning Tool for 21 different OpenSSH CVEs. ☆10 · Apr 29, 2025 · Updated 10 months ago
- ☆12 · Apr 27, 2025 · Updated 10 months ago
- Advanced RAG + Raptor: A sophisticated document processing and retrieval system combining hierarchical document clustering with advanced … ☆11 · Mar 29, 2025 · Updated 11 months ago
- Poetry Corpora Annotated on Aesthetic Emotions ☆12 · Aug 2, 2022 · Updated 3 years ago
- 0-Shot Tokenizer Transplant ☆14 · May 16, 2025 · Updated 9 months ago
- Digital humanities centered on graph technologies ☆10 · Feb 12, 2026 · Updated 3 weeks ago
- Malware detection tool for Windows PE files based on DFIR ORC data ☆10 · Updated this week
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023] ☆14 · Jul 11, 2023 · Updated 2 years ago
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec… ☆10 · Apr 14, 2025 · Updated 10 months ago
- ☆10 · Sep 13, 2022 · Updated 3 years ago
- Customisable angular module to animate scroll event to an element. Compatible with Angular 2.x onwards ☆12 · Feb 11, 2019 · Updated 7 years ago
- ☆12 · Apr 3, 2014 · Updated 11 years ago
- ☆10 · Dec 17, 2020 · Updated 5 years ago
- Collection of description of concepts, procedures, and simple XSLT files for text processing, e.g. simplify InDesign documents (.idml) to… ☆12 · Jan 9, 2020 · Updated 6 years ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices ☆10 · Aug 3, 2020 · Updated 5 years ago
- Label shift estimation for transfer difficulty with Familiarity. ☆10 · Feb 4, 2025 · Updated last year
- ☆13 · Nov 28, 2025 · Updated 3 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models ☆11 · Jan 19, 2024 · Updated 2 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 · Nov 9, 2021 · Updated 4 years ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25] ☆16 · Sep 12, 2025 · Updated 5 months ago
- a collection of Node-RED nodes and flows for interactive low-code development of applications using AI technologies - free of charge and … ☆16 · Jun 1, 2024 · Updated last year
- GlotWeb: Web Indexing for Minority Languages (WWW 2026) ☆17 · Updated this week
- A web application to help check for domain or SSL/TLS certificate expirations. ☆13 · Mar 9, 2023 · Updated 2 years ago
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization ☆12 · Nov 23, 2021 · Updated 4 years ago
- ☆14 · Apr 8, 2021 · Updated 4 years ago
- Handling PWA installation prompt made easier. ☆12 · Feb 12, 2026 · Updated 3 weeks ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24 ☆14 · Mar 2, 2024 · Updated 2 years ago
- Script to upload the result of a command to google cloud storage. Used for dump backup. ☆12 · Nov 16, 2017 · Updated 8 years ago
- Kernel source tree for Raspberry Pi Foundation-provided kernel builds. Issues unrelated to the linux kernel should be posted on the commu… ☆10 · Jan 8, 2026 · Updated last month
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fi… ☆12 · Sep 17, 2024 · Updated last year
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi… ☆14 · Jun 6, 2025 · Updated 8 months ago
- ☆10 · Jun 8, 2024 · Updated last year