A library for structural-semantic chunking of documents.
☆12Oct 8, 2025Updated 4 months ago
Alternatives and similar repositories for s2-chunking-lib
Users that are interested in s2-chunking-lib are comparing it to the libraries listed below
Sorting:
- ☆11Nov 7, 2025Updated 3 months ago
- FTRL-Proximal Online Learning Algorithm☆15May 22, 2017Updated 8 years ago
- 新词发现分布式机器学习算法。☆15Jul 21, 2014Updated 11 years ago
- This project has included related source codes and datasets of our EMNLP2021 paper☆10May 28, 2022Updated 3 years ago
- Turning messy repos into weapons of mass structured context.☆22Feb 20, 2026Updated last week
- allowing R users to work with dlib through Rcpp☆13Apr 11, 2018Updated 7 years ago
- ☆15Jun 6, 2025Updated 9 months ago
- Automatically track your Slack community's activity in a TSV with git☆12Jun 19, 2017Updated 8 years ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated last year
- Serve markdown versions of your HTML pages to AI agents and bots☆45Feb 22, 2026Updated last week
- Documentation and clients for the RaceHero REST API☆12Jul 12, 2019Updated 6 years ago
- A MCP for Claude Desktop to build n8n workflows for you☆19Oct 25, 2025Updated 4 months ago
- ☆11Nov 13, 2024Updated last year
- Character Embedding + ESIM + Focal Loss for Chinese Answer Sentence Selection☆10Jan 4, 2020Updated 6 years ago
- ACL24☆11Jun 7, 2024Updated last year
- Bringing some SQL to Qdrant☆15Jun 17, 2025Updated 8 months ago
- An inplementation of vggish in keras with tf backend☆11Feb 12, 2022Updated 4 years ago
- ROUGE L metric implementation using tensorflow ops☆12Sep 17, 2018Updated 7 years ago
- Simple PHP Google and Microsoft OAuth SSO integration.☆12Feb 10, 2021Updated 5 years ago
- Diff filtering, text mapping, and windowed transforms for LLM apps☆21Sep 19, 2025Updated 5 months ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475☆17Feb 7, 2019Updated 7 years ago
- Simple Structured Perceptron tagger in Python☆10May 30, 2017Updated 8 years ago
- Agent Skills to empower developers building AI applications with Weaviate.☆50Feb 27, 2026Updated last week
- An MCP server providing tools for validating and rendering Mermaid diagrams.☆15Apr 1, 2025Updated 11 months ago
- PromptCraft is a prompt perturbation toolkit from the character, word, and sentence levels for prompt robustness analysis. PyPI Package: …☆20Jan 3, 2024Updated 2 years ago
- Transforming NotebookLM into a versatile bot☆19Feb 22, 2026Updated last week
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- Converts templated Excel files to SKOS RDF☆13Jan 4, 2024Updated 2 years ago
- Mirror of Apache Spark (With R Frontend on Spark Streaming)☆11May 30, 2015Updated 10 years ago
- Report browser errors to the server with the W3C Reporting API☆12Jan 1, 2026Updated 2 months ago
- Sentiment Analysis implemented using Gluon and MXNet☆11May 12, 2018Updated 7 years ago
- Self-Evolving Vibe Coding Skillsets Accessible to Non-Technical Users. 可自我进化的技能组合:让非技术背景的人也能无碍享受AI编程。☆43Feb 5, 2026Updated last month
- ☆13May 23, 2021Updated 4 years ago
- Implementations of some semi-supervised machine learning algorithms☆13Feb 19, 2017Updated 9 years ago
- A PyTorch implementation of the CorefQA Model.☆10Jun 27, 2020Updated 5 years ago
- NL2Flow: A PDDL Interface to Flow Construction☆14Dec 4, 2025Updated 3 months ago
- This program finds words and/or phrases in PDF document pages. Elasticsearch, Django.☆13Jan 23, 2016Updated 10 years ago
- Ssebowa is free and open source library in Python that provides generative-ai models.☆14Jan 31, 2024Updated 2 years ago
- 一个微型的基于 Python 的 HMM (隐马尔可夫模型) 包 | A micro python package for HMM (Hidden Markov Model)☆15Jan 15, 2020Updated 6 years ago