A Python library to chunk/group your texts based on semantic similarity.
☆106 · Jun 12, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for semantic-split
Users who are interested in semantic-split are comparing it to the libraries listed below.
- Exploration of semantic chunking and chunk classification ☆19 · Sep 16, 2024 · Updated last year
- Source code for the IJCNN 2023 paper "TieFake: Title-Text Similarity and Emotion-Aware Fake News Detection". ☆16 · Dec 21, 2023 · Updated 2 years ago
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration ☆15 · Jun 4, 2024 · Updated 2 years ago
- A quick Crew AI tutorial ☆23 · May 9, 2024 · Updated 2 years ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 · Sep 17, 2025 · Updated 9 months ago
- Codebase of ACL 2024 paper "Spiral of Silence: How is Large Language Model Killing Information Retrieval?—A Case Study on Open Domain Ques… ☆16 · Jun 4, 2024 · Updated 2 years ago
- Code and data for GMT-KBQA ☆17 · Jan 5, 2023 · Updated 3 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆27 · Oct 4, 2022 · Updated 3 years ago
- GPT-based autonomous agent that does comprehensive online research on any given topic ☆13 · Aug 29, 2023 · Updated 2 years ago
- Implementation and Extensions to models in STS 2017 Shared Task for Semantic Textual Similarity ☆19 · Jun 21, 2018 · Updated 8 years ago
- Code for the paper "Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation" ☆14 · Nov 24, 2022 · Updated 3 years ago
- BPE modification that removes intermediate tokens during tokenizer training. ☆27 · Nov 25, 2024 · Updated last year
- PyTorch reimplementation of the paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization" ☆16 · Oct 17, 2021 · Updated 4 years ago
- Data mapping framework for Rust stuff ☆57 · Mar 25, 2026 · Updated 3 months ago
- ☆1,470 · Jun 18, 2024 · Updated 2 years ago
- ☆11 · Mar 5, 2025 · Updated last year
- Vector Search Benchmarking suite ☆16 · May 4, 2026 · Updated last month
- Tayra is a sophisticated call center analytics platform designed to systematically evaluate and score call center audio interactions. By … ☆14 · Dec 19, 2025 · Updated 6 months ago
- A fully autonomous AI artist ☆19 · Jun 19, 2023 · Updated 3 years ago
- ☆12 · Jan 25, 2025 · Updated last year
- Revolve: Optimizing AI Systems by Tracking Response Evolution in Textual Optimization ☆22 · Dec 13, 2024 · Updated last year
- A git repo showcasing RAG techniques for building Naive to Advanced RAG solutions ☆13 · Feb 16, 2025 · Updated last year
- A set of containerized Google Cloud Platform emulators used for development and testing purposes. ☆13 · Nov 25, 2022 · Updated 3 years ago
- ☆18 · Sep 21, 2023 · Updated 2 years ago
- ☆11 · Apr 10, 2026 · Updated 2 months ago
- ☆58 · Jun 26, 2026 · Updated last week
- An AI character interaction system with emotional modeling and advanced memory management ☆17 · Oct 26, 2024 · Updated last year
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks. ☆653 · Jun 13, 2026 · Updated 3 weeks ago
- Code and dataset "ZEST" from "Learning from task descriptions", Weller et al, EMNLP 2020 ☆17 · Mar 15, 2021 · Updated 5 years ago
- arXiv-Chat: An AI research assistant and Discord bot ☆13 · Jul 16, 2023 · Updated 2 years ago
- Centralized multi-channel notification management component for streamlined communication across email, SMS, WhatsApp, and push notificat… ☆13 · Updated this week
- ☆21 · Apr 24, 2025 · Updated last year
- Aspect and opinion term extraction for hotel reviews from AiryRooms in Bahasa Indonesia ☆16 · Jul 3, 2019 · Updated 7 years ago
- An automated data pipeline scaling RL to pretraining levels ☆76 · Jun 2, 2026 · Updated last month
- ComfyUI workflows ☆11 · Sep 19, 2024 · Updated last year
- ☆16 · Feb 1, 2025 · Updated last year
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning" ☆13 · Aug 13, 2025 · Updated 10 months ago
- Portal: GUI Tools for Agents ☆25 · Sep 18, 2025 · Updated 9 months ago
- Reflective memory for agents ☆36 · Jun 21, 2026 · Updated last week