UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.
☆149Apr 4, 2025Updated last year
Alternatives and similar repositories for unisim
Users that are interested in unisim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- utilities for loading and running text embeddings with onnx☆45Aug 16, 2025Updated 10 months ago
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab…☆31Nov 21, 2025Updated 7 months ago
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻☆14Jun 15, 2024Updated 2 years ago
- Monitor data sources and track changes over time 🐿️☆11Nov 7, 2024Updated last year
- Text Classification Dataset for Turkish Language☆10Nov 16, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, S…☆4,196May 28, 2026Updated last month
- A maximum-strength name parser for record linkage.☆41Sep 3, 2025Updated 9 months ago
- Simple Telegram bot to annotate and varify automatic speech recognition datasets☆12Mar 30, 2021Updated 5 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Dec 14, 2021Updated 4 years ago
- ☆14Sep 18, 2024Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 9 months ago
- Efficient BM25 with DuckDB 🦆☆69Dec 20, 2024Updated last year
- Multi-model transactional embedded database☆67Dec 10, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Jan 22, 2025Updated last year
- If only std::set was a DBMS: collection of templated ACID in-memory exception-free thread-safe and concurrent containers in a header-only…☆44Oct 30, 2025Updated 7 months ago
- ☆32Jun 5, 2025Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Aug 2, 2024Updated last year
- MLOps samples and docs from real world projects in manufacturing industry☆32Oct 25, 2023Updated 2 years ago
- ☆29Jan 10, 2021Updated 5 years ago
- Blazingly fast neighborhood attention☆15Nov 28, 2023Updated 2 years ago
- Lightweight Python wrapper around the DuckDB extension, httpserver (extension developed by @quackscience)☆17Sep 24, 2025Updated 9 months ago
- 收集优质的角色扮演聊天数据 | Collection of roleplay conversations of high quality☆16Dec 1, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Example using echo conversational agent server☆15Aug 20, 2024Updated last year
- A quick Crew AI tutorial☆23May 9, 2024Updated 2 years ago
- ☆17Jun 20, 2023Updated 3 years ago
- AIBench, a tool for comparing and evaluating AI serving solutions. forked from [tsbs](https://github.com/timescale/tsbs) and adapted to A…☆21Sep 4, 2024Updated last year
- Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retriev…☆54Dec 29, 2023Updated 2 years ago
- Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_u…☆1,330Sep 16, 2025Updated 9 months ago
- Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings…☆635Sep 1, 2023Updated 2 years ago
- pseudopeople is a Python package that generates realistic simulated data about a fictional United States population, designed for use in …☆24Mar 25, 2026Updated 3 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆96May 28, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Jun 13, 2024Updated 2 years ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- Creating Debian Packages from CRAN Sources☆12Jul 1, 2020Updated 5 years ago
- ☆15Dec 21, 2025Updated 6 months ago
- A framework for evaluating Machine Translation models.☆12Apr 21, 2026Updated 2 months ago
- Social value orientation (SVO) notes for pro-social pro-self concepts☆13Apr 14, 2025Updated last year
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆116Oct 30, 2025Updated 7 months ago