Simplified implementation of UMAP like dimensionality reduction algorithm
☆54Nov 18, 2024Updated last year
Alternatives and similar repositories for nano-umap
Users that are interested in nano-umap are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- music visualization via umap of stable audio latents☆54Nov 29, 2025Updated 5 months ago
- ☆24Dec 11, 2021Updated 4 years ago
- FlexiTokens☆22Dec 27, 2025Updated 4 months ago
- Training code for Sparse Autoencoders on Embedding models☆39Apr 25, 2026Updated 2 weeks ago
- ☆95Apr 21, 2026Updated 2 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆35Jul 27, 2025Updated 9 months ago
- I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)☆22Oct 30, 2024Updated last year
- TopoTrans: Optimal Transport meets Topological Data Analysis☆14Apr 20, 2023Updated 3 years ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation☆11Mar 7, 2023Updated 3 years ago
- Extract code into standalone executable scripts from a Quarto Document (sort of like `knitr::purl()` but for all outputs)☆30Jan 24, 2026Updated 3 months ago
- Interactive Variational Autoencoder (VAE)☆73Oct 26, 2024Updated last year
- ☆15Sep 27, 2023Updated 2 years ago
- High-Performance Text Deduplication Toolkit☆62Aug 25, 2025Updated 8 months ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Fast, permanent and flexible patterns for sharing and computing on texts with metadata using Apache Arrow.☆15Mar 1, 2022Updated 4 years ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 5 months ago
- The Linear Optimal Transport Framework☆16Oct 7, 2020Updated 5 years ago
- Tools to create, edit, and persist annotated regions for HoloViews☆29Mar 14, 2025Updated last year
- Some random tools for working with the GGUF file format☆32Nov 24, 2023Updated 2 years ago
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control.☆28Nov 29, 2025Updated 5 months ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.☆22Sep 20, 2025Updated 7 months ago
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- ☆26Apr 17, 2026Updated 3 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Notebooks and other course materials for Emory QTM 340 (Fall 2022)☆12Dec 13, 2022Updated 3 years ago
- [NAACL 2024] A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆18Nov 26, 2023Updated 2 years ago
- Using deep research workflow to generate datasets for finetuning LLMs.☆39Oct 9, 2025Updated 7 months ago
- Implementation of the GLOM model for text☆11Mar 4, 2021Updated 5 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- The rag pipeline for optimizing dynamic data editing.☆21Oct 30, 2025Updated 6 months ago
- A library to easily synchronize sessions between themselves or on local drive for later reuse.☆25Jun 12, 2023Updated 2 years ago
- A parent repo that ties together the other three happy repos for development.☆30Dec 3, 2025Updated 5 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Benchmark of common hash functions☆10Sep 15, 2019Updated 6 years ago
- Exploring dimension-reduced embeddings☆111Jul 6, 2022Updated 3 years ago
- ☆10Feb 2, 2023Updated 3 years ago
- generalized principal component analysis (GLM-PCA) implemented in python☆63Feb 1, 2021Updated 5 years ago
- code for training and using chess embeddings models☆13Jun 9, 2024Updated last year
- ☆16Jun 4, 2025Updated 11 months ago
- A text analysis library for relevance and subtheme detection☆16Mar 20, 2026Updated last month