Pre-train Static Word Embeddings
☆106Jun 9, 2026Updated 2 weeks ago
Alternatives and similar repositories for tokenlearn
Users that are interested in tokenlearn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lightweight Nearest Neighbors with Flexible Backends☆345May 24, 2026Updated last month
- Fast Multimodal Semantic Deduplication & Filtering☆937May 24, 2026Updated last month
- Fast State-of-the-Art Static Embeddings☆2,132Jun 6, 2026Updated 3 weeks ago
- 🔢 Work with static vector models☆39Apr 21, 2025Updated last year
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆28Updated this week
- Generalist and Lightweight Model for Text Classification☆226Jun 15, 2026Updated 2 weeks ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆24Jun 30, 2025Updated 11 months ago
- FlexiTokens☆23Dec 27, 2025Updated 6 months ago
- Nearly Inference Free Embeddings: make your RAG queries 500x faster☆78Apr 27, 2026Updated 2 months ago
- Load embeddings and featurize your sentences.☆31Oct 23, 2024Updated last year
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Jan 16, 2026Updated 5 months ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated 2 years ago
- Plug-and-play document AI with zero-shot models.☆126May 11, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Word Sense Linking model is designed to identify and disambiguate spans of text to their most suitable senses from a reference inventory.☆13Aug 23, 2024Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆48Jul 25, 2023Updated 2 years ago
- ☆57Dec 27, 2025Updated 6 months ago
- Datamodels for hugging face tokenizers☆107Jun 18, 2026Updated last week
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated last year
- ☆74May 14, 2026Updated last month
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆16Jun 3, 2024Updated 2 years ago
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆36Nov 21, 2025Updated 7 months ago
- Late Interaction Models Training & Retrieval☆859Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Simple customizable evaluation for text retrieval performance of Sentence Transformers embedders on PDFs☆30Jan 20, 2025Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆122Mar 31, 2025Updated last year
- Partial code for "Skill Extraction from Job Postings using Weak Supervision" at RecSysHR 2022.☆13May 19, 2023Updated 3 years ago
- Lightweight Non-Parametric Embedding Fine-Tuning☆42Sep 13, 2025Updated 9 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆89Feb 10, 2026Updated 4 months ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 3 years ago
- Multilingual RAG benchmark.☆11Nov 22, 2024Updated last year
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts)☆3,314Jun 16, 2026Updated last week
- Visualization and sparse autoencoder training for mechanistic interpretability on audio models☆23Apr 6, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the MTEB leaderboard☆31Feb 4, 2025Updated last year
- BERT score for text generation☆12Jan 15, 2025Updated last year
- Google 공식 Rouge Implementation을 한국어에서 사용할 수 있도록 처리☆17Jan 3, 2024Updated 2 years ago
- Training code for Sparse Autoencoders on Embedding models☆39Jun 16, 2026Updated last week
- Simply, faster, sentence-transformers☆144Aug 27, 2024Updated last year
- ModernBERT model optimized for Apple Neural Engine.☆32Jan 10, 2025Updated last year
- A Wikipedia-based summarization dataset☆14Mar 27, 2023Updated 3 years ago