google-research / retvecLinks
RETVec is an efficient, multilingual, and adversarially-robust text vectorizer.
☆293Updated 8 months ago
Alternatives and similar repositories for retvec
Users that are interested in retvec are comparing it to the libraries listed below
Sorting:
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆144Updated 8 months ago
- ☆339Updated last year
- Mediapipe-based library to redact faces from videos and images☆442Updated 2 years ago
- Statistics of Common Crawl monthly archives mined from URL index files☆205Updated last week
- Your buddy in the (L)LM space.☆64Updated last year
- The Foundation Model Transparency Index☆84Updated 3 weeks ago
- The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their…☆71Updated 5 months ago
- A fully user-side image search engine. Accepted to CIKM 2022 demo track.☆250Updated 3 years ago
- Managing the lifecycle of machine learning to support scalability, impact, collaboration, compliance and sharing.☆91Updated this week
- Common crawl extractor☆84Updated last year
- Lightweight Nearest Neighbors with Flexible Backends☆324Updated 2 months ago
- ☆115Updated 10 months ago
- BlindBox is a tool to isolate and deploy applications inside Trusted Execution Environments for privacy-by-design apps☆63Updated 2 years ago
- The world's largest social media toxicity dataset.☆187Updated 3 years ago
- MemoryCache is an experimental development project to turn a local desktop environment into an on-device AI agent☆562Updated last year
- Creating the tools and data sets necessary to evaluate vulnerabilities in LLMs.☆26Updated 9 months ago
- The Institutional Data Initiative's pipeline for analyzing, refining, and publishing the Institutional Books 1.0 collection.☆47Updated last month
- Meta’s Anonymous Credential Service (ACS) is designed to enable it to authenticate users in a “de-identified manner,” permitting access t…☆76Updated last year
- Tech Report of the Apertus LLM Suite☆127Updated 3 months ago
- Definition for Open Weights LIcensing☆145Updated last year
- Generative AutoML for Tabular Data☆446Updated 10 months ago
- Classify data instantly using an LLM☆279Updated last year
- ☆719Updated 4 months ago
- LLM for Email Spam Detection☆117Updated 2 years ago
- Fast Text Classification with Compressors dictionary☆150Updated 2 years ago
- Completion After Prompt Probability. Make your LLM make a choice☆82Updated last year
- ☆184Updated 2 years ago
- Full text search that feels like a numpy array☆293Updated 3 weeks ago
- ☆48Updated last year
- Efficient vector database for hundred millions of embeddings.☆211Updated last year