google-research / retvec
RETVec is an efficient, multilingual, and adversarially-robust text vectorizer.
☆292 Updated 5 months ago
Alternatives and similar repositories for retvec
Users who are interested in retvec are comparing it to the libraries listed below.
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data. ☆142 Updated 5 months ago
- The Foundation Model Transparency Index ☆83 Updated last year
- Source code for Mozilla.ai's Lumigator platform ☆256 Updated last week
- BlindBox is a tool to isolate and deploy applications inside Trusted Execution Environments for privacy-by-design apps ☆61 Updated last year
- ☆704 Updated last month
- ☆18 Updated last year
- Your buddy in the (L)LM space. ☆64 Updated last year
- Lightweight Nearest Neighbors with Flexible Backends ☆306 Updated 2 months ago
- GGUF implementation in C as a library and a tools CLI program ☆291 Updated 3 weeks ago
- GPT Takes the Bar Exam ☆142 Updated 2 years ago
- ☆116 Updated 7 months ago
- The public specifications for the C2PA ☆153 Updated 3 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition" ☆197 Updated 7 months ago
- Extend the original llama.cpp repo to support redpajama model. ☆118 Updated last year
- The world's largest social media toxicity dataset. ☆187 Updated 3 years ago
- Managing the lifecycle of machine learning to support scalability, impact, collaboration, compliance and sharing. ☆87 Updated this week
- Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Eve… ☆131 Updated last week
- The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their… ☆70 Updated last month
- MemoryCache is an experimental development project to turn a local desktop environment into an on-device AI agent ☆562 Updated last year
- Backend that powers the dataset viewer on Hugging Face dataset pages through a public API. ☆803 Updated last week
- Meta’s Anonymous Credential Service (ACS) is designed to enable it to authenticate users in a “de-identified manner,” permitting access t… ☆74 Updated last year
- Efficient vector database for hundred millions of embeddings. ☆207 Updated last year
- ☆174 Updated 2 years ago
- Lightning Fast: Faiss CPU + Onnx Quantized Multilingual Embedding Model ☆23 Updated last year
- Masked Python SDK wrapper for OpenAI API. Use public LLM APIs securely. ☆119 Updated 2 years ago
- ☆337 Updated last year
- Converts JSON-Schema to GBNF grammar to use with llama.cpp ☆55 Updated last year
- Statistics of Common Crawl monthly archives mined from URL index files ☆192 Updated this week
- Alice in Wonderland code base for experiments and raw experiments data ☆131 Updated this week
- Definition for Open Weights Licensing ☆142 Updated 11 months ago