google-research / retvec
RETVec is an efficient, multilingual, and adversarially-robust text vectorizer.
☆294Updated this week
Alternatives and similar repositories for retvec:
Users that are interested in retvec are comparing it to the libraries listed below
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆134Updated 3 months ago
- BlindBox is a tool to isolate and deploy applications inside Trusted Execution Environments for privacy-by-design apps☆57Updated last year
- Lightweight Nearest Neighbors with Flexible Backends☆264Updated last month
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆309Updated last year
- A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models.☆67Updated last month
- Natively pre-trained open-source Portuguese language models.☆58Updated last month
- Mediapipe-based library to redact faces from videos and images☆440Updated last year
- Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems☆98Updated this week
- Zero-trust AI APIs for easy and private consumption of open-source LLMs☆38Updated 8 months ago
- Train a model, and detect gibberish strings with it.☆61Updated 3 years ago
- Generalist and Lightweight Model for Text Classification☆110Updated this week
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- Labelling platform for text using weak supervision.☆260Updated 2 years ago
- ☆16Updated last year
- Creating the tools and data sets necessary to evaluate vulnerabilities in LLMs.☆23Updated 2 weeks ago
- The Foundation Model Transparency Index☆77Updated 10 months ago
- ♠️TrucoBench: Qual é o melhor LLM no truco? Resultados, análises e insights estratégicos.☆19Updated last month
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆80Updated 3 months ago
- Prompt Exploration☆55Updated this week
- Neural Search☆352Updated 3 weeks ago
- Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exams.☆45Updated 3 months ago
- Lightning Fast: Faiss CPU + Onnx Quantized Multilingual Embedding Model☆23Updated 6 months ago
- Common crawl extractor☆75Updated 10 months ago
- The world's largest social media toxicity dataset.☆178Updated 2 years ago
- ☆54Updated 11 months ago
- Python API for https://vespa.ai, the open big data serving engine☆117Updated this week
- ☆576Updated 2 weeks ago
- Your buddy in the (L)LM space.☆63Updated 6 months ago
- Tranco: An improved top websites ranking☆146Updated 5 years ago
- GGUF implementation in C as a library and a tools CLI program☆263Updated 2 months ago