jacobmarks / emoji_search
Semantically Search Emojis From the Command Line!
☆12 Updated 11 months ago
Related projects
Alternatives and complementary repositories for emoji_search
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 Updated 7 months ago
- Using short models to classify long texts ☆20 Updated last year
- Efficient few-shot learning with cross-encoders. ☆40 Updated 8 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread ☆19 Updated 7 months ago
- NLP with Rust for Python 🦀🐍 ☆59 Updated 5 months ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆27 Updated 2 months ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search ☆21 Updated 11 months ago
- Build agentic workflows with function calling ☆20 Updated last week
- Tool to take your ML model from local to production with one line of code. ☆23 Updated 9 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆101 Updated 5 months ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems ☆10 Updated 11 months ago
- Chrome Extension for exploring Hugging Face datasets ☆47 Updated last month
- High-level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆59 Updated last week
- Convert datasets from Hugging Face to FiftyOne for visualization ☆10 Updated 7 months ago
- Showcase how mxbai-embed-large-v1 can be used to produce binary embeddings. Binary embeddings enabled 32x storage savings and 40x faster r… ☆16 Updated 7 months ago
- A RAG that can scale 🧑🏻‍💻 ☆11 Updated 5 months ago
- ☆10 Updated last month
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆57 Updated 2 years ago
- ☆17 Updated last year
- ☆49 Updated 2 months ago
- Python API for https://vespa.ai, the open big data serving engine ☆101 Updated this week
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆72 Updated last year
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont… ☆47 Updated last week
- Source code and data for Like a Good Nearest Neighbor ☆28 Updated 9 months ago
- ☆64 Updated this week
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆36 Updated 7 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆48 Updated 8 months ago
- Fine-tune Mistral 7B to generate fashion style suggestions ☆31 Updated 10 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆65 Updated last year
- An easy way to chunk spaCy docs. ☆16 Updated 2 months ago