huggingface / ember
ANE accelerated embedding models!
☆16Updated 4 months ago
Alternatives and similar repositories for ember:
Users that are interested in ember are comparing it to the libraries listed below
- ☆19Updated this week
- Rust crate for some audio utilities☆23Updated last month
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 10 months ago
- ☆26Updated 4 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 9 months ago
- Rust bindings for CTranslate2☆14Updated last year
- NLP with Rust for Python 🦀🐍☆62Updated 11 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Profile your CoreML models directly from Python 🐍☆27Updated 6 months ago
- Proof of concept for running moshi/hibiki using webrtc☆18Updated 2 months ago
- utilities for loading and running text embeddings with onnx☆44Updated 8 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated 11 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆70Updated this week
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 11 months ago
- LLama implementations benchmarking framework☆12Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated last month
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Updated last year
- Training hybrid models for dummies.☆20Updated 3 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated 2 years ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆18Updated last year
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.☆19Updated 10 months ago
- ☆17Updated last week
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆22Updated last month
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆58Updated last week
- Analysis on the cost of encoder based models☆11Updated 2 months ago
- ☆17Updated last month