mixedbread-ai / ofen
WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included
☆15Updated 6 months ago
Alternatives and similar repositories for ofen:
Users who are interested in ofen are comparing it to the libraries listed below
- Showcases how mxbai-embed-large-v1 can be used to produce binary embeddings. Binary embeddings enable 32x storage savings and 40x faster r…☆17Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆130Updated 4 months ago
- Crispy reranking models by Mixedbread☆22Updated last month
- ANE accelerated embedding models!☆16Updated 4 months ago
- Pre-train Static Word Embeddings☆56Updated 2 weeks ago
- Implementation of https://arxiv.org/pdf/2312.09299☆20Updated 9 months ago
- NLP with Rust for Python 🦀🐍☆62Updated 10 months ago
- Training hybrid models for dummies.☆20Updated 3 months ago
- Mixedbread AI Python SDK☆12Updated 9 months ago
- ☆67Updated 4 months ago
- ☆46Updated last month
- Vector Database with support for late interaction and token level embeddings.☆54Updated 6 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆174Updated 7 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆68Updated last week
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆45Updated last week
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆19Updated last month
- Efficiently computing & storing token n-grams from large corpora☆23Updated 6 months ago
- llama.cpp GGUF file parser for JavaScript☆34Updated 4 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Truly flash T5 realization!☆64Updated 11 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆18Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- Training code for Sparse Autoencoders on Embedding models☆38Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 5 months ago
- ☆33Updated 2 weeks ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆26Updated last year
- Latent Large Language Models☆17Updated 8 months ago
- ☆32Updated this week
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- ☆44Updated 3 months ago