mixedbread-ai / ofen
WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included
☆13Updated last month
Related projects ⓘ
Alternatives and complementary repositories for ofen
- ☆11Updated 4 months ago
- ☆106Updated 3 weeks ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆158Updated 2 months ago
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…☆16Updated 7 months ago
- Late Interaction Models Training & Retrieval☆161Updated 2 weeks ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆32Updated last year
- mixedbread ai python sdk☆10Updated 4 months ago
- Joint speech-language model - respond directly to audio!☆30Updated 6 months ago
- ☆92Updated last month
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- ☆23Updated 4 months ago
- Generalist and Lightweight Model for Text Classification☆48Updated 2 months ago
- Tokun to can tokens☆15Updated last month
- Tune MPTs☆84Updated last year
- experiments with inference on llama☆105Updated 5 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆236Updated 4 months ago
- A pipeline for LLM knowledge distillation☆77Updated 3 months ago
- ☆64Updated this week
- Synthetic Data for LLM Fine-Tuning☆93Updated 11 months ago
- ☆21Updated 5 months ago
- Google TPU optimizations for transformers models☆74Updated last week
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆64Updated 3 weeks ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆160Updated last week
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated last month
- Efficient few-shot learning with cross-encoders.☆40Updated 8 months ago
- Structured generation in Rust☆116Updated this week
- minimal pytorch implementation of bm25 (with sparse tensors)☆88Updated 8 months ago
- Full finetuning of large language models without large memory requirements☆93Updated 10 months ago
- A framework for evaluating function calls made by LLMs☆34Updated 3 months ago