strickvl / tinboxLinks
Translator in a box
☆28Updated last week
Alternatives and similar repositories for tinbox
Users that are interested in tinbox are comparing it to the libraries listed below
Sorting:
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated last year
- Datamodels for hugging face tokenizers☆87Updated last week
- ☆53Updated last year
- ☆80Updated last year
- Tools to make language models a bit easier to use☆64Updated this week
- Python library to use Pleias-RAG models☆68Updated 9 months ago
- ☆21Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- ☆90Updated 7 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆76Updated 2 weeks ago
- Simple UI for debugging correlations of text embeddings☆305Updated 8 months ago
- An introduction to LLM Sampling☆79Updated last year
- Small python package to measure OCR quality and other related metrics.☆26Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated 2 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- PyLate efficient inference engine☆71Updated last month
- ☆162Updated last year
- Using short models to classify long texts☆21Updated 2 years ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆155Updated 6 months ago
- ☆67Updated last year
- PyTorch implementation for MRL☆21Updated last year
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Updated 3 weeks ago
- ☆79Updated last year
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆68Updated 2 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- Generalist and Lightweight Model for Text Classification☆169Updated 2 weeks ago
- An easy way to chunk spaCy docs.☆22Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Updated last year
- ☆31Updated last year