strickvl / tinbox
Translator in a box
☆11Updated 2 months ago
Alternatives and similar repositories for tinbox:
Users that are interested in tinbox are comparing it to the libraries listed below
- Tools to make language models a bit easier to use☆42Updated this week
- ☆41Updated 2 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆45Updated last week
- ☆32Updated this week
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Updated last year
- ☆9Updated 6 months ago
- Analysis on the cost of encoder based models☆11Updated 2 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 4 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 7 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆29Updated last week
- An easy way to chunk spaCy docs.☆19Updated 8 months ago
- ☆24Updated last year
- Small python package to measure OCR quality and other related metrics.☆21Updated last year
- Knowledge Graph Generator app☆30Updated last year
- Named Entity Recognition with an decoder-only (autoregressive) LLM using HuggingFace☆41Updated 5 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated last year
- An introduction to LLM Sampling☆77Updated 4 months ago
- PyTorch implementation for MRL☆18Updated last year
- ☆33Updated 2 weeks ago
- ☆77Updated 10 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 5 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated 11 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆61Updated last year
- QLoRA for Masked Language Modeling☆22Updated last year
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆25Updated 4 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 7 months ago
- Using short models to classify long texts☆21Updated 2 years ago
- Pre-train Static Word Embeddings☆55Updated last week
- An attribution library for LLMs☆38Updated 7 months ago