rwightman / genalog
Genalog is an open-source, cross-platform Python package for generating synthetic document images with custom degradations and text alignment capabilities.
☆42 · Updated 10 months ago
Related projects
Alternatives and complementary repositories for genalog
- A place to store reusable transformer components of my own creation or found on the interwebs ☆44 · Updated 2 weeks ago
- 🤝 Trade any tensors over the network ☆30 · Updated last year
- Index of URLs to pdf files all over the internet and scripts ☆21 · Updated last year
- Code for NeurIPS LLM Efficiency Challenge ☆54 · Updated 7 months ago
- QLoRA for Masked Language Modeling ☆20 · Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors) ☆90 · Updated 8 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 · Updated last year
- ☆22 · Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆32 · Updated 2 months ago
- A library for squeakily cleaning and filtering language datasets. ☆45 · Updated last year
- ☆24 · Updated last year
- LLM training in simple, raw C/CUDA ☆12 · Updated last month
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Updated 8 months ago
- NLP with Rust for Python 🦀🐍 ☆59 · Updated 5 months ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆27 · Updated 2 months ago
- Make triton easier ☆41 · Updated 5 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. ☆25 · Updated last year
- QLoRA with Enhanced Multi GPU Support ☆36 · Updated last year
- experiments with inference on llama ☆105 · Updated 5 months ago
- ☆108 · Updated this week
- Tools to make language models a bit easier to use ☆30 · Updated this week
- ☆43 · Updated 2 months ago
- Using short models to classify long texts ☆20 · Updated last year
- ☆21 · Updated last week
- ☆28 · Updated last year
- [WIP] Transformer to embed Danbooru labelsets ☆13 · Updated 7 months ago
- ☆45 · Updated 2 months ago
- PyTorch implementation for MRL ☆18 · Updated 8 months ago
- Generalist and Lightweight Model for Text Classification ☆49 · Updated last week
- ML/DL Math and Method notes ☆57 · Updated 11 months ago