rwightman / genalogLinks
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
ā44Updated last year
Alternatives and similar repositories for genalog
Users that are interested in genalog are comparing it to the libraries listed below
Sorting:
- š¤ Trade any tensors over the networkā30Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.ā47Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challengeā59Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)ā104Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pā¦ā34Updated 2 years ago
- Seemless interface of using PyTOrch distributed with Jupyter notebooksā50Updated 2 weeks ago
- NLP with Rust for Python š¦šā65Updated 4 months ago
- ā49Updated 7 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.ā65Updated last week
- QLoRA with Enhanced Multi GPU Supportā37Updated 2 years ago
- Index of URLs to pdf files all over the internet and scriptsā24Updated 2 years ago
- ā22Updated 2 years ago
- experiments with inference on llamaā104Updated last year
- Various handy scripts to quickly setup new Linux and Windows sandboxes, containers and WSL.ā40Updated last week
- QLoRA for Masked Language Modelingā22Updated 2 years ago
- ā48Updated last year
- ML/DL Math and Method notesā63Updated last year
- ā71Updated 3 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.ā33Updated 2 weeks ago
- A place to store reusable transformer components of my own creation or found on the interwebsā60Updated last week
- Simple GRPO scripts and configurations.ā59Updated 7 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsā24Updated last year
- šš¤ A collection of templates for Hugging Face Spacesā35Updated last year
- ā94Updated 2 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.ā159Updated last year
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"ā65Updated last year
- Using short models to classify long textsā21Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.ā95Updated 2 years ago
- ā23Updated 2 years ago
- ā31Updated 10 months ago