rwightman / genalog
Genalog is an open-source, cross-platform Python package for generating synthetic document images with custom degradations and text alignment capabilities.
☆44 · Updated 2 years ago
Alternatives and similar repositories for genalog
Users who are interested in genalog are comparing it to the libraries listed below.
- Trade any tensors over the network ☆31 · Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challenge ☆60 · Updated last year
- A library for squeakily cleaning and filtering language datasets. ☆49 · Updated 2 years ago
- minimal pytorch implementation of bm25 (with sparse tensors) ☆104 · Updated 3 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆35 · Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support ☆37 · Updated 2 years ago
- QLoRA for Masked Language Modeling ☆22 · Updated 2 years ago
- ☆22 · Updated 2 years ago
- ☆53 · Updated last year
- ☆47 · Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆32 · Updated 4 months ago
- experiments with inference on llama ☆103 · Updated last year
- Simple GRPO scripts and configurations. ☆59 · Updated last year
- Truly flash implementation of DeBERTa disentangled attention mechanism. ☆76 · Updated 2 weeks ago
- NLP with Rust for Python ☆71 · Updated 8 months ago
- PyTorch implementation for MRL ☆21 · Updated last year
- ☆23 · Updated 2 years ago
- ML/DL Math and Method notes ☆66 · Updated 2 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆160 · Updated last year
- A collection of templates for Hugging Face Spaces ☆35 · Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Updated last year
- A sample pattern for running CI tests on Modal ☆19 · Updated 9 months ago
- Pre-train Static Word Embeddings ☆94 · Updated 5 months ago
- ☆31 · Updated last year
- Embedding Recycling for Language models ☆38 · Updated 2 years ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆66 · Updated 2 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆72 · Updated this week
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆46 · Updated last year
- Disaggregators: Curated data labelers for in-depth analysis. ☆67 · Updated 3 years ago
- Various handy scripts to quickly setup new Linux and Windows sandboxes, containers and WSL. ☆40 · Updated 2 weeks ago