Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
☆44Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for genalog
Users that are interested in genalog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- ☆23Jan 27, 2025Updated last year
- Various handy scripts to quickly setup new Linux and Windows sandboxes, containers and WSL.☆40Mar 15, 2026Updated last week
- sketching algorithms implemented in chapel and python☆10Jun 8, 2017Updated 8 years ago
- Developing adversarial examples and showing their semantic generalization for the OpenAI CLIP model (https://github.com/openai/CLIP)☆26Mar 6, 2021Updated 5 years ago
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models☆26May 31, 2025Updated 9 months ago
- Data extraction with LLM on CPU☆112Jan 8, 2024Updated 2 years ago
- ☆22Jul 9, 2020Updated 5 years ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆104Jan 15, 2024Updated 2 years ago
- [SIGIR 2022] The official repo for the paper "Curriculum Learning for Dense Retrieval Distillation".☆23Apr 29, 2022Updated 3 years ago
- CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22)☆13Jun 17, 2022Updated 3 years ago
- execute shell commands in the Unity Editor☆11May 12, 2025Updated 10 months ago
- ☆27Feb 20, 2024Updated 2 years ago
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆18Feb 20, 2024Updated 2 years ago
- A dashboard for exploring timm learning rate schedulers☆20Nov 22, 2024Updated last year
- Neural Solr = Solr 9 + Mighty Inference + Node☆18Jun 9, 2022Updated 3 years ago
- Re-implementation of local descriptor HardNet training in fasta2+kornia☆21Apr 6, 2020Updated 5 years ago
- ☆31Dec 13, 2023Updated 2 years ago
- ☆29Feb 2, 2024Updated 2 years ago
- ☆37Jan 26, 2026Updated last month
- Create a QnA bot on a pdf☆16May 27, 2023Updated 2 years ago
- ☆22Aug 31, 2021Updated 4 years ago
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- ☆39Oct 11, 2023Updated 2 years ago
- A plugin based on scikit-image for segmenting nuclei and cells based on fluorescent microscopy images with high intensity in nuclei and/o…☆30May 1, 2025Updated 10 months ago
- Experiments to assess SPADE on different LLM pipelines.☆17Apr 7, 2024Updated last year
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆224Dec 16, 2025Updated 3 months ago
- My NER Experiments with ModernBERT and Ettin☆26Jul 17, 2025Updated 8 months ago
- ☆14Mar 31, 2025Updated 11 months ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 6 months ago
- Probabilistic LLM evaluations. [CogSci2023; ACL2023]☆73Jul 27, 2024Updated last year
- GloSAT Historical Measurement Table Dataset☆11Dec 3, 2025Updated 3 months ago
- Stream smartphone sensor data with FastAPI, Kafka, ksqlDB, and Docker.☆11Aug 3, 2023Updated 2 years ago
- A smattering of header files dumped using classdump-dyld☆13Apr 28, 2021Updated 4 years ago
- Extract full next-token probabilities via language model APIs☆248Feb 23, 2024Updated 2 years ago
- ☆50Mar 14, 2024Updated 2 years ago
- Experiments for efforts to train a new and improved t5☆76Apr 15, 2024Updated last year
- A cloud native data mesh implementation☆12Jan 15, 2021Updated 5 years ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆894Apr 17, 2024Updated last year