Genalog is an open-source, cross-platform Python package for generating synthetic document images with custom degradations and text alignment capabilities.
☆44May 23, 2026Updated this week
Alternatives and similar repositories for genalog
Users that are interested in genalog are comparing it to the libraries listed below.
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- ☆23Jan 27, 2025Updated last year
- sketching algorithms implemented in chapel and python☆10Jun 8, 2017Updated 8 years ago
- Make triton easier☆50Jun 12, 2024Updated last year
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models☆26May 31, 2025Updated 11 months ago
- Data extraction with LLM on CPU☆113Jan 8, 2024Updated 2 years ago
- ☆22Jul 9, 2020Updated 5 years ago
- [SIGIR 2022] The official repo for the paper "Curriculum Learning for Dense Retrieval Distillation".☆23Apr 29, 2022Updated 4 years ago
- CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22)☆13Jun 17, 2022Updated 3 years ago
- execute shell commands in the Unity Editor☆11May 12, 2025Updated last year
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆18Feb 20, 2024Updated 2 years ago
- A dashboard for exploring timm learning rate schedulers☆20Nov 22, 2024Updated last year
- Neural Solr = Solr 9 + Mighty Inference + Node☆18Jun 9, 2022Updated 3 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- ☆13Jul 7, 2025Updated 10 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Re-implementation of local descriptor HardNet training in fasta2+kornia☆21Apr 6, 2020Updated 6 years ago
- A standard bare-bone ROS Gazebo simulator for the Franka Emika Panda robot built using inbuilt Gazebo ROS controllers and RobotHW interfa…☆11May 3, 2021Updated 5 years ago
- ☆37Jan 26, 2026Updated 3 months ago
- ☆22Aug 31, 2021Updated 4 years ago
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- ☆39Oct 11, 2023Updated 2 years ago
- ☆15Updated this week
- Experiments to assess SPADE on different LLM pipelines.☆17Apr 7, 2024Updated 2 years ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆225Dec 16, 2025Updated 5 months ago
- My NER Experiments with ModernBERT and Ettin☆27Jul 17, 2025Updated 10 months ago
- ☆14Mar 31, 2025Updated last year
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 8 months ago
- Probabilistic LLM evaluations. [CogSci2023; ACL2023]☆73Jul 27, 2024Updated last year
- GloSAT Historical Measurement Table Dataset☆11Dec 3, 2025Updated 5 months ago
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations☆26Mar 11, 2023Updated 3 years ago
- Stream smartphone sensor data with FastAPI, Kafka, ksqlDB, and Docker.☆11Aug 3, 2023Updated 2 years ago
- Automated machine learning for text classification☆54Updated this week
- Extract full next-token probabilities via language model APIs☆247Feb 23, 2024Updated 2 years ago
- ☆50Mar 14, 2024Updated 2 years ago
- A spatial terminal multiplexer for macOS. Terminals live on an infinite canvas that you can pan, zoom, and arrange freely.☆42Apr 2, 2026Updated last month
- Experiments for efforts to train a new and improved t5☆76Apr 15, 2024Updated 2 years ago
- Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models☆49Oct 4, 2025Updated 7 months ago
- ☆39Nov 27, 2025Updated 5 months ago