Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆23Jul 30, 2024Updated last year
Alternatives and similar repositories for pixparse
Users that are interested in pixparse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Notebooks to demonstrate TimmWrapper☆16Jan 16, 2025Updated last year
- Github action to connect to tailscale☆20Mar 10, 2026Updated last month
- A dashboard for exploring timm learning rate schedulers☆20Nov 22, 2024Updated last year
- ☆22Apr 2, 2026Updated last week
- ☆35Jul 8, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Code for paper https://arxiv.org/abs/2501.00522☆14Apr 28, 2025Updated 11 months ago
- A huge dataset for Document Visual Question Answering☆21Jul 29, 2024Updated last year
- ☆20Aug 1, 2024Updated last year
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆72Jul 16, 2025Updated 8 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆82Jun 11, 2024Updated last year
- The repo for: TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis☆19Nov 15, 2025Updated 4 months ago
- General multi-task deep RL Agent☆186Apr 3, 2026Updated last week
- A collection of LLM token samplers in Rust☆18Nov 9, 2023Updated 2 years ago
- ☆25Dec 13, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Oct 9, 2023Updated 2 years ago
- Application for searching images from natural language queries☆46Dec 10, 2021Updated 4 years ago
- Modified Score-Entropy-Discrete-Diffusion to do a character level ml model and integrate with Oxen☆20Apr 26, 2024Updated last year
- An open source implementation of CLIP.☆33Nov 7, 2022Updated 3 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- Accelerated inference of 🤗 models using FuriosaAI NPU chips.☆27Apr 3, 2026Updated last week
- Google TPU optimizations for transformers models☆136Jan 23, 2026Updated 2 months ago
- ☆21Dec 5, 2022Updated 3 years ago
- A simple tool to guess an HuggingFace repo URL from a state dict.☆48Oct 29, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16Aug 7, 2024Updated last year
- Generating Captions via Perceiver-Resampler Cross-Attention Networks☆17Dec 20, 2022Updated 3 years ago
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning☆102Apr 17, 2025Updated 11 months ago
- This repo consists of code for plotting top loss images☆13May 18, 2020Updated 5 years ago
- Implementation of Stochastic Depth Networks in Keras☆13Sep 10, 2016Updated 9 years ago
- Animatediff implementation. Includes a ControlNet pipeline.☆19Dec 24, 2023Updated 2 years ago
- Load & manage evolving datasets efficiently☆23Aug 22, 2025Updated 7 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]☆179Dec 2, 2025Updated 4 months ago
- Official implementation for "Nested Attention: Semantic-aware Attention Values for Concept Personalization" [SIGGRAPH 2025]☆27Aug 4, 2025Updated 8 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆93Apr 2, 2026Updated last week
- RISE-SDF: a Relightable Information-Shared Signed Distance Field for Glossy Object Inverse Rendering☆24Nov 13, 2024Updated last year
- ☆16Aug 22, 2021Updated 4 years ago
- CVPR 24 paper: Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs☆14Mar 19, 2024Updated 2 years ago
- This is the project for IRM methods☆12Sep 13, 2021Updated 4 years ago
- Run CellProfiler on Terra. Contains workflows that enable a full end-to-end Cell Painting pipeline.☆11May 22, 2024Updated last year
- MLOps pipeline for NVIDIA Merlin on GKE☆41Jun 10, 2021Updated 4 years ago