JJJYmmm/Pix2SeqV2-Pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JJJYmmm/Pix2SeqV2-Pytorch)

JJJYmmm / Pix2SeqV2-Pytorch

Simple Implementation of Pix2seqV2(multi-task)

☆26

Alternatives and similar repositories for Pix2SeqV2-Pytorch

Users that are interested in Pix2SeqV2-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gaopengcuhk / Pretrained-Pix2Seq
View on GitHub
Replication of Pix2Seq with Pretrained Model
☆58Nov 6, 2021Updated 4 years ago
princeton-nlp / datamux-pretraining
View on GitHub
MUX-PLMs: Pretraining LMs with Data Multiplexing
☆15Jan 29, 2023Updated 3 years ago
coderonion / awesome-anchor-free-object-detection
View on GitHub
A collection of some awesome public Anchor-Free object detection series projects.
☆23Feb 22, 2024Updated 2 years ago
pkudba / 3DHPA
View on GitHub
☆24Mar 29, 2024Updated 2 years ago
YongWookHa / im2latex
View on GitHub
Image to LaTeX pytorch model
☆14Jul 6, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Sharpiless / Pix2seq-mmdetection
View on GitHub
Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection
☆34Apr 18, 2022Updated 4 years ago
DS4SD / MolDepictor
View on GitHub
[ICCV 23] MolGrapher: Graph-based Visual Recognition of Chemical Structures
☆16Oct 27, 2025Updated 9 months ago
GeWu-Lab / TSPM
View on GitHub
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
☆17Oct 25, 2024Updated last year
gaopengcuhk / Stable-Pix2Seq
View on GitHub
A full-fledged version of Pix2Seq
☆237Nov 6, 2021Updated 4 years ago
GeWu-Lab / LFAV
View on GitHub
Towards Long Form Audio-visual Video Understanding
☆15Jan 16, 2026Updated 6 months ago
StanfordVL / Sonicverse
View on GitHub
☆22Mar 18, 2023Updated 3 years ago
google-research / pix2seq
View on GitHub
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
☆945Nov 7, 2023Updated 2 years ago
3outeille / GPTQ-for-RWKV
View on GitHub
☆13Jun 3, 2023Updated 3 years ago
ukaukaaaa / GazeGNN
View on GitHub
Official Code for GazeGNN: A Gaze-guided Graph Neural Network for Chest X-ray Classification [WACV 2024]
☆21Aug 25, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kangben258 / UETrack
View on GitHub
☆38Mar 20, 2026Updated 4 months ago
Amaodemao / BiasPainter
View on GitHub
basically all the things I used for this article
☆24Jan 8, 2025Updated last year
hyperf / http-server
View on GitHub
☆10Jun 7, 2026Updated last month
zhang-xuan1314 / ABC-Net
View on GitHub
ABC-Net for molecular image recognition
☆18Jan 3, 2022Updated 4 years ago
Triang-jyed-driung / RWKV-LM-RLHF-DPO
View on GitHub
Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
☆11Mar 1, 2024Updated 2 years ago
Video-MAC / VideoMAC
View on GitHub
Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”
☆16May 12, 2026Updated 2 months ago
WebPAI / MRWeb
View on GitHub
☆34Mar 11, 2025Updated last year
xmed-lab / FoPro-KD
View on GitHub
TMI 2023: FoPro-KD: Fourier Prompted Effective Knowledge Distillation for Long-Tailed Medical Image Recognition
☆12Mar 19, 2024Updated 2 years ago
MLNeurosurg / opensrh
View on GitHub
OpenSRH is the first ever publicly available stimulated Raman histology (SRH) dataset and benchmark, which will facilitate the clinical t…
☆14Oct 13, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
whyb / SUTrack-ONNX
View on GitHub
[AAAI2025] SUTrack: Towards Simple and Unified Single Object Tracking. Converter to onnx model file.
☆16Apr 2, 2026Updated 3 months ago
willxxy / Text-EGM
View on GitHub
[CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations
☆12Sep 4, 2024Updated last year
jhabc1314 / jackdou-chinamap
View on GitHub
laravel 中国地图web Api集合
☆13Apr 27, 2023Updated 3 years ago
Nuisal / cellseg1
View on GitHub
☆18Apr 1, 2025Updated last year
XiaoqianRuan1 / IoU-filter
View on GitHub
☆17Dec 11, 2024Updated last year
Ahn-Ssu / VCM
View on GitHub
for reproducibility of VCM
☆12Mar 11, 2025Updated last year
wangdongdut / UAV-Vision
View on GitHub
☆20Apr 9, 2022Updated 4 years ago
daviddmc / open-fetal-mri
View on GitHub
awesome open source tools for fetal MRI analysis
☆12Apr 30, 2023Updated 3 years ago
ggjy / DeLVM
View on GitHub
☆120Jun 6, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ZhangCYG / U-RED
View on GitHub
☆20Jun 28, 2024Updated 2 years ago
ayesha-ishaq / Open3DTrack
View on GitHub
Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
☆34Mar 14, 2025Updated last year
NJUDeepEngine / CAEF
View on GitHub
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11Oct 11, 2024Updated last year
jtshou / GPEMSR
View on GitHub
Official PyTorch code for Learning Large-Factor EM Image Super-Resolution with Generative Priors (GPEMSR, CVPR2024)
☆15Aug 24, 2025Updated 11 months ago
GeWu-Lab / MWAFM
View on GitHub
Multi-Scale Attention for Audio Question Answering
☆28Jul 19, 2023Updated 3 years ago
Reagan1311 / Mask2IV
View on GitHub
Mask2IV: Interaction-Centric Video Generation via Mask Trajectories (AAAI 2026)
☆17Jun 8, 2026Updated last month
gaozhitong / MultiShiftSeg
View on GitHub
Code for "Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shift". (NeurIPS 24)
☆19Apr 21, 2025Updated last year