zjr2000/SPES

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zjr2000/SPES)

zjr2000 / SPES

Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"

☆23

Alternatives and similar repositories for SPES

Users that are interested in SPES are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NIneeeeeem / LangDC
View on GitHub
[EMNLP 2025 Oral] Official codebase for Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors.
☆18Sep 7, 2025Updated 10 months ago
iGuoYanjun / Memorize-When-Needed
View on GitHub
☆23Jun 29, 2026Updated 3 weeks ago
Zeqing-Wang / VideoVerse
View on GitHub
Official Repo for the VideoVerse
☆15Mar 29, 2026Updated 3 months ago
ZhuWenjie98 / DDE
View on GitHub
(ECCV2026) Dual Distribution Estimation for Zero-shot Noisy Test-Time Adaptation with VLMs
☆15Jul 2, 2026Updated 3 weeks ago
csslc / Self-Transcendence
View on GitHub
[ECCV 2026] Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Trans…
☆37Jul 3, 2026Updated 3 weeks ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
PolyU-VCLab / WRC
View on GitHub
Weighted Reverse Convolution for Feature Upsampling
☆24May 24, 2026Updated 2 months ago
langmanbusi / CoCoEdit
View on GitHub
[ICML 2026] Official PyTorch implementation of paper “CoCoEdit: Content-Consistent Image Editing via Region Regularized Reinforcement Lea…
☆26Jun 14, 2026Updated last month
PolyU-VCLab / DepthMaster
View on GitHub
DepthMaster: Unified Monocular Depth Estimation for Perspective and Panoramic Images
☆25Jun 13, 2026Updated last month
zjr2000 / Untrimmed-Video-Feature-Extractor
View on GitHub
A simple and effective feature extractor for untrimmed videos
☆13Sep 1, 2022Updated 3 years ago
Multimedia-Analytics-Laboratory / dpdmd
View on GitHub
[ICML 2026] The offical code of Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis
☆87Jun 2, 2026Updated last month
PolyU-VCLab / GGT-100K
View on GitHub
GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration
☆66Jun 1, 2026Updated last month
ttgeng233 / LongVALE
View on GitHub
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos. (CVPR 2025))
☆61Jun 9, 2025Updated last year
zjr2000 / GVL
View on GitHub
Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
☆28Dec 8, 2023Updated 2 years ago
YBZh / LAPT
View on GitHub
ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models
☆18Aug 9, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
leeruibin / hybrid-forcing
View on GitHub
☆32Apr 29, 2026Updated 2 months ago
skyhehe123 / spconv
View on GitHub
☆12Jul 18, 2024Updated 2 years ago
NIneeeeeem / LiveVLN
View on GitHub
Official codebase for LiveVLN: Breaking the Stop-and-Go Loop in Vision-Language Navigation
☆23Apr 22, 2026Updated 3 months ago
A113N-W3I / MICo-150K
View on GitHub
Official repository for the paper "MICo-150K: A Comprehensive Dataset for Multi-Image Composition".
☆111Apr 21, 2026Updated 3 months ago
arctanxarc / GENIUS
View on GitHub
☆43May 9, 2026Updated 2 months ago
shuaizhengliu / InstructRestore
View on GitHub
[NeurlPS' 25] InstructRestore: Region-Customized Image Restoration with Human Instructions
☆53Oct 23, 2025Updated 9 months ago
TencentARC / TokLIP
View on GitHub
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
☆236Aug 18, 2025Updated 11 months ago
NVlabs / VideoITG
View on GitHub
[CVPR 2026 Highlight] VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding
☆126Apr 17, 2026Updated 3 months ago
EdwardChasel / BinaryAttention
View on GitHub
[CVPR2026] BinaryAttention: One-Bit QK-Attention for Vision and Diffusion Transformers
☆41Mar 17, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ZhuWenjie98 / ANTS
View on GitHub
(CVPR2026 Oral) ANTS: Adaptive Negative Textual Space Shaping for OOD Detection via Test-Time MLLM Understanding and Reasoning
☆57Jul 1, 2026Updated 3 weeks ago
TencentARC / TimeLens
View on GitHub
[CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
☆162Updated this week
A113N-W3I / TIIF-Bench
View on GitHub
Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?".
☆129Jun 26, 2026Updated 3 weeks ago
leeruibin / MfM
View on GitHub
[ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks
☆32Feb 5, 2026Updated 5 months ago
gwenzhang / GGA
View on GitHub
[ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and…
☆36Jul 26, 2024Updated last year
Joyies / GDPO
View on GitHub
Official code for GDPO-SR: Group Direct Preference Optimization for One-Step Generative Image Super-Resolution
☆65Jun 2, 2026Updated last month
YBZh / CheXOne
View on GitHub
CheXOne: A Reasoning-Enabled Vision–Language Foundation Model for Chest X-ray Interpretation
☆41Apr 12, 2026Updated 3 months ago
langmanbusi / InsViE
View on GitHub
Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”
☆34Apr 3, 2026Updated 3 months ago
zhongchenzhao / PPMA
View on GitHub
Official codes for Polyline Path Masked Attention for Vision Transformer
☆17Jun 3, 2026Updated last month
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
zjr2000 / LLMVA-GEBC
View on GitHub
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
☆29Jan 1, 2024Updated 2 years ago
mt-cly / FPR
View on GitHub
FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation (ICCV 2023)
☆24Sep 24, 2023Updated 2 years ago
lavinal712 / Awesome-Visual-Tokenizers
View on GitHub
📖 This is a repository for organizing papers, codes and other resources related to visual tokenizers.
☆17Jul 7, 2026Updated 2 weeks ago
ChrisDud0257 / AFINE
View on GitHub
Official code for our CVPR 2025 paper: "Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption"
☆68Sep 15, 2025Updated 10 months ago
ChrisDud0257 / SSL
View on GitHub
Official code for our Paper "SSL: A Self-similarity Loss for Improving Generative Image Super-resolution" in ACMMM 2024
☆51Jun 6, 2026Updated last month
NgCafai / Transformer
View on GitHub
Transformer: PyTorch Implementation of "Attention Is All You Need"
☆15Dec 13, 2023Updated 2 years ago
waynechu1021 / NAVIDA
View on GitHub
NaVIDA: Vision-Language Navigation with Inverse Dynamics Augmentation
☆28Apr 17, 2026Updated 3 months ago