4DVLab/Freqpolicy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/4DVLab/Freqpolicy)

4DVLab / Freqpolicy

[NIPS 2025] FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens

☆20

Alternatives and similar repositories for Freqpolicy

Users that are interested in Freqpolicy are comparing it to the libraries listed below

Sorting:

InternRobotics / VL-LN
View on GitHub
VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs
☆48Jan 5, 2026Updated last month
sailor-z / SE-GS
View on GitHub
Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis (ICCV 2025, Oral)
☆39Oct 31, 2025Updated 4 months ago
4DVLab / DexGrasp-Anything
View on GitHub
CVPR 2025(Highlight) DexGraspAnything: Towards Universal Robotic Dexterous Grasping with Physics Awareness
☆208Dec 22, 2025Updated 2 months ago
SanMumumu / FlowRAM
View on GitHub
[2025CVPR] FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation
☆53Nov 11, 2025Updated 3 months ago
GaavaMa / Causal-Diffusion-Policy
View on GitHub
☆28Aug 6, 2025Updated 6 months ago
4DVLab / SemGeoMo
View on GitHub
Official implement for SemGeoMo: Dynamic Contextual Human Motion Generation with Semantic and Geometric Guidance
☆43Jul 18, 2025Updated 7 months ago
cschenxiang / FoundIR-v2
View on GitHub
☆21Dec 14, 2025Updated 2 months ago
Xinzhe99 / OpenFocus
View on GitHub
A free and open-source focus stacking software that supports multi-focus image alignment and fusion.
☆19Feb 5, 2026Updated 3 weeks ago
JiahuaDong / CCVD
View on GitHub
[AAAI2026] Bring Your Dreams to Life: Continual Text-to-Video Customization
☆36Dec 9, 2025Updated 2 months ago
xy-gao / DA3-blender
View on GitHub
Blender addon for Depth-Anything-3 3D reconstruction
☆94Dec 21, 2025Updated 2 months ago
Eyeline-Labs / VChain
View on GitHub
[ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation
☆116Oct 7, 2025Updated 4 months ago
woven-by-toyota / DiffusionNOCS
View on GitHub
☆41Aug 27, 2024Updated last year
AnoK3111 / BC-SAM
View on GitHub
[ISBI 2024] Official PyTorch implementation of Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Seg…
☆11Aug 12, 2024Updated last year
chandraprvkvsh / Physics-Informed-Neural-Networks
View on GitHub
Developed and optimized Physics-Informed Neural Networks for solving non-linear Partial Differential Equations in a completely unsupervis…
☆12Feb 17, 2025Updated last year
ZiyuGuo99 / MME-CoF
View on GitHub
Are Video Models Ready as Zero-shot Reasoners?
☆84Nov 24, 2025Updated 3 months ago
aimagelab / ScanDiff
View on GitHub
This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …
☆24Dec 4, 2025Updated 2 months ago
Xinxi-Zhang / Re-MeanFlow
View on GitHub
☆43Dec 1, 2025Updated 2 months ago
LittleFocus2201 / ICASSP2024
View on GitHub
This repository is the official implementation of ICASSP2024 paper: Highlight removal network based on an improved dichromatic reflection…
☆13Apr 18, 2024Updated last year
ZheningHuang / SpaceTimePilot
View on GitHub
[CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time
☆99Jan 1, 2026Updated 2 months ago
WayneTomas / Artemis
View on GitHub
This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning".
☆14Dec 4, 2025Updated 2 months ago
LaVi-Lab / Rethink_CoT_Video
View on GitHub
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆20Dec 14, 2025Updated 2 months ago
snap-research / kontinuouskontext
View on GitHub
☆27Updated this week
kxhit / cvpr25_oral_gpu_info
View on GitHub
Just wanna see what type and how many GPUs/TPUs are used in CVPR 2025 oral papers. Fun vibe coding with LLMs.
☆12Apr 24, 2025Updated 10 months ago
fuxiAIlab / NetEaseCrowd-Dataset
View on GitHub
NetEaseCrowd dataset, a collection of data obtained from You Ling crowdsourcing platform, Fuxi AI Lab, NetEase.
☆11Dec 19, 2024Updated last year
GaozhengPei / FreqPure
View on GitHub
☆20Sep 23, 2025Updated 5 months ago
YWenxi / think-with-images-through-self-calling
View on GitHub
official repo for `thinking with images through-self-calling`
☆20Dec 28, 2025Updated 2 months ago
zhoujiahuan1991 / ICML2025-VGP
View on GitHub
Official implementation of paper "Vision Graph Prompting via Semantic Low-Rank Decomposition", ICML 2025
☆16Dec 25, 2025Updated 2 months ago
ronen94 / SAEdit
View on GitHub
The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder
☆18Oct 19, 2025Updated 4 months ago
kuleshov-group / e2d2
View on GitHub
[NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference
☆36Oct 29, 2025Updated 4 months ago
mar4945 / Vision-Based-Pure-Pursuing-Algorithm
View on GitHub
This repository contains the prject carried out for the IFAC20 MATLAB Minidrone competition held during the IFAC Conference in Berlin. T…
☆12Aug 11, 2022Updated 3 years ago
yunlong10 / Video-R4
View on GitHub
Reinforcing Text-Rich Video Reasoning with Visual Rumination
☆27Nov 24, 2025Updated 3 months ago
WangRongsheng / ReadPaper
View on GitHub
🧑‍🚀 Professional translation and reading of English academic papers in PDF format.
☆10Nov 2, 2023Updated 2 years ago
YixiangChen515 / EC-Flow
View on GitHub
[ICCV 2025] Official repo of "EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow"
☆27Oct 16, 2025Updated 4 months ago
Visual-AI / JoVA
View on GitHub
JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
☆30Dec 22, 2025Updated 2 months ago
AnranQi / Rags2riches
View on GitHub
Code of Rags2riches
☆20May 26, 2025Updated 9 months ago
Philo-Li / claudebot
View on GitHub
Use claude code anywhere.
☆42Feb 12, 2026Updated 2 weeks ago
hjrPhoebus / X-Dub
View on GitHub
Official project page for "From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing" (X-Dub).
☆29Jan 31, 2026Updated last month
yz-cnsdqz / primal-release
View on GitHub
official implementation of [PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning, ICCV'25]
☆35Oct 31, 2025Updated 4 months ago
amazon-far / BAR
View on GitHub
code & model for arxiv paper "Autoregressive Image Generation with Masked Bit Modeling"
☆35Feb 10, 2026Updated 2 weeks ago