csuhan

🐇

Focusing

Jiaming Han csuhan

🐇

Focusing

220 followers · 104 following

CUHK
https://csuhan.com

Achievements

Stars

EvolvingLMMs-Lab / Evolving-Visual-Generation

[Roadmap] Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

TeX 93 2 Updated May 5, 2026

OpenDriveLab / RISE

[RSS 2026] Code for RISE: Self-Improving Robot Policy with Compositional World Model

Python 194 5 Updated May 7, 2026

leigest519 / OpenGame

OpenGame: Open Agentic Coding for Games

TypeScript 2,003 269 Updated Apr 22, 2026

SkyworkAI / Matrix-Game

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Python 2,184 235 Updated Mar 30, 2026

OpenEnvision / Awesome-Multimodal-Modeling

Awesome Multimodal Modeling [Covers MLLM, UMM, and NMM]

310 16 Updated May 4, 2026

OpenDCAI / OpenWorldLib

Unified Codebase for Advanced World Models.

Python 746 41 Updated May 2, 2026

tulerfeng / Gen-Searcher

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Python 329 31 Updated Apr 7, 2026

Zivenzhu / GIDE

Python 4 Updated May 5, 2026

YuqingWang1029 / CubiD

[CVPR2026 Highlight] Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens https://arxiv.org/abs/2603.19232

Python 54 1 Updated Apr 10, 2026

alibaba / page-agent

JavaScript in-page GUI agent. Control web interfaces with natural language.

TypeScript 17,627 1,467 Updated Apr 28, 2026

vercel-labs / json-render

The Generative UI framework

TypeScript 14,641 786 Updated May 7, 2026

DAGroup-PKU / SpatialT2I

[CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling

82 3 Updated Mar 2, 2026

aistudynow / Comfyui-bitdance

BitDance custom nodes for ComfyUI with unified loader, text encode, sampler, and VAE nodes.

Python 32 5 Updated Feb 26, 2026

shallowdream204 / BitDance

BitDance & UniWeTok: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model.

Python 474 29 Updated Apr 20, 2026

MeiGen-AI / Infinite-World

[ICML 2026] | Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Python 156 5 Updated May 4, 2026

google-deepmind / open_x_embodiment

Jupyter Notebook 1,823 115 Updated Nov 5, 2025

OneIG-Bench / OneIG-Benchmark

[NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models across multiple dimensions, including subject-element alignment,…

Python 119 10 Updated Feb 10, 2026

NVlabs / DiffusionNFT

[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 809 33 Updated Feb 10, 2026

code-yeongyu / oh-my-openagent

omo; the best agent harness - previously oh-my-opencode

TypeScript 56,433 4,594 Updated May 7, 2026

anomalyco / opencode

The open source coding agent.

TypeScript 156,463 18,182 Updated May 7, 2026

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,131 3,810 Updated May 7, 2026

EdoardoBotta / RQ-VAE-Recommender

[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 783 114 Updated Apr 1, 2026

AkaliKong / MiniOneRec

Minimal reproduction of OneRec

Python 1,530 220 Updated Mar 31, 2026

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,030 1,395 Updated May 7, 2026

RQ-Wu / DIPO

[NeurIPS 2025] | DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data

Python 48 Updated Dec 12, 2025

tulerfeng / OneThinker

🔥 OneThinker: All-in-one Reasoning Model for Image and Video [CVPR 2026]

Python 436 31 Updated Feb 28, 2026

showlab / Adv-GRPO

[CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation.

Python 82 1 Updated Feb 26, 2026

Tongyi-MAI / Z-Image

Python 11,188 758 Updated Feb 9, 2026

LogosRoboticsGroup / SPAR

From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perception and reasoning in VLMs.

Python 84 1 Updated Jan 5, 2026

41xu / DEMO

[3DV 2026] Dense Motion Captioning

Python 32 Updated Jan 28, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jiaming Han csuhan

Achievements

Achievements

Block or report csuhan

Stars

EvolvingLMMs-Lab / Evolving-Visual-Generation

OpenDriveLab / RISE

leigest519 / OpenGame

SkyworkAI / Matrix-Game

OpenEnvision / Awesome-Multimodal-Modeling

OpenDCAI / OpenWorldLib

tulerfeng / Gen-Searcher

Zivenzhu / GIDE

YuqingWang1029 / CubiD

alibaba / page-agent

vercel-labs / json-render

DAGroup-PKU / SpatialT2I

aistudynow / Comfyui-bitdance

shallowdream204 / BitDance

MeiGen-AI / Infinite-World

google-deepmind / open_x_embodiment

OneIG-Bench / OneIG-Benchmark

NVlabs / DiffusionNFT

code-yeongyu / oh-my-openagent

anomalyco / opencode

verl-project / verl

EdoardoBotta / RQ-VAE-Recommender

AkaliKong / MiniOneRec

modelscope / ms-swift

RQ-Wu / DIPO

tulerfeng / OneThinker

showlab / Adv-GRPO

Tongyi-MAI / Z-Image

LogosRoboticsGroup / SPAR

41xu / DEMO