Skip to content
View csuhan's full-sized avatar
🐇
Focusing
🐇
Focusing

Block or report csuhan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[Roadmap] Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

TeX 93 2 Updated May 5, 2026

[RSS 2026] Code for RISE: Self-Improving Robot Policy with Compositional World Model

Python 194 5 Updated May 7, 2026

OpenGame: Open Agentic Coding for Games

TypeScript 2,003 269 Updated Apr 22, 2026

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Python 2,184 235 Updated Mar 30, 2026

Awesome Multimodal Modeling [Covers MLLM, UMM, and NMM]

310 16 Updated May 4, 2026

Unified Codebase for Advanced World Models.

Python 746 41 Updated May 2, 2026

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Python 329 31 Updated Apr 7, 2026
Python 4 Updated May 5, 2026

[CVPR2026 Highlight] Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens https://arxiv.org/abs/2603.19232

Python 54 1 Updated Apr 10, 2026

JavaScript in-page GUI agent. Control web interfaces with natural language.

TypeScript 17,627 1,467 Updated Apr 28, 2026

The Generative UI framework

TypeScript 14,641 786 Updated May 7, 2026

[CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling

82 3 Updated Mar 2, 2026

BitDance custom nodes for ComfyUI with unified loader, text encode, sampler, and VAE nodes.

Python 32 5 Updated Feb 26, 2026

BitDance & UniWeTok: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model.

Python 474 29 Updated Apr 20, 2026

[ICML 2026] | Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Python 156 5 Updated May 4, 2026
Jupyter Notebook 1,823 115 Updated Nov 5, 2025

[NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models across multiple dimensions, including subject-element alignment,…

Python 119 10 Updated Feb 10, 2026

[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 809 33 Updated Feb 10, 2026

omo; the best agent harness - previously oh-my-opencode

TypeScript 56,433 4,594 Updated May 7, 2026

The open source coding agent.

TypeScript 156,463 18,182 Updated May 7, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,131 3,810 Updated May 7, 2026

[Pytorch] Generative retrieval model using semantic IDs from "Recommender Systems with Generative Retrieval"

Python 783 114 Updated Apr 1, 2026

Minimal reproduction of OneRec

Python 1,530 220 Updated Mar 31, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,030 1,395 Updated May 7, 2026

[NeurIPS 2025] | DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data

Python 48 Updated Dec 12, 2025

🔥 OneThinker: All-in-one Reasoning Model for Image and Video [CVPR 2026]

Python 436 31 Updated Feb 28, 2026

[CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation.

Python 82 1 Updated Feb 26, 2026
Python 11,188 758 Updated Feb 9, 2026

From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perception and reasoning in VLMs.

Python 84 1 Updated Jan 5, 2026

[3DV 2026] Dense Motion Captioning

Python 32 Updated Jan 28, 2026
Next