Research Directions
Five sub-directions centered on world models for embodied interaction
Scene Generation
Build a unified world model for navigation and manipulation, addressing weak long-horizon action generation and instruction understanding.
Action Generation
Build a unified action understanding and generation model to address instruction generalization and action-vision alignment.
Multimodal Cognition
Fuse vision, language, and spatial perception to tackle interaction-level 4D cognition and semantic understanding.
Geometry Reconstruction
Predict geometry from multimodal inputs to enable fast reconstruction in dynamic scenes.
Embodied Navigation
Use egocentric perception for autonomous navigation, solving localization and decision-making in complex environments.

