Publications

Journal Articles

ManiTaskGen: A Comprehensive Task Generator for Benchmarking and Improving Vision-Language Agents on Embodied Decision-Making

Submitted to CVPR 2026

This paper presents ManiTaskGen, a universal system that generates a comprehensive set of feasible mobile manipulation tasks for arbitrary scenes. The generated tasks support both benchmarking and improving embodied decision-making agents.

Recommended citation: Liu Dai*, Haina Wang*, Weikang Wan, and Hao Su. (2025). "ManiTaskGen: A Comprehensive Task Generator for Benchmarking and Improving Vision-Language Agents on Embodied Decision-Making." arXiv preprint arXiv:2505.20726.