[1]Wang Z#, Zheng Q*, Ma S, et al. End-to-End HOI Reconstruction Transformer with Graph-based Encoding[C]//Proceedings of the Computer Vision and Pattern Recognition Conference. 2025: 27706-27715.
[2]Mu C#, Feng D*, Zheng Q*, et al. A Robust and Efficient Visual-Inertial Initialization with Probabilistic Normal Epipolar Constraint[J]. IEEE Robotics and Automation Letters, 2025.
[3]Zheng Q, Liu D, Wang C, et al. Esceme: Vision-and-language navigation with episodic scene memory[J]. International Journal of Computer Vision, 2025, 133(1): 254-274.
[4]Zheng Q, Wang C, Wang D. Bypass network for semantics driven image paragraph captioning[J]. Computer Vision and Image Understanding, 2024, 249: 104154.
[5]Li K#, Yu B, Zheng Q, et al. Muep: A multimodal benchmark for embodied planning with foundation models [C]//Intemational Joint Conferences on Artificial Intelligence. IJCAI. 2024: 129-138.
[6]Zhang H, Liu D, Zheng Q, et al. Modeling video as stochastic processes for fine-grained video representation learning[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023: 2225-2234.
[7]Zheng Q, Gong M, You X, et al. A unified B-spline framework for scale-invariant keypoint detection[J]. International Journal of Computer Vision, 2022, 130(3): 777-799.