Selected Publications
See the full list on my Google Scholar.
†Equal Contribution, *Corresponding Author(s)
These are the current publications from my Google Scholar profile, sorted by publication date.
-
Obstruction reasoning for robotic grasping
2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2026)
-
Action-guided generation of 3D functionality segmentation data
2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops …
-
A Self-Conditioned Representation Guided Diffusion Model for Realistic Text-to-LiDAR Scene Generation
2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2026)
-
The Eleventh NTIRE 2026 Efficient Super-Resolution Challenge Report
NTIRE Workshop and Challenges CVPR 2026
-
GLASS: Graph and Vision-Language Assisted Semantic Shape Correspondence
2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR …)
-
Efficient Encoder-Free Fourier-based 3D Large Multimodal Model
2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2026)
-
Universal 3D Shape Matching via Coarse-to-Fine Language Guidance
2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2026)
-
Masked Clustering Prediction for Unsupervised Point Cloud Pre-training
Annual AAAI Conference on Artificial Intelligence (Oral, AAAI)
-
Robust Single-Stage Fully Sparse 3D Object Detection via Detachable Latent Diffusion
Annual AAAI Conference on Artificial Intelligence (AAAI)
-
PISI: Physical Information Based Solver-Interactive Network Structure Reconstruction
Algorithms 18 (9), 584
-
PerLA: Perceptive 3D language assistant
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
-
Self-Supervised and Generalizable Tokenization for CLIP-Based 3D Understanding
arXiv preprint arXiv:2505.18819
-
Adversarial robustness for unified multi-modal encoders via efficient calibration
arXiv preprint arXiv:2505.11895
-
TVEG: Model Selection of the Time-Varying Exponential Family Distributions Graphical Models
IEEE Transactions on Network Science and Engineering
-
Free-form language-based robotic reasoning and grasping
IROS 2025
-
PointGAC: Geometric-Aware Codebook for Masked Point Modeling
Proceedings of the IEEE/CVF International Conference on Computer Vision …
-
Fully-Geometric Cross-Attention for Point Cloud Registration
International Conference on 3D Vision (3DV)
-
Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant
International Conference on 3D Vision (3DV)
-
Multimodal fusion SLAM with Fourier attention
IEEE Robotics and Automation Letters 10 (2), 1050-1057
-
GSTran: Joint Geometric and Semantic Coherence for Point Cloud Segmentation
ICPR 2024