📝 Publications
Multimodal AIGC: Image Synthesis and Visual Forensics
- arXiv RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards
  Junyan Ye*, Leiqi Zhu*, Yuncheng Guo, Dongzhi Jiang, Zilong Huang, Yifan Zhang, Zhiyuan Yan, Haohuan Fu, Conghui He, Weijia Li.
- arXiv Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
  Junyan Ye*, Dongzhi Jiang*, Zihao Wang, Leqi Zhu, Zhenghao Hu, Zilong Huang, Jun He, Zhiyuan Yan, Jinghua Yu, Hongsheng Li, Conghui He, Weijia Li.
- NeurIPS 2025 Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation
  Siwei Wen*, Junyan Ye*, Peilin Feng, Hengrui Kang, Zichen Wen, Yize Chen, Jiang Wu, Wenjun Wu, Conghui He, Weijia Li.
- ICCV 2025 SkyDiffusion: Leveraging BEV Paradigm for Ground-to-Aerial Image Synthesis
  Junyan Ye*, Jun He, Weijia Li, Zhutao Lv, Yi Lin, Jinhua Yu, Haote Yang, Conghui He.
- arXiv UAE: Unified Multimodal Model as Auto-Encoder
  Zhiyuan Yan*, Kaiqing Lin*, Zongjian Li*, Junyan Ye*, Hui Han, Zhendong Wang, Hao Liu, Bin Lin, Hao Li, Xue Xu, Xinyan Xiao, Jingdong Wang, Haifeng Wang, Li Yuan.
Multimodal Large Language Models Benchmark
- NeurIPS 2025 DB BLINK-Twice: You see, but do you observe? A Reasoning Benchmark on Visual Perception
  Junyan Ye, Dongzhi Jiang, Jun He, Baichuan Zhou, Zilong Huang, Zhiyuan Yan, Hongsheng Li, Conghui He, Weijia Li.
- ICLR 2025 Spotlight LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
  Junyan Ye*, Baichuan Zhou*, Zilong Huang*, Junan Zhang*, Tianyi Bai*, Hengrui Kang, Jun He, Honglin Lin, Zihao Wang, Tong Wu, Zhizheng Wu, Yiping Chen, Dahua Lin, Conghui He, Weijia Li.
- AAAI 2025 UrBench: A Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios
  Baichuan Zhou*, Haote Yang*, Dairong Chen*, Junyan Ye*, Tianyi Bai, Jinhua Yu, Songyang Zhang, Dahua Lin, Conghui He, Weijia Li.
Cross-View Perception and Geo-localization
- ICCV 2025 Where am I? Cross-View Geo-localization with Natural Language Descriptions
  Junyan Ye, Honglin Lin*, Leyan Ou, Dairong Chen, Zihao Wang, Qi Zhu, Conghui He, Weijia Li.
- ECCV 2024 EP-BEV: Cross-view Image Geo-localization with Panorama-BEV Co-Retrieval Network
  Junyan Ye, Zhutao Lv, Weijia Li, Jinhua Yu, Haote Yang, Huaping Zhong, Conghui He.
- CVPR 2024 Highlight SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation
  Junyan Ye*, Qiyan Luo, Jinhua Yu, Huaping Zhong, Zhimeng Zheng, Conghui He, Weijia Li.