Biography

About Me 🪪

Homepage Google Scholar GitHub Gmail

I’m currently a Machine Learning Engineer at ByteDance, working mainly on the research and development of Vision-Language Models for e-commerce safety. I received my Master of Science in Engineering in June 2024 from the MCG Group, Department of Computer Science and Technology, Nanjing University, under the supervision of Assoc. Prof. Jie Tang. I also received my Bachelor of Science in Computer Science and Technology from Nanjing University in June 2021.

My research interests include Computer Vision, Multimodal Deep Learning and Generative Deep Learning, with a recent focus on Visual Object Tracking (VOT), Vision-Language Models and Generative Models.

News 🔥

  • [ 2025.09.19 ] 🎉 MERIT has been accepted by NeurIPS 2025! Code and Dataset are available now.
  • [ 2025.06.12 ] 🤗 We propose MERIT, the first multilingual dataset for interleaved multi-condition semantic retrieval, comprising 320,000 queries with 135,000 products in 5 languages and covering 7 distinct product categories. We also introduce Coral, a novel fine-tuning framework that adapts pre-trained MLLMs for embedding extraction. The arXiv paper and Project Page are available now.
  • [ 2024.03.21 ] 📖 A Zhihu blog post explaining the main ideas of the paper has been published.
  • [ 2023.10.18 ] 📄 Both the CVF and arXiv versions of ROMTrack have been updated! ROMTrack is a tracker built on a newly proposed object modeling paradigm that significantly improves robustness. Code is available now.
  • [ 2023.07.14 ] 🎉 Good news! One paper, abbreviated as ROMTrack, has been accepted by ICCV 2023.

Publications 📝

Academic Services 💼

  • Journal Review:
    • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
    • IEEE Transactions on Multimedia (TMM)
    • IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
    • ACM Transactions on Multimedia Computing, Communications and Applications (TOMM)
    • Journal of Visual Communication and Image Representation (JVCIR)
  • Conference Review:
    • IEEE International Conference on Computer Vision (ICCV)
  • Teaching Assistant:
    • Introduction to Computer System (ICS)
    • Multimedia Technology

Education 🎓

  • 2021.9 - 2024.6: M.Sc., Nanjing University, Nanjing.
  • 2017.9 - 2021.6: B.Sc., Nanjing University, Nanjing.
    • Department of Computer Science and Technology.
    • 2020.9 - 2021.6: Research on Visual Object Tracking, supervised by Prof. Liming Wang.
  • 2012.9 - 2017.6: Tianyi High School, Jiangsu.
    • Both junior and senior high school.

Experience 🖥️

  • 2024.7 - Present: Machine Learning Engineer (MLE) - Multimodal.
    • Governance and Experience, Global E-commerce, Data, ByteDance, Shanghai.
    • Mainly focused on the research and development of Vision-Language Models for e-commerce safety.
  • 2023.6 - 2023.9: Machine Learning Engineer (MLE) Intern - Computer Vision.
    • Alimama, Taobao & Tmall Group, Alibaba Group, Hangzhou.
    • Mainly focused on the research and development of Multimodal & AIGC algorithms.

Honors and Awards 🏅

  • Outstanding Graduate Student of Nanjing University, 2024.
  • Tencent Scholarship, 2024.
  • Academic Scholarship of Nanjing University, 2021 & 2022 & 2023.
    • 2021 (First Prize) & 2022 (Second Prize) & 2023 (Second Prize).
  • People’s Scholarship of Nanjing University, 2018 & 2019 & 2020.
    • 2018 (Second Prize) & 2019 (First Prize) & 2020 (Second Prize).
  • Third Prize in Jiangsu Mathematical Modeling Competition, 2019.
  • Silver Medal in 12th China Southeast Mathematical Olympiad, 2015.

Contact 📫

Last updated on: 2025-09-19, Fri, 17:13 +0800