publications

2025

  1. Preprint
    ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning
    arXiv preprint arXiv:2507.16815, 2025
  2. ICCV 2025
    Continual Personalization for Diffusion Models
    Yu-Chien Liao, Jr-Jen Chen, Chi-Pin Huang, Ci-Siang Lin, Meng-Lin Wu, and Yu-Chiang Frank Wang
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025
  3. ICCVW 2025
    MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching
    Yen-Siang Wu, Chi-Pin Huang, Fu-En Yang, and Yu-Chiang Frank Wang
    ICCV 2025 Workshop on P13N: Personalization in Generative AI, 2025
  4. CVPR 2025
    VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025

2024

  1. ECCV 2024
    Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers
    In European Conference on Computer Vision (ECCV), 2024
  2. ECCV 2024
    Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models
    In European Conference on Computer Vision (ECCV), 2024
  3. ICLR 2024
    Rapper: Reinforced Rationale-Prompted Paradigm for Natural Language Explanation in Visual Question Answering
    In The Twelfth International Conference on Learning Representations (ICLR), 2024