The rapid progress of large language models (LLMs) has catalyzed the emergence of multimodal large language models (MLLMs) that unify visual understanding and image generation within a single ...
Abstract: The human pose transfer task aims to generate synthetic person images that preserve the style of reference images while accurately aligning them with the desired target pose. However, ...
Abstract: Predicting traffic flow is a vital component of Intelligent Transportation Systems (ITS), aimed at enhancing urban traffic management and optimization efforts. However, accurate prediction ...