Abstract
Most human synthesis schemes use high-performance servers, so the user interaction experience of mobile devices is not satisfied. Viewing human synthesis results on smartphones directly increases user interaction and enhances user experience. This paper proposes a smart frame selection network (SFSN) on mobile devices to reduce the traffic between smartphones and cloud. We leverage the attention and relationship model to focus on the relationship between a single frame and the entire video, which can better select important frames, thus reducing the traffic and computing effectively. In addition, we build a multi-task human synthesis system based on SFSN to process the generation tasks such as background changing, pose transfer and virtual try-on in a unified framework. Evaluation results indicate proposed approach reduces the number of frames to be processed by more than 42.2%.
| Original language | English |
|---|---|
| Pages (from-to) | 4655-4668 |
| Number of pages | 14 |
| Journal | Wireless Networks |
| Volume | 30 |
| Issue number | 6 |
| DOIs | |
| State | Published - Aug 2024 |
Keywords
- Attention and relationship model
- Cloud-device collaborative
- Human image synthesis