Transport Object Detection in Street View Imagery Using Decomposed Convolutional Neural Networks

Yunpeng Bai; Changjing Shang; Ying Li; Liang Shen; Shangzhu Jin; Qiang Shen

doi:10.3390/math11183839

School of Computer Science

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

Deep learning has achieved great successes in performing many visual recognition tasks, including object detection. Nevertheless, existing deep networks are computationally expensive and memory intensive, hindering their deployment in resource-constrained environments, such as mobile or embedded devices that are widely used by city travellers. Recently, estimating city-level travel patterns using street imagery has been shown to be a potentially valid way according to a case study with Google Street View (GSV), addressing a critical challenge in transport object detection. This paper presents a compressed deep network using tensor decomposition to detect transport objects in GSV images, which is sustainable and eco-friendly. In particular, a new dataset named Transport Mode Share-Tokyo (TMS-Tokyo) is created to serve the public for transport object detection. This is based on the selection and filtering of 32,555 acquired images that involve 50,827 visible transport objects (including cars, pedestrians, buses, trucks, motors, vans, cyclists and parked bicycles) from the GSV imagery of Tokyo. Then a compressed convolutional neural network (termed SVDet) is proposed for street view object detection via tensor train decomposition on a given baseline detector. The method proposed herein yields a mean average precision (mAP) of 77.6% on the newly introduced dataset, TMS-Tokyo, necessitating just 17.29 M parameters and a computational capacity of 16.52 G FLOPs. As such, it markedly surpasses the performance of existing state-of-the-art methods documented in the literature.

Original language	English
Article number	3839
Journal	Mathematics
Volume	11
Issue number	18
DOIs	https://doi.org/10.3390/math11183839
State	Published - Sep 2023

Keywords

convolutional neural networks
street-view object detection
tensor train decomposition

Cite this

@article{afcdb915892740539e9e3f5cbeba7658,

title = "Transport Object Detection in Street View Imagery Using Decomposed Convolutional Neural Networks",

abstract = "Deep learning has achieved great successes in performing many visual recognition tasks, including object detection. Nevertheless, existing deep networks are computationally expensive and memory intensive, hindering their deployment in resource-constrained environments, such as mobile or embedded devices that are widely used by city travellers. Recently, estimating city-level travel patterns using street imagery has been shown to be a potentially valid way according to a case study with Google Street View (GSV), addressing a critical challenge in transport object detection. This paper presents a compressed deep network using tensor decomposition to detect transport objects in GSV images, which is sustainable and eco-friendly. In particular, a new dataset named Transport Mode Share-Tokyo (TMS-Tokyo) is created to serve the public for transport object detection. This is based on the selection and filtering of 32,555 acquired images that involve 50,827 visible transport objects (including cars, pedestrians, buses, trucks, motors, vans, cyclists and parked bicycles) from the GSV imagery of Tokyo. Then a compressed convolutional neural network (termed SVDet) is proposed for street view object detection via tensor train decomposition on a given baseline detector. The method proposed herein yields a mean average precision (mAP) of 77.6% on the newly introduced dataset, TMS-Tokyo, necessitating just 17.29 M parameters and a computational capacity of 16.52 G FLOPs. As such, it markedly surpasses the performance of existing state-of-the-art methods documented in the literature.",

keywords = "convolutional neural networks, street-view object detection, tensor train decomposition",

author = "Yunpeng Bai and Changjing Shang and Ying Li and Liang Shen and Shangzhu Jin and Qiang Shen",

note = "Publisher Copyright: {\textcopyright} 2023 by the authors.",

year = "2023",

month = sep,

doi = "10.3390/math11183839",

language = "English",

volume = "11",

journal = "Mathematics",

issn = "2227-7390",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "18",

}

TY - JOUR

T1 - Transport Object Detection in Street View Imagery Using Decomposed Convolutional Neural Networks

AU - Bai, Yunpeng

AU - Shang, Changjing

AU - Li, Ying

AU - Shen, Liang

AU - Jin, Shangzhu

AU - Shen, Qiang

PY - 2023/9

Y1 - 2023/9

AB - Deep learning has achieved great successes in performing many visual recognition tasks, including object detection. Nevertheless, existing deep networks are computationally expensive and memory intensive, hindering their deployment in resource-constrained environments, such as mobile or embedded devices that are widely used by city travellers. Recently, estimating city-level travel patterns using street imagery has been shown to be a potentially valid way according to a case study with Google Street View (GSV), addressing a critical challenge in transport object detection. This paper presents a compressed deep network using tensor decomposition to detect transport objects in GSV images, which is sustainable and eco-friendly. In particular, a new dataset named Transport Mode Share-Tokyo (TMS-Tokyo) is created to serve the public for transport object detection. This is based on the selection and filtering of 32,555 acquired images that involve 50,827 visible transport objects (including cars, pedestrians, buses, trucks, motors, vans, cyclists and parked bicycles) from the GSV imagery of Tokyo. Then a compressed convolutional neural network (termed SVDet) is proposed for street view object detection via tensor train decomposition on a given baseline detector. The method proposed herein yields a mean average precision (mAP) of 77.6% on the newly introduced dataset, TMS-Tokyo, necessitating just 17.29 M parameters and a computational capacity of 16.52 G FLOPs. As such, it markedly surpasses the performance of existing state-of-the-art methods documented in the literature.

KW - convolutional neural networks

KW - street-view object detection

KW - tensor train decomposition

UR - http://www.scopus.com/inward/record.url?scp=85176452735&partnerID=8YFLogxK

U2 - 10.3390/math11183839

DO - 10.3390/math11183839

M3 - Article

AN - SCOPUS:85176452735

SN - 2227-7390

VL - 11

JO - Mathematics

JF - Mathematics

IS - 18

M1 - 3839

ER -
