An efficient speech perceptual hashing authentication algorithm based on DWT and symmetric ternary string

Zhang Qiuyu; Xing Pengfei; Huang Yibo; Dong Ruihong; Yang Zhongping

doi:10.1504/IJICT.2018.10008897

An efficient speech perceptual hashing authentication algorithm based on DWT and symmetric ternary string

Zhang Qiuyu, Xing Pengfei, Huang Yibo, Dong Ruihong, Yang Zhongping

Lanzhou University of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

5 引用（Scopus）

摘要

According to the situation that speech perceptual hashing methods are not appropriated for real-time speech content authentication in mobile computing environment, a novel DWT-based perceptual hashing algorithm, which uses a combination of time-domain and frequency-domain features, was proposed to protect the speech data in the cloud. Firstly, by discrete wavelet transform (DWT), a new signal in frequency-domain is generated from the original speech signal after pre-processing and intensity-loudness transform (ILT). Secondly, coefficients of low frequency wavelet decomposition are partitioned into equal-sized and non-overlapping blocks, and logarithmic short-time energy of each block is computed to obtain speech signal’s features in frequency-domain. Finally, combined with spectral flux features (SFF) of speech signal in time-domain, a ternary perceptual hashing sequence is created. Experiment results illustrate that ternary form is better to stand for hash digest than binary form, the proposed algorithm has a good robustness against content preserving operations, discrimination, good compaction and high efficiency, and detects the tamper localisation as well.

源语言	英语
页（从-至）	31-50
页数	20
期刊	International Journal of Information and Communication Technology
卷	12
期	1-2
DOI	https://doi.org/10.1504/IJICT.2018.10008897
出版状态	已出版 - 2018
已对外发布	是

访问文件

10.1504/IJICT.2018.10008897

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{8d73a81348d64f01951b0ff8f7ea8ab2,

title = "An efficient speech perceptual hashing authentication algorithm based on DWT and symmetric ternary string",

abstract = "According to the situation that speech perceptual hashing methods are not appropriated for real-time speech content authentication in mobile computing environment, a novel DWT-based perceptual hashing algorithm, which uses a combination of time-domain and frequency-domain features, was proposed to protect the speech data in the cloud. Firstly, by discrete wavelet transform (DWT), a new signal in frequency-domain is generated from the original speech signal after pre-processing and intensity-loudness transform (ILT). Secondly, coefficients of low frequency wavelet decomposition are partitioned into equal-sized and non-overlapping blocks, and logarithmic short-time energy of each block is computed to obtain speech signal{\textquoteright}s features in frequency-domain. Finally, combined with spectral flux features (SFF) of speech signal in time-domain, a ternary perceptual hashing sequence is created. Experiment results illustrate that ternary form is better to stand for hash digest than binary form, the proposed algorithm has a good robustness against content preserving operations, discrimination, good compaction and high efficiency, and detects the tamper localisation as well.",

keywords = "Discrete wavelet transform, DWT, Perceptual hashing, Speech perceptual authentication, Symmetric ternary string, Tamper localisation",

author = "Zhang Qiuyu and Xing Pengfei and Huang Yibo and Dong Ruihong and Yang Zhongping",

note = "Publisher Copyright: Copyright {\textcopyright} 2018 Inderscience Enterprises Ltd.",

year = "2018",

doi = "10.1504/IJICT.2018.10008897",

language = "英语",

volume = "12",

pages = "31--50",

journal = "International Journal of Information and Communication Technology",

issn = "1466-6642",

publisher = "Inderscience Enterprises Ltd",

number = "1-2",

}

TY - JOUR

T1 - An efficient speech perceptual hashing authentication algorithm based on DWT and symmetric ternary string

AU - Qiuyu, Zhang

AU - Pengfei, Xing

AU - Yibo, Huang

AU - Ruihong, Dong

AU - Zhongping, Yang

PY - 2018

Y1 - 2018

N2 - According to the situation that speech perceptual hashing methods are not appropriated for real-time speech content authentication in mobile computing environment, a novel DWT-based perceptual hashing algorithm, which uses a combination of time-domain and frequency-domain features, was proposed to protect the speech data in the cloud. Firstly, by discrete wavelet transform (DWT), a new signal in frequency-domain is generated from the original speech signal after pre-processing and intensity-loudness transform (ILT). Secondly, coefficients of low frequency wavelet decomposition are partitioned into equal-sized and non-overlapping blocks, and logarithmic short-time energy of each block is computed to obtain speech signal’s features in frequency-domain. Finally, combined with spectral flux features (SFF) of speech signal in time-domain, a ternary perceptual hashing sequence is created. Experiment results illustrate that ternary form is better to stand for hash digest than binary form, the proposed algorithm has a good robustness against content preserving operations, discrimination, good compaction and high efficiency, and detects the tamper localisation as well.

AB - According to the situation that speech perceptual hashing methods are not appropriated for real-time speech content authentication in mobile computing environment, a novel DWT-based perceptual hashing algorithm, which uses a combination of time-domain and frequency-domain features, was proposed to protect the speech data in the cloud. Firstly, by discrete wavelet transform (DWT), a new signal in frequency-domain is generated from the original speech signal after pre-processing and intensity-loudness transform (ILT). Secondly, coefficients of low frequency wavelet decomposition are partitioned into equal-sized and non-overlapping blocks, and logarithmic short-time energy of each block is computed to obtain speech signal’s features in frequency-domain. Finally, combined with spectral flux features (SFF) of speech signal in time-domain, a ternary perceptual hashing sequence is created. Experiment results illustrate that ternary form is better to stand for hash digest than binary form, the proposed algorithm has a good robustness against content preserving operations, discrimination, good compaction and high efficiency, and detects the tamper localisation as well.

KW - Discrete wavelet transform

KW - DWT

KW - Perceptual hashing

KW - Speech perceptual authentication

KW - Symmetric ternary string

KW - Tamper localisation

UR - http://www.scopus.com/inward/record.url?scp=85042289967&partnerID=8YFLogxK

U2 - 10.1504/IJICT.2018.10008897

DO - 10.1504/IJICT.2018.10008897

M3 - 文章

AN - SCOPUS:85042289967

SN - 1466-6642

VL - 12

SP - 31

EP - 50

JO - International Journal of Information and Communication Technology

JF - International Journal of Information and Communication Technology

IS - 1-2

ER -

An efficient speech perceptual hashing authentication algorithm based on DWT and symmetric ternary string

摘要

访问文件

其它文件与链接

指纹

引用此