Discontinuity, Nonlinearity, and Complexity
Watermarking of Electronic Patient Record in Parkinson Disease Affected Speech: A Robust and Secure Audio Hiding Technique for Smart e-healthcare Application
Discontinuity, Nonlinearity, and Complexity 10(1) (2021) 117--134 | DOI:10.5890/DNC.2021.03.008
Aniruddha Kanhe , Aghila Gnanasekaran
Department of Electronics and Communication Engineering, National Institute of Technology Puducherry India
Department of Computer Science and Engineering, National Institute of Technology Puducherry India
Download Full Text PDF
Abstract
A cloud-based framework for security of electronic patient record, in telediagnosis of Parkinson's disease (PD) speech signal has been proposed in this paper. The security and authenticity of patient's personal information is achieved using proposed Discrete Cosine Transform-Singular Value Decomposition (DCT-SVD) based audio watermarking technique. The automatic classification of PD affected speech signal from healthy person's speech signal requires high computational accuracy and in this work, it is addressed by extracting various time frequency features for the Support Vector Machine (SVM) classifier. In the proposed framework the speech signal of PD suspected person is recorded by a mobile application. The personal information is securely embedded in this speech signal using proposed DCT-SVD based audio watermarking technique, before transmitting to cloud for automatic classification. To ensure the security of electronic patient record (EPR), proposed watermarking technique is tested against various signal processing attacks and the performance of classifier has been evaluated by computing classification sensitivity, specificity, accuracy and area under the curve of receiver operating characteristics (ROC). The theoretical proofs and experimental results, show that proposed framework provides telediagnosis of PD affect patients without compromising the security and privacy of patient's records. The classification accuracy of $89\%$ with high data payload of $6kbps$ is proposed leading to a smart and secure e-healthcare application.
References
-
[1]  |
Zhang, Y., Liu, H., Su, X., Jiang, P., and Wei, D. (2015), Remote mobile health
monitoring system based on smart phone and browser/server structure, {
Journal of healthcare engineering}, {\bf 6}, 717-738.
|
-
[2]  |
Archip, A., Botezatu, N., Serban, E., Herghelegiu, P.C., and Zala, A. (2016),
An iot based system for remote patient monitoring, { 2016 17th
International Carpathian Control Conference (ICCC)}, May, pp. 1-6.
|
-
[3]  |
Stach, C. and Mitschang, B. (2016), The secure data container: An approach to
harmonize data sharing with information security, { 2016 17th IEEE
International Conference on Mobile Data Management (MDM)}, 1,
pp. 292-297.
|
-
[4]  |
Baig, M.M. and Gholamhosseini, H. (2013), Smart health monitoring systems: An
overview of design and modeling, { Journal of Medical Systems}, {\bf
37}, 9898.
|
-
[5]  |
Dimitriou, T. and Ioannis, K. (2008), Security issues in biomedical wireless
sensor networks. { 2008 First International Symposium on Applied Sciences
on Biomedical and Communication Technologies}, 1-5.
|
-
[6]  |
Office for Civil Rights and United State Department of Health and Human Services, (2015), Medical
privacy.
|
-
[7]  |
vander Marck, M., Kalf, J., Sturkenboom, I., Nijkrake, M., Munneke, M., and
Bloem, B. (2009), Multidisciplinary care for patients with parkinsons
disease. { Parkinsonism and Related Disorders}, {\bf 15}, S219-S223,
proceedings of WFN XVIII World Congress on Parkinsons Disease and Related
Disorders.
|
-
[8]  |
Ho, A.K., Bradshaw, J.L., and Iansek, R. (2000), Volume perception in
parkinsonian speech, { Movement Disorders}, {\bf 15}, 1125-1131.
|
-
[9]  |
Sewall, G.K., Jiang, J., and Ford, C.N. (2006), Clinical evaluation of
parkinsons-related dysphonia, { The laryngoscope}, {\bf 116},
1740-1744.
|
-
[10]  |
Gamboa, J., Jimenez-Jimenez, F.J., Nieto, A., Montojo, J., Orti-Pareja, M.,
Molina, J.A., Garcia-Albea, E., and Cobeta, I. (1997), Acoustic voice
analysis in patients with parkinsons disease treated with dopaminergic
drugs, { Journal of Voice}, {\bf 11}, 314 - 320.
|
-
[11]  |
Huh, Y.E., Park, J., Suh, M.K., Lee, S.E., Kim, J., Jeong, Y., Kim, H.-T.,
and Cho, J.W. (2015), Differences in early speech patterns between parkinson
variant of multiple system atrophy and parkinson`s disease, { Brain and
language}, {\bf 147}, 14-20.
|
-
[12]  |
Tykalova, T., Rusz, J., Klempir, J., Cmejla, R., and Ruzicka, E. (2017),
Distinct patterns of imprecise consonant articulation among parkinsons
disease, progressive supranuclear palsy and multiple system atrophy, {
Brain and Language}, {\bf 165}, 1-9.
|
-
[13]  |
Shahbakhi, M., Far, D.T., and Tahami, E. (2014), Speech analysis for diagnosis
of parkinsons disease using genetic algorithm and support vector machine,
{ Journal of Biomedical Science and Engineering}, {\bf 7}, 147-156.
|
-
[14]  |
Tsanas, A., Little, M.A., McSharry, P.E., Spielman, J., and Ramig, L.O.
(2012), Novel speech signal processing algorithms for high-accuracy
classification of parkinsons disease, { IEEE Transactions on Biomedical
Engineering}, {\bf 59}, 1264-1271.
|
-
[15]  |
Zhang, Y. (2017), Can a smartphone diagnose parkinson disease? a deep neural
network method and telediagnosis system implementation, { Parkinsons
Disease}, {\bf 2017}.
|
-
[16]  |
Parah, S.A., Sheikh, J.A., Ahad, F., Loan, N.A., and Bhat, G.M. (2017),
Information hiding in medical images: a robust medical image watermarking
system for e-healthcare, { Multimedia Tools and Applications}, {\bf 76},
10599-10633.
|
-
[17]  |
Loan, N.A., Parah, S.A., Sheikh, J.A., Akhoon, J.A., and Bhat, G.M. (2017),
Hiding electronic patient record (epr) in medical images: A high capacity and
computationally efficient technique for e-healthcare applications, {
Journal of Biomedical Informatics}, {\bf 73}, 125-136.
|
-
[18]  |
Parah, S.A., Sheikh, J.A., Ahad, F., and Bhat, G. (2018), High capacity and
secure electronic patient record (epr) embedding in color images for iot
driven healthcare systems, { Internet of Things and Big Data Analytics
Toward Next-Generation Intelligence}, pp. 409-437, Springer.
|
-
[19]  |
Dey, N., Ashour, A.S., Chakraborty, S., Banerjee, S., Gospodinova, E.,
Gospodinov, M., and Hassanien, A.E. (2017), Watermarking in biomedical signal
processing, { Intelligent Techniques in Signal Processing for Multimedia
Security}, pp. 345-369, Springer.
|
-
[20]  |
Alhussein, M. and Muhammad, G. (2015), Watermarking of parkinson disease speech
in cloud-based healthcare framework, { International Journal of
Distributed Sensor Networks}, {\bf 11}, 264575.
|
-
[21]  |
Ali, Z., Imran, M., Abdul, W., and Shoaib, M. (2018), { An Innovative
Algorithm for Privacy Protection in a Voice Disorder Detection System}, pp.
228-233, Springer International Publishing.
|
-
[22]  |
Bender, W., Gruhl, D., Morimoto, N., and Lu, A. (1996), Techniques for data
hiding, { IBM Systems Journal}, {\bf 35}, 313-336.
|
-
[23]  |
Cvejic, N. and Seppanen, T. (2002), Increasing the capacity of lsb-based audio
steganography, { Multimedia Signal Processing, 2002 IEEE Workshop on},
Dec, pp. 336-338.
|
-
[24]  |
Bhowal, K., Bhattacharyya, D., JyotiPal, A., and Kim, T.-H. (2013), A ga based
audio steganography with enhanced security, { Telecommun. Syst.}, {\bf
52}, 2197-220.
|
-
[25]  |
Kanhe, A., Aghila, G., Kiran, C.S., Ramesh, C., Jadav, G., and Raj, M. (2015),
Robust audio steganography based on advanced encryption standards in temporal
domain, { Advances in Computing, Communications and Informatics (ICACCI),
2015 International Conference on}, Aug, pp. 1449-1453.
|
-
[26]  |
Erfani, Y. and Siahpoush, S. (2009), Robust audio watermarking using improved ts
echo hiding, { Digital Signal Processing}, {\bf 19}, 809-814.
|
-
[27]  |
Korzhik, V., Morales-Luna, G., and Fedyianin, I. (2013), Audio watermarking
based on echo hiding with zero error probability, { International Journal
of Computer Science and Applications, Technomathematics Research
Foundation}, {\bf 10}, 1-10.
|
-
[28]  |
Fallahpour, M. and Megias, D. (2015), Audio watermarking based on fibonacci
numbers, { IEEE/ACM Transactions on Audio, Speech, and Language
Processing}, {\bf 23}, 1273-1282.
|
-
[29]  |
Fallahpour, M. and Megias, D. (2014), Robust audio watermarking based on
fibonacci numbers, { 2014 10th International Conference on Mobile Ad-hoc
and Sensor Networks}, Dec, pp. 343-349.
|
-
[30]  |
Ahani, S., Ghaemmaghami, S., and Wang, Z.J. (2015), A sparse
representation-based wavelet domain speech steganography method, { Audio,
Speech, and Language Processing, IEEE/ACM Transactions on}, {\bf 23},
80-91.
|
-
[31]  |
Shah, P., Choudhari, P., and Sivaraman, S. (2008), Adaptive wavelet packet based
audio steganography using data history, { 2008 IEEE Region 10 and the
Third international Conference on Industrial and Information Systems}, Dec,
pp. 1-5.
|
-
[32]  |
Hu, H.-T. and Chang, J.-R. (2017), Efficient and robust frame-synchronized blind
audio watermarking by featuring multilevel dwt and dct, { Cluster
Computing}, {\bf 20}, 805-816.
|
-
[33]  |
Kanhe, A. and Aghila, G. (2018), A dct-svd-based speech steganography in voiced
frames. { Circuits, Systems, and Signal Processing}.
|
-
[34]  |
Hua, G., Huang, J., Shi, Y.Q., Goh, J., and Thing, V.L. (2016), Twenty years
of digital audio watermarking-a comprehensive review, { Signal
Processing}, {\bf 128}, 222 - 242.
|
-
[35]  |
Kanhe, A. and Gnanasekaran, A. (2018), A qim-based energy modulation scheme for
audio watermarking robust to synchronization attack, { Arabian Journal for
Science and Engineering}.
|
-
[36]  |
Hu, H.-T., Lin, S.-J., and Hsu, L.-Y. (2017), Effective blind speech
watermarking via adaptive mean modulation and package synchronization in dwt
domain, { EURASIP Journal on Audio, Speech, and Music Processing}, {\bf
2017}, 10.
|
-
[37]  |
Kaur, A. and Dutta, M.K. (2017), An optimized high payload audio watermarking
algorithm based on lu-factorization, { Multimedia Systems}.
|
-
[38]  |
Elshazly, A.R., Nasr, M.E., Fouad, M.M., and Abdel-Samie, F.S. (2017), High
payload multi-channel dual audio watermarking algorithm based on discrete
wavelet transform and singular value decomposition, { Int. J. Speech
Technol.}, {\bf 20}, 951-958.
|
-
[39]  |
Sakar, B.E., Isenkul, M.E., Sakar, C.O., Sertbas, A., Gurgen, F., Delil, S.,
Apaydin, H., and Kursun, O. (2013), Collection and analysis of a parkinson
speech dataset with multiple types of sound recordings, { IEEE Journal of
Biomedical and Health Informatics}, {\bf 17}, 828-834.
|
-
[40]  |
Little, M.A., McSharry, P.E., Hunter, E.J., Spielman, J., Ramig, L.O.,
etal. (2009), Suitability of dysphonia measurements for telemonitoring of
parkinsons disease, { IEEE transactions on biomedical engineering},
{\bf 56}, 1015-1022.
|
-
[41]  |
Woldert-Jokisz, B. (2007), Saarbruecken voice database, { Institute of
Phonetics, Saarland University}.
|
-
[42]  |
KBhat, V., Sengupta, I., and Das, A. (2011), A new audio watermarking scheme
based on singular value decomposition and quantization, { Circuits,
Systems, and Signal Processing}, {\bf 30}, 915-927.
|
-
[43]  |
Ali, A.-H. (2014), An imperceptible and robust audio watermarking algorithm.
{ EURASIP Journal on Audio, Speech, and Music Processing}, {\bf 2014},
37.
|
-
[44]  |
Chang, C.-C. and Lin, C.-J. (2011), Libsvm: A library for support vector
machines, { ACM Transactions on Intelligent Systems and Technology},
{\bf 2}, 27:1-27:27.
|
-
[45]  |
Polat, K. (2012), Classification of parkinsons disease using feature weighting
method on the basis of fuzzy c-means clustering, { International Journal
of Systems Science}, {\bf 43}, 597-609.
|
-
[46]  |
Chen, H.-L., Huang, C.-C., Yu, X.-G., Xu, X., Sun, X., Wang, G., and Wang,
S.-J. (2013), An efficient diagnosis system for detection of parkinsons
disease using fuzzy k-nearest neighbor approach, { Expert Systems with
Applications}, {\bf 40}, 263 - 271.
|
-
[47]  |
Daliri, M.R. (2013), Chi-square distance kernel of the gaits for the diagnosis
of parkinsons disease, { Biomedical Signal Processing and Control},
{\bf 8}, 66 - 70.
|
-
[48]  |
Ephraim, Y. and Malah, D. (1985), Speech enhancement using a minimum mean-square
error log-spectral amplitude estimator, { IEEE Transactions on Acoustics,
Speech, and Signal Processing}, {\bf 33}, 443-445.
|
-
[49]  |
Boll, S. (1979), Suppression of acoustic noise in speech using spectral
subtraction, { IEEE Transactions on Acoustics, Speech, and Signal
Processing}, {\bf 27}, 113-120.
|
-
[50]  |
Scalart, P. and Filho, J.V. (1996), Speech enhancement based on a priori signal
to noise estimation, { 1996 IEEE International Conference on Acoustics,
Speech, and Signal Processing Conference Proceedings}, May, vol.2, pp.
629-632 vol. 2.
|