TY - GEN
T1 - Orthographic Comparison Revealed by Ambient Sentiment Classification
AU - Arunadevi, B.
AU - Saravanan, D.
AU - Villallba-Condori, Klinge
AU - Srivastava, Kriti
AU - Chakravarthi, M. Kalyan
AU - Rajan, Regin
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Several deep neural network versions have been successfully used to create parametric models that project variable-duration spoken word segments onto fixed-size vector representations, also known as acoustic word embeddings (AWEs). However; it is uncertain to what extent the distance in the growing AWE space can evaluate word-form similarity. This study investigates whether acoustic embedding space distance is related to phonological dissimilarity. The performance of supervised techniques for AWEs with different neural architectures and learning aims to solve this question. AWE models are trained in controlled settings for two languages (German and Czech) and evaluate the embeddings on two tasks: word discrimination and phonological similarity. The findings reveal that (1) the embedding space distance weakly corresponds with phonological distance in the best circumstances, and (2) enhancing word discrimination performance does not always result in models that better reflect word phonological similarity. The results emphasize the need to reconsider existing intrinsic AWE evaluations.
AB - Several deep neural network versions have been successfully used to create parametric models that project variable-duration spoken word segments onto fixed-size vector representations, also known as acoustic word embeddings (AWEs). However; it is uncertain to what extent the distance in the growing AWE space can evaluate word-form similarity. This study investigates whether acoustic embedding space distance is related to phonological dissimilarity. The performance of supervised techniques for AWEs with different neural architectures and learning aims to solve this question. AWE models are trained in controlled settings for two languages (German and Czech) and evaluate the embeddings on two tasks: word discrimination and phonological similarity. The findings reveal that (1) the embedding space distance weakly corresponds with phonological distance in the best circumstances, and (2) enhancing word discrimination performance does not always result in models that better reflect word phonological similarity. The results emphasize the need to reconsider existing intrinsic AWE evaluations.
KW - acoustic word embeddings
KW - contrastive learning
KW - deep neural networks
KW - phonological similarity
UR - http://www.scopus.com/inward/record.url?scp=85125389567&partnerID=8YFLogxK
U2 - 10.1109/ICECA52323.2021.9675995
DO - 10.1109/ICECA52323.2021.9675995
M3 - Conference contribution
AN - SCOPUS:85125389567
T3 - Proceedings of the 5th International Conference on Electronics, Communication and Aerospace Technology, ICECA 2021
SP - 834
EP - 838
BT - Proceedings of the 5th International Conference on Electronics, Communication and Aerospace Technology, ICECA 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th International Conference on Electronics, Communication and Aerospace Technology, ICECA 2021
Y2 - 2 December 2021 through 4 December 2021
ER -