Selected Research Achievements |
Peer-reviewed papers / international conference proceedings (2020 onward only)
- Comparative Evaluation of Diverse Features in Fluency Evaluation of Spontaneous Speech, H. Deng, T. Utsuro, A. Kobayashi, H. Nishizaki, IEICE Trans. Information and Systems E106-D(1), pp. 36–45, 2023.
- Automatic Selection of Appropriate Data Augmentation Operation for Acoustic Scene Classification Model Training, T. Sugiura, A. Kobayashi, T. Utsuro, H. Nishizaki, pp. 355–358 2022.
- Neural Networks Using Multiplicative Features Based on Second-Order Statistics for Acoustic & Speech Applications, A. Kobayashi, 2022 Asia Conference on Advanced Robotics, Automation, and Control Engineering (ARACE), pp. 121–126, 2022.
- End-to-End Speech to Braille Translation in Japanese, A. Kobayashi, J. Onishi, H. Nishizaki, N. Kitaoka, 2022 IEEE International Conference on Consumer Electronics (ICCE), 2022.
- Comparison of Static and Time-Sequential Features in Automatic Fluency Detection of Spontaneous Speech, H. Deng, T. Utsuro, A. Kobayashi, H. Nishizaki, 24th Conference of the Oriental COCOSDA, pp. 158–163, 2021.
- Corpus Design and Automatic Speech Recognition for Deaf and Hard-of-Hearing People, A. Kobayashi, K. Yasu, H. Nishizaki, N. Kitaoka, 2021 IEEE 10th Global Conference on Consumer Electronics (GCCE), 2021.
- Audio Synthesis-based Data Augmentation Considering Audio Event Class, T. Sugiura, A. Kobayashi, T. Utsuro, H. Nishizaki, 2021 IEEE 10th Global Conference on Consumer Electronics (GCCE), 2021.
- ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi, Y. Wang, C.-S. Leow, A. Kobayashi, T. Utsuro, H. Nishizaki, 2021 IEEE 10th Global Conference on Consumer Electronics (GCCE), pp. 346–350, 2021.
- Language and Speaker-Independent Feature Transformation for End-to-End Multilingual Speech Recognition, T. Hayakawa, C.-S. Leow, A. Kobayashi, T. Utsuro, H. Nishizaki, Interspeech 2021 pp. 2431–2435 2021.
- Voice Activity Detection for Live Speech of Baseball Game Based on Tandem Connection with Speech/Noise Separation Model, Y. Nonaka, C.-S. Leow, A. Kobayashi, T. Utsuro, H. Nishizaki, Interspeech 2021, pp. 351–355, 2021.
- Speech Enhancement for Demodulated Signals under Multipath Fading Communication Channels, A. Kobayashi, 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) pp. 460–464 2020.
- Spoken Dialog Training System for Customer Service Improvement, Y. Sano, C.-S. Leow, S. Iida, T. Utsuro, J. Hoshino, A. Kobayashi, H. Nishizaki, 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) pp. 403–408, 2020.
- ExKaldi: A Python-Based Extension Tool of Kaldi, Y. Wang, C.-S. Leow, H. Nishizaki, A. Kobayashi, T. Utsuro, 2020 IEEE 9th Global Conference on Consumer Electronics, pp. 470–473, 2020.
- Integrating Disfluency-based and Prosodic Features with Acoustics in Automatic Fluency Evaluation of Spontaneous Speech, H. Deng, Y. Lin, T. Utsuro, A. Kobayashi, H. Nishizaki, J. Hoshino, Proc. the 12th Language Resources and Evaluation Conference, pp. 6429–6437, 2020.
- Automatic Fluency Evaluation of Spontaneous Speech Using Disfluency-Based Features, H. Deng, Y. Lin, T. Utsuro, A. Kobayashi, H. Nishizaki, J. Hoshino, Proc. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 9239–9243, 2020.
|