SSSC Lab.

Statistical Speech & Sound Computing Lab.

(ICASSP 2024) Congratulations on the Best Student Paper Award 👍

Kangwook Jang, Sungnyun Kim, Hoirin Kim, "STaR: Distilling Speech Temporal Relation for Lightweight Speech Self-Supervised Learning Models," ICASSP 2024, pp. 10721-10725, Apr. 2024.

https://ieeexplore.ieee.org/abstract/document/10447928/

Block diagrams of recent publications

[1] Y. Choi, Y. Jung, Y. Suh, and H. Kim, "Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech," IEEE Access, Vol. 10, pp. 52621-52629, 2022, doi: 10.1109/ACCESS.2022.3175810

https://ieeexplore.ieee.org/abstract/document/9775804

[2] Y. Jung, Y. Choi, H. Lim, and H. Kim, “A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments,” IEEE Access, Vol. 8, pp. 175448-175466, 2020, doi: 10.1109/ACCESS.2020.3025941

https://ieeexplore.ieee.org/document/9203835

Fig 1. Overview of the proposed perceptually guided TTS with MOS prediction.

Fig 2. Illustration of the proposed integrated model combining speech enhancement (SE), speaker verification, and VAD.

International Conferences (Recent 5 years)

Hyebin Ahn, Kangwook Jang, Hoirin Kim, “HuBERT-VIC: Improving Noise-Robust Automatic Speech Recognition of Speech Foundation Model via Variance-Invariance-Covariance Regularization,” Interspeech2025, pp. 3419-3423, 17-21 Aug. 2025. (ADD)

Minu Kim, Kangwook Jang and Hoirin Kim, “ParaNoise-SV: Integrated Approach for Noise-Robust Speaker Verification with Parallel Joint Learning of Speech Enhancement and Noise Extraction,” Interspeech2025, pp. 1103-1107, 17-21 Aug. 2025. (ADD)

Minu Kim, Kangwook Jang and Hoirin Kim, “Improving Cross-Lingual Phonetic Representation of Low-Resource Languages Through Language Similarity Analysis,” ICASSP 2025, Apr. 2025. (MSIT/NRF)

Sungnyun Kim*, Kangwook Jang*, Sangmin Bae, Hoirin Kim, Se-Young Yun, “Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition,” IEEE SLT 2024, pp. 457-464, 2-5 Dec. 2024. (MSIT/NRF) (* Equal Contribution)

Ji Sub Um, Hoirin Kim, “Utilizing Adaptive Global Response Normalization and Cluster-Based Pseudo Labels for Zero-Shot Voice Conversion,” Interspeech2024, pp. 2740-2744, 1-5 Sep. 2024. (MSIT/NRF)

Hyun Myung Kim, Kangwook Jang, Hoirin Kim, “One-Class Learning with Adaptive Centroid Shift for Audio Deepfake Detection,” Interspeech2024, pp. 4853-4857, 1-5 Sep. 2024. (MSIT/IITP)

Kangwook Jang, Sungnyun Kim, Hoirin Kim, "STaR: Distilling Speech Temporal Relation for Lightweight Speech Self-Supervised Learning Models," ICASSP 2024, pp. 10721-10725, Apr. 2024. (MSIT/NRF) (Best Student Paper Award)

Kangwook Jang*, Sungnyun Kim*, Se-Young Yun, Hoirin Kim, "Recycle-and-Distill: Universal Compression Strategy for Transformer-based Speech SSL Models with Attention Map Reusing and Masking Distillation," NeurIPS Workshop: Self-Supervised Learning Theory and Practice, Dec. 2023. (* Equal Contribution)

Myunghun Jung, Hoirin Kim, "AdaMS: Deep Metric Learning with Adaptive Margin and Adaptive Scale for Acoustic Word Discrimination," Interspeech 2023, pp. 3924-3928, Aug. 2023.

Kangwook Jang*, Sungnyun Kim*, Se-Young Yun, Hoirin Kim, "Recycle-and-Distill: Universal Compression Strategy for Transformer-based Speech SSL Models with Attention Map Reusing and Masking Distillation," Interspeech2023, pp. 316-320, Aug. 2023. (* Equal Contribution)

Myunghun Jung, Hoirin Kim, "Asymmetric Proxy Loss for Multi-View Acoustic Word Embeddings," Interspeech2022, pp. 5170-5174, Sep. 2022.

Yeonghyeon Lee*, Kangwook Jang*, Jahyun Goo, Youngmoon Jung, Hoirin Kim, "FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning," Interspeech2022, pp. 3588-3592, Sep. 2022. (MSIT/NRF) (* Equal Contribution)

Youngsik Eom, Yeonghyeon Lee, Ji Sub Um, Hoirin Kim, "Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck," Interspeech2022, pp. 3568-3572, Sep. 2022. (MSIT/IITP)

Ji Sub Um, Yeunju Choi, Hoirin Kim, "ACNN-VC: Utilizing Adaptive Convolution Neural Network for One-Shot Voice Conversion," Interspeech2022, pp. 2998-3002, Sep. 2022. (MSIT/NRF)

Yeunju Choi, Youngmoon Jung, Hoirin Kim, “Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning with Spoofing Detection and Spoofing Type Classification,” SLT2021, pp. 462-469, Jan. 2021. (Virtual) (MOTIE/KEIT)

Seong Min Kye, Joon Son Chung, Hoirin Kim, “Supervised Attention for Speaker Recognition,” SLT2021, pp. 286-293, Jan. 2021. (Virtual) (MSIT/IITP)

Joohyung Lee, Youngmoon Jung, Myunghun Jung, Hoirin Kim, “Dynamic Noise Embedding: Noise Aware Training and Adaptation for Speech Enhancement,” APSIPA2020, pp. 739-746, Dec. 2020. (Virtual) (ADD)

Joohyung Lee, Youngmoon Jung and Hoirin Kim, “Dual Attention in Time and Frequency Domain for Voice Activity Detection,” Interspeech2020, pp. 3670-3674, 29th Oct. 2020. (Virtual) (ADD)

Seong Min Kye, Youngmoon Jung, Hae Beom Lee, Sung Ju Hwang and Hoirin Kim, “Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs,” Interspeech2020, pp. 2982-2986, 28th Oct. 2020. (Virtual) (MOTIE/KEIT)

Yeunju Choi, Youngmoon Jung and Hoirin Kim, “Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling,” Interspeech2020, pp. 1743-1747, 27th Oct. 2020. (Virtual) (MOTIE/KEIT)

Youngmoon Jung, Seong Min Kye, Yeunju Choi, Myunghun Jung and Hoirin Kim, “Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances,” Interspeech2020, pp. 1501-1505, 27th Oct. 2020. (Virtual) (MOTIE/KEIT)

Myunghun Jung, Youngmoon Jung, Jahyun Goo and Hoirin Kim, “Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention,” Interspeech2020, pp. 931-935, 26th Oct. 2020. (Virtual) (ADD)

International Journals (Recent 5 years)

Yeunju Choi, Youngmoon Jung, Youngjoo Suh, Hoirin Kim, "Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech," IEEE Access, Vol. 10, pp. 52621-52629, May 2022.

Youngmoon Jung, Yeunju Choi, Hyungjun Lim, and Hoirin Kim, "A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments," IEEE Access, Vol. 8, pp. 175448-175466, Sep. 2020.

Hyungjun Lim, Younggwan Kim, and Hoirin Kim, “Cross-Informed Domain Adversarial Training for Noise-Robust Wake-up Word Detection,” IEEE SPL, Vol. 27, No. 11, pp. 1769-1773, Sep. 2020.

Hyungjun Lim, Younggwan Kim, Jahyun Goo, and Hoirin Kim, "Interlayer Selective Attention Network for Robust Personalized Wake-Up Word Detection," IEEE SPL, Vol. 27, No. 1, pp. 126-130, Jan. 2020.
