Bio

I am currently a sixth-year (final-year) Ph.D. student in CSE at OSU, advised by Prof. DeLiang Wang. Before coming to OSU, I received my bachelor's degree in electronic information engineering from the University of Science and Technology of China (USTC) in 2015. I will join the audio team at Facebook Reality Labs as a research scientist in 2021.

My research focuses on speech enhancement, speech separation, speech dereverberation, and deep learning. I am also interested in microphone array processing, audio-visual speech enhancement and separation, acoustic echo cancellation, and keyword spotting. I serve as a reviewer for IEEE/ACM Transactions on Audio, Speech, and Language Processing, IEEE Journal of Selected Topics in Signal Processing, IEEE Signal Processing Letters, IEEE Communications Letters, and Speech Communication.

Manuscripts

[5] K. Tan, X. Zhang, and D. L. Wang, "Real-Time Speech Enhancement for Mobile Communication Based on Dual-Channel Complex Spectral Mapping", submitted to IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), under review.

[4] K. Tan and D. L. Wang, "Compressing Deep Neural Networks for Efficient Speech Enhancement", submitted to IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), under review.

[3] K. Tan, X. Zhang, and D. L. Wang, "Deep Learning Based Real-Time Speech Enhancement for Dual-Microphone Mobile Phones", submitted to IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), under review.

[2] K. Tan, B. Xu, A. Kumar, E. Nachmani, and Y. Adi, "SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation", accepted by IEEE Signal Processing Letters (IEEE SPL).
Preprint   BibTeX   Demos

[1] E. W. Healy, K. Tan, E. M. Johnson, and D. L. Wang, "Real-Time Feasible Deep Learning Noise Reduction Improves Intelligibility for Hearing-Impaired Listeners", submitted to Journal of the Acoustical Society of America (JASA), under review.

Publications

Journal Articles

[4] K. Tan, Y. Xu, S.-X. Zhang, M. Yu, and D. Yu, "Audio-Visual Speech Separation and Dereverberation with a Two-Stage Multimodal Network", in IEEE Journal of Selected Topics in Signal Processing (IEEE JSTSP), vol. 14, pp. 542-553, 2020.
Paper   BibTeX   Demos

[3] K. Tan and D. L. Wang, "Learning Complex Spectral Mapping with Gated Convolutional Recurrent Networks for Monaural Speech Enhancement", in IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), vol. 28, pp. 380-390, 2020.
Paper   BibTeX

[2] P. Wang, K. Tan, and D. L. Wang, "Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling", in IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), vol. 28, pp. 39-48, 2020.
Paper   BibTeX

[1] K. Tan, J. Chen, and D. L. Wang, "Gated Residual Networks with Dilated Convolutions for Monaural Speech Enhancement", in IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), vol. 27, pp. 189-198, 2019.
Paper   BibTeX


Conference Papers

[10] K. Tan and D. L. Wang, "Improving Robustness of Deep Learning Based Monaural Speech Enhancement Against Processing Artifacts", in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6914-6918, 2020.
Paper   BibTeX

[9] H. Zhang, K. Tan, and D. L. Wang, "Deep Learning for Joint Acoustic Echo and Noise Cancellation with Nonlinear Distortions", in the 20th Annual Conference of the International Speech Communication Association (INTERSPEECH), pp. 4255-4259, 2019.
Paper   BibTeX

[8] P. Wang, K. Tan, and D. L. Wang, "Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling", in the 20th Annual Conference of the International Speech Communication Association (INTERSPEECH), pp. 471-475, 2019.
Paper   BibTeX

[7] K. Tan and D. L. Wang, "Complex Spectral Mapping with a Convolutional Recurrent Network for Monaural Speech Enhancement", in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6865-6869, 2019.
Paper   BibTeX

[6] K. Tan, X. Zhang, and D. L. Wang, "Real-Time Speech Enhancement Using an Efficient Convolutional Recurrent Network for Dual-Microphone Mobile Phones in Close-Talk Scenarios", in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5751-5755, 2019.
Paper   BibTeX

[5] Z.-Q. Wang, K. Tan, and D. L. Wang, "Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective", in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 71-75, 2019.
Paper   BibTeX

[4] K. Tan and D. L. Wang, "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement", in the 19th Annual Conference of the International Speech Communication Association (INTERSPEECH), pp. 3229-3233, 2018.
Paper   BibTeX

[3] K. Tan and D. L. Wang, "A Two-Stage Approach to Noisy Cochannel Speech Separation with Gated Residual Networks", in the 19th Annual Conference of the International Speech Communication Association (INTERSPEECH), pp. 3484-3488, 2018.
Paper   BibTeX

[2] K. Tan, J. Chen, and D. L. Wang, "Gated Residual Networks with Dilated Convolutions for Supervised Speech Separation", in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 21-25, 2018.
Paper   BibTeX

[1] S. Zhu, K. Tan, X. Zhang, Z. Liu, and B. Liu, "MICROST: A Mixed Approach for Heart Rate Monitoring During Intensive Physical Exercise Using Wrist-Type PPG Signals", in the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 2347-2350, 2015.
Paper   BibTeX

Presentations

[5] Slides Improving Robustness of Deep Learning Based Monaural Speech Enhancement Against Processing Artifacts, IEEE ICASSP (held virtually due to the COVID-19 pandemic), Barcelona, Spain, May 2020.

[4] Poster Complex Spectral Mapping with a Convolutional Recurrent Network for Monaural Speech Enhancement, IEEE ICASSP, Brighton, United Kingdom, May 2019.

[3] Slides Real-Time Speech Enhancement Using an Efficient Convolutional Recurrent Network for Dual-Microphone Mobile Phones in Close-Talk Scenarios, IEEE ICASSP, Brighton, United Kingdom, May 2019.

[2] Slides Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective, IEEE ICASSP, Brighton, United Kingdom, May 2019.

[1] Slides Gated Residual Networks with Dilated Convolutions for Supervised Speech Separation, IEEE ICASSP, Calgary, Alberta, Canada, Apr. 2018.

Work Experience

Jan. 2017 - present, Graduate Research Associate in the Perception and Neurodynamics Laboratory (PNL) at OSU, Columbus, OH, United States

May 2020 - Aug. 2020, Research Intern at Facebook Reality Labs (formerly Oculus Research), Redmond, WA, United States

May 2019 - Aug. 2019, Research Intern at Tencent AI Lab, Bellevue, WA, United States

Dec. 2018 - Jan. 2019, Research Intern at Elevoc Technology, Shenzhen, Guangdong, China

May 2018 - Aug. 2018, Research Intern at KITT.AI group - Baidu DuerOS, Bellevue, WA, United States

Apr. 2018 - May 2018, Research Intern at Elevoc Technology, Shenzhen, Guangdong, China

Teaching

GTA of CSE 6421 (Computer Architecture), OSU, Autumn 2016

GTA of CSE 3421 (Introduction to Computer Architecture), OSU, Autumn 2016

GTA of CSE 1110 (Introduction to Computing Technology), OSU, Spring 2016

GTA of CSE 1110 (Introduction to Computing Technology), OSU, Autumn 2015

Acknowledgments