Select Publications
Preprints
2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework, http://dx.doi.org/10.48550/arxiv.2409.15357
2024, Mamba in Speech: Towards an Alternative to Self-Attention, http://arxiv.org/abs/2405.12609v6
2023, Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method, http://arxiv.org/abs/2311.07037v1
2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio, http://dx.doi.org/10.48550/arxiv.2310.10922
2023, Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling, http://dx.doi.org/10.48550/arxiv.2309.11983
2022, Improving Children's Speech Recognition by Fine-tuning Self-supervised Adult Speech Representations, http://arxiv.org/abs/2211.07769v1
2022, Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning, http://arxiv.org/abs/2210.10231v2