Select Publications

Preprints

Nan Z; Dang T; Sethu V; Ahmed B, 2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework, http://dx.doi.org/10.48550/arxiv.2409.15357

Zhang X; Zhang Q; Liu H; Xiao T; Qian X; Ahmed B; Ambikairajah E; Li H; Epps J, 2024, Mamba in Speech: Towards an Alternative to Self-Attention, http://arxiv.org/abs/2405.12609v6

Shahin M; Epps J; Ahmed B, 2023, Phonological Level wav2vec2-based Mispronunciation Detection and Diagnosis Method, http://arxiv.org/abs/2311.07037v1

Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio, http://dx.doi.org/10.48550/arxiv.2310.10922

Nan Z; Dang T; Sethu V; Ahmed B, 2023, Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling, http://dx.doi.org/10.48550/arxiv.2309.11983

Lu R; Shahin M; Ahmed B, 2022, Improving Children's Speech Recognition by Fine-tuning Self-supervised Adult Speech Representations, http://arxiv.org/abs/2211.07769v1

Shahin M; Ahmed B; Epps J, 2022, Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning, http://arxiv.org/abs/2210.10231v2


Back to profile page