I'm currently an AI and speech algorithm designer at Goodix Technology. From 2017 to 2021, I had been a doctoral researcher at the Electrical Engineering Department (ESAT), in the research group of Stadius Center for Dynamical Systems, Signal Processing, and Data Analytics (STADIUS) under the supervision of Prof. Toon van Watershoot, Prof. Luc Geurts and Prof. Peter Kuppens, in KU Leuven, Belgium. In 2015 and 2016, I received two M.Sc. degrees in Electronics Engineering and in Artificial Intelligence, respectively, from KU Leuven, Belgium. My research is focusing on speech and audio processing with deep learning, speech emotion recognition, real-time deep learning for edge devices.

Publications

Towards Learning Robust Contrastive Embeddings For Binaural Source Localization

Frontiers in Neuroinformatics 2022

Towards Learning Robust Contrastive Embeddings For Binaural Source Localization

Duowei Tang, Maja Taseska, and Toon van Waterschoot

Proposes a weakly-supervised contrastive embedding framework for binaural sound source localization, where a neural network learns low-dimensional representations of binaural cues that preserve spatial proximity, showing robustness to reverberation, noise, and limited training data.

End-to-end speech emotion recognition using a novel context-stacking dilated convolution neural network

EURASIP Journal on Audio, Speech, and Music Processing 2021

End-to-end speech emotion recognition using a novel context-stacking dilated convolution neural network

Duowei Tang, Peter Kuppens, Luc Geurts, and Toon van Waterschoot

Proposes an end-to-end speech emotion recognition system using dilated convolutional neural networks with context stacking (DiCCOSER-CS), which eliminates recurrent layers while achieving superior performance to LSTM-based models through its large receptive field and parallelizable architecture, demonstrating improved accuracy on both regression (arousal/valence prediction) and classification tasks.

Adieu recurrence? End-to-end speech emotion recognition using a context stacking dilated convolutional network

28th European Signal Processing Conference (EUSIPCO) 2020

Adieu recurrence? End-to-end speech emotion recognition using a context stacking dilated convolutional network

Duowei Tang, Peter Kuppens, Luc Geurts, and Toon van Waterschoot

Proposes a fully convolutional end-to-end speech emotion recognition model using dilated causal convolutions and context stacking, eliminating recurrent layers while outperforming LSTM-based approaches in capturing very long temporal dependencies and achieving superior arousal/valence prediction.

Supervised Contrastive Embeddings for Binaural Source Localization

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2019

Supervised Contrastive Embeddings for Binaural Source Localization

Duowei Tang, Maja Taseska, and Toon van Waterschoot

Proposes a supervised contrastive manifold learning method that learns robust embeddings preserving binaural sound source spatial relationships.

Experience

Feb. 2022 - Present

AI / Speech and Audio Processing Algorithm Designer Goodix Technology

Design efficient deep learning speech enhancement algorithms for the worldwide top-tier smartphones. Supervising and managing of internship projects.

Aug. 2017 - Oct. 2022

Ph.D. Candidate KU Leuven

Supervisor: Prof. dr. ir. Toon van Waterschoot, Prof. dr. Peter Kuppens, and Prof. dr. ir. Luc Geurts

Deep Learning for Audio/Speech Applications: A perspective on representation learning and sequence modelling.

Sept. 2015 - Aug. 2016

Internship Researcher UgenTec

Semi-supervised machine learning on partially labelled data, application to Polymerase Chain Reaction (PCR) data. Propose a semi-supervised model using k-means clustering and LSSVM.

Sept. 2014 - Aug. 2015

Internship Researcher NXP

Speaker recognition algorithms. Compare, analysis and test state-of-the-art GMM and iVector based speaker recognition algorithms.