Publications

(2022). SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate. Proc. Interspeech 2022.

(2020). System and method for sharing multimedia content with synched playback controls. Google Patents.

(2020). System and method for processing video content based on emotional state detection. Google Patents.

(2019). Device and method for generating a panoramic image. Google Patents.

(2019). Recursive Speech Separation for Unknown Number of Speakers. Proc. Interspeech 2019.

(2018). PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation. Proc. Interspeech 2018.

(2018). MMDenseLSTM: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation. 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC).

(2017). DenseNet with pre-activated deconvolution for estimating depth map from single image. AMMDS 2017, Workshop on Activity Monitoring by Multiple Distributed Sensing.

(2012). Video Noise Reduction based on Statistical Modelling of Wavelet Coefficients. Bachelor Thesis, Tezpur University, Assam, India.
