Publications

Nabarun Goswami, Tatsuya Harada (2022). SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate. Proc. Interspeech 2022.

Nabarun Goswami (2021). Method and system to generate one or more multi-dimensional videos. Google Patents.

Nabarun Goswami (2021). Unsupervised Style Modeling for Text to Speech Synthesis.

Nabarun Goswami, Madhvesh Sulibhavi (2020). System and method for sharing multimedia content with synched playback controls. Google Patents.

Pramod Chintalapoodi, Nabarun Goswami, Hemant Sadhwani, Madhvesh Sulibhavi (2020). System and method for processing video content based on emotional state detection. Google Patents.

Nabarun Goswami, Madhvesh Sulibhavi, Pramod Chintalapoodi (2019). Device and method for generating a panoramic image. Google Patents.

Naoya Takahashi, Sudarsanam Parthasaarathy, Nabarun Goswami, Yuki Mitsufuji (2019). Recursive Speech Separation for Unknown Number of Speakers. Proc. Interspeech 2019.

Naoya Takahashi, Purvi Agrawal, Nabarun Goswami, Yuki Mitsufuji (2018). PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation. Proc. Interspeech 2018.

Naoya Takahashi, Nabarun Goswami, Yuki Mitsufuji (2018). MMDenselstm: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation. 2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC).

Saurav Sharma, Ram Padhy, Suman Kumar Choudhury, Nabarun Goswami, Pankaj Sa (2017). DenseNet with pre-activated deconvolution for estimating depth map from single image. AMMDS 2017, Workshop on Activity Monitoring by Multiple Distributed Sensing.

Nabarun Goswami, Trisha Kalita, Rituparna Devi (2012). Video Noise Reduction based on Statistical Modelling of Wavelet Coefficients. Bachelor Thesis, Tezpur University, Assam, India.