DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation

Add to Favorites

Poster_Demo_Industry_Findings In-person 3: Speech and Multimodality (Poster)

Conference Room: East Foyer

Conference Time: December 08, 16:00-17:30 (+08) (Asia/Singapore)

Global Time: December 08, Poster_Demo_Industry_Findings In-person 3 (08:00-09:30 UTC)

TLDR:

You can open the #paper-2952 channel in a separate window.

Abstract: