Methods To Dysfluent Speech Transcription And Detection

Tech ID: 33377 / UC Case 2024-062-0

Patent Status

Country	Type	Number	Dated	Case
United States Of America	Published Application	20250246187	07/31/2025	2024-062

Brief Description

Dysfluent speech modeling requires time-accurate and silence-aware transcription at both the word-level and phonetic-level. However, current research in dysfluency modeling primarily focuses on either transcription or detection, and the performance of each aspect remains limited.

To address this problem, UC Berkeley researchers have developed a new unconstrained dysfluency modeling (UDM) approach that addresses both transcription and detection in an automatic and hierarchical manner. Furthermore, a simulated dysfluent dataset called VCTK++ enhances the capabilities of UDM in phonetic transcription. The effectiveness and robustness of UDM in both transcription and detection tasks has been demonstrated experimentally.

UDM eliminates the need for extensive manual annotation by providing a comprehensive solution.

Advantages

Comprehensive solution
Automated, hierarchical approach
Demonstrated effectiveness

Suggested uses

Diagnosis of speech disorders in, e.g., hospitals
Evaluation of early language literacy is school settings
Speech and Language Pathology

Related Materials

Contact

Learn About UC TechAlerts - Save Searches and receive new technology matches

Methods To Dysfluent Speech Transcription And Detection

Patent Status

Brief Description

Advantages

Suggested uses

Related Materials

Contact

Inventors

Other Information

Categorized As

Methods To Dysfluent Speech Transcription And Detection

Patent Status

Brief Description

Advantages

Suggested uses

Related Materials

Share This

Contact

Inventors

Other Information

Categorized As

Related cases