This paper aims at evaluating the transcription accuracy of the Google-IT’s ASR service available in the OH Portal. Data on its performance are limited to good quality recordings. Thus, we focus on suboptimal authentic materials, encompassing non-standard conversational speech recorded in noisy environments. We carry out a quanti-qualitative analysis of the linguistics and extra-linguistic parameters affecting accuracy and error distribution. The preliminary results show higher Word Error Rates for non-standard speech and low-quality recordings. Moreover, we seek the error patterns that could ease the transcription correction process for the users.

DEL ROSSO, G.A., Brambilla, S. (2022). L’accuratezza della trascrizione ASR sul parlato non-standard. L’italiano nell’OH Portal. In L. Spreafico (a cura di), Studi AItLA: Fare linguistica applicata con le "digital humanities". Milano : Officina Ventuno.

L’accuratezza della trascrizione ASR sul parlato non-standard. L’italiano nell’OH Portal

Giovina Angela del Rosso
;
Silvia Brambilla
2022-01-01

Abstract

This paper aims at evaluating the transcription accuracy of the Google-IT’s ASR service available in the OH Portal. Data on its performance are limited to good quality recordings. Thus, we focus on suboptimal authentic materials, encompassing non-standard conversational speech recorded in noisy environments. We carry out a quanti-qualitative analysis of the linguistics and extra-linguistic parameters affecting accuracy and error distribution. The preliminary results show higher Word Error Rates for non-standard speech and low-quality recordings. Moreover, we seek the error patterns that could ease the transcription correction process for the users.
2022
DEL ROSSO, G.A., Brambilla, S. (2022). L’accuratezza della trascrizione ASR sul parlato non-standard. L’italiano nell’OH Portal. In L. Spreafico (a cura di), Studi AItLA: Fare linguistica applicata con le "digital humanities". Milano : Officina Ventuno.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11590/394463
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact