Design Space for Voice-Based Professional Reporting

Jaakko Hakulinen, Tuuli Keskinen, Markku Turunen, Sanni Siltanen

Research output: Contribution to journalArticleScientificpeer-review

5 Downloads (Pure)

Abstract

Speech technology has matured so that voice-based reporting utilizing speech-to-text can be applied in various domains. Speech has two major benefits: it enables efficient reporting and speech input improves the quality of the reports since reporting can be done as a part of the workflow without delays between work and reporting. However, designing reporting voice user interfaces (VUIs) for professional use is challenging, as there are numerous aspects from technology to organization and language that need to be considered. Based on our experience in developing professional reporting VUIs with different stakeholders representing both commercial and public sector, we define a design space for voice-based reporting systems. The design space consists of 28 dimensions grouped into five categories: Language Processing, Structure of Reporting, Technical Limitations in the Work Domain, Interaction Related Aspects in the Work Domain, and Organization. We illustrate the design space by discussing four voice-based reporting systems, designed and implemented by us, and describing a design process that utilizes it. The design space enables designers to identify critical aspects of professional reporting VUIs and optimize those for their target domain. The design space can be used as a practical tool especially by designers with limited experience on speech technologies.
Original languageEnglish
Number of pages18
JournalMultimodal Technologies and Interaction
Volume5
Issue number3
DOIs
Publication statusPublished - 11 Jan 2021
Publication typeA1 Journal article-refereed

Publication forum classification

  • Publication forum level 1

ASJC Scopus subject areas

  • Computer Science(all)
  • Human-Computer Interaction
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Design Space for Voice-Based Professional Reporting'. Together they form a unique fingerprint.

Cite this