There are many speech-to-text (STT) solutions available today that could almost instantly convert clinician-patient voice conversations into summarised medical notes in the clinical system seamlessly, greatly enhancing the clinician's efficiency and productivity.
These STT solutions possess features such as built-in medical vocabularies for audio-to-text conversion, ambient dictation, summarising capability that leverages AI technology, natural language processing to generate summary notes, and the ability to support multi-language recognition. With added capability to integrate with Electronic Medical Record (EMR) systems such as Epic, this makes the adoption of STT solutions more appealing.
However, such STT solutions that leverage their proprietary Large Language Model (LLM) hosted at their native location for the processing of audio to text and summarising capabilities poses a challenge: Users may come from countries that have policies governing data sovereignty and data residency restrictions - particularly in clinical use cases where there is the possibility of personal identifiers being stored while processed by the system hosted overseas.