Intelligent Voice is delighted to announce the release of major new version 5.0 which brings many new advances, including:
Improved Speech Recognition
New, CTC-based Automatic Speech Recognition (nASR). Models using this offer improved accuracy over existing IV models.
Updated Diarization (Biometric Speaker Separation)
The previous i-vector based Diarization system is replaced by an x-vector based system for improved speaker separation accuracy across all languages.
Further advances in punctuation, now including commas and question marks in English, together with full stops (periods) across multiple languages.
NVIDIA Triton Inference Server support
New support for running AI workloads with the addition of NVIDIA Triton Inference Server (https://developer.nvidia.com/nvidia-triton-inference-server).
Faster waveforms in SmartTranscripts (preview feature)