The Intelligent Voice system is a batch processing system designed to process hours of audio recordings as efficiently as possible. We also offer systems designed for real-time transcription, low latency keyword spotting, IoT devices and embedded applications and more - for details on requirements for these please contact us.
Minimum Requirement: single Virtual Machine with no GPU acceleration
Intelligent Voice can be installed in a single Virtual machine without GPU acceleration. This system will be functionally identical to GPU accelerated systems but lower performance. Suitable applications include:
- Low volume batch processing (0-500 hours of audio recordings per day)
- Functional system evaluation and compatibility testing
- Software development and integration testing
- Test and QA systems
The minimum requirements are:
- 4 or more x86 vCPUs, Intel "Ivy Bridge" or later / AMD "Bulldozer" or later.
- at least 64Gb RAM
- at least 500GB storage
- Red Hat 7.9 or Ubuntu 20.04
Additional vCPUs and storage are required to increase performance.
For help on sizing larger installations please see below or contact us.
High Performance GPU Single Server
To get the best performance from Intelligent Voice we recommend servers with NVIDIA GPU cards.
An example system specifications suitable for production systems:
- 2 x NVIDIA Tesla T4
- 2 x 1TB SSD
- 128 GB RAM
- 1 x AMD EPYC 7302P (or equivalent)
An example server spec for larger installations:
- 4 x NVIDIA Tesla V100 32GB
- 2 x 2TB NVMe SSD
- 512GB RAM
- 2 x AMD EPYC 7402 (or equivalent)
Additional storage will be required for storage of audio data and outputs, with the size depending on audio / video file formats, and the required retention period.
NOTE: Tesla T4 cards are being discontinued. NVIDIA A2 cards are a suitable replacement - contact support for more detailed information.
High Performance GPU Multiple Servers
Intelligent Voice can scale over any number of GPU servers. An example system higher volume processing:
1 x application server:
- 2 x 2TB NVMe SSD
- 512GB RAM
- 2 x AMD EPYC 7402 (or equivalent)
5 x GPU processing node servers:
- 4 x NVIDIA Tesla T4
- 2 x 1TB SSD
- 256 GB RAM
- 1 x AMD EPYC 7302P (or equivalent)
Larger Clusters & Geographical Distribution
Intelligent Voice can scale to millions of audio hours per day and has options of synchronizing over multiple geographic regions - please contact us for information
Comments
0 comments
Article is closed for comments.