The Intelligent Voice system is a batch processing system designed to process hours of audio recordings as efficiently as possible. We also offer systems designed for real-time transcription, low latency key word spotting, IoT devices and embedded applications and more - for details on requirements for these please contact us.
Minimum requirements - single VM, no GPU acceleration
Intelligent Voice can be installed in a single Virtual machine without GPU acceleration. This system will be functionally identical to GPU accelerated systems but lower performance. Suitable applications include:
- Low volume batch processing (0-500 hours of audio recordings per day)
- Functional system evaluation and compatibility testing
- Software development and integration testing
- Test and QA systems
High performance single server
To get the best performance from Intelligent Voice we recommend servers with NVIDIA GPU cards.
An example system specification for processing up to 10,000 hours of audio recordings per day:
- 1x AMD EPYC 7302P
- 64 GB RAM
- 2x 480GB SSD
- 2x Tesla T4
An example server spec for processing up to 30,000 hours of audio recordings per day:
- 2x AMD EPYC 7402
- 256GB RAM
- 2x 1TB NVMe SSD
- 4x NVIDIA Tesla V100 32GB
Additional storage will be required for storage of audio data and outputs, size depending on formats and retention period.
Multiple GPU servers
Intelligent Voice can scale over any GPU servers. An example system for processing up to 100,000 hours per day:
1x application server:
- 2x AMD EPYC 7402
- 256GB RAM
- 2x 1TB NVMe SSD
5x GPU processing nodes:
- 1x AMD EPYC 7302P
- 128 GB RAM
- 2x 480GB SSD
- 4x Tesla T4
Larger clusters, high availability and geographical distribution
Intelligent Voice can scale to millions of audio hours per day and has options of high availability and synchronizing over multiple geographic regions - please contact us for information
Comments
0 comments
Please sign in to leave a comment.