Supported audio formats
SpeechSense allows you to upload audio files in these formats:
LPCM
Linear pulse-code modulation
LPCM audio requirements:
- Sampling rate: Between 8 kHz and 48 kHz.
- Bit depth: 16 bit.
- Byte order: Little-endian (least significant byte first).
- Audio data is stored as signed integers.
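As an illustration of the requirements above, here is a minimal Python sketch that encodes a test tone as raw LPCM bytes. The function name `sine_to_lpcm`, the 16 kHz rate, and the mono layout are assumptions made for this example and are not part of SpeechSense. Only the 16-bit depth, signed integers, little-endian byte order, and the 8 kHz to 48 kHz range come from the list above.

```python
import math
import struct

# Assumed value for the example; any rate from 8000 to 48000 Hz fits the stated range.
SAMPLE_RATE_HZ = 16000


def sine_to_lpcm(frequency_hz: float, duration_s: float,
                 sample_rate_hz: int = SAMPLE_RATE_HZ) -> bytes:
    """Encode a sine tone as raw 16-bit signed little-endian LPCM (mono, by assumption)."""
    if not 8000 <= sample_rate_hz <= 48000:
        raise ValueError("sampling rate must be between 8 kHz and 48 kHz")
    n_samples = int(duration_s * sample_rate_hz)
    # Scale to the signed 16-bit range [-32767, 32767].
    samples = [
        int(32767 * math.sin(2 * math.pi * frequency_hz * i / sample_rate_hz))
        for i in range(n_samples)
    ]
    # "<" selects little-endian byte order, "h" a signed 16-bit integer.
    return struct.pack(f"<{n_samples}h", *samples)


pcm = sine_to_lpcm(440.0, 0.5)
```

Each sample takes two bytes, so half a second at 16 kHz produces 16,000 bytes of audio data.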
WAV
The LPCM format requirements also apply to WAV audio files. SpeechSense recognizes a WAV file only if its audio is LPCM-encoded and meets those requirements.
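A local pre-upload check along these lines may help catch unsupported WAV files early. This is a sketch using Python's standard `wave` module, which reads PCM-encoded WAV files and raises `wave.Error` for encodings it does not support. The helper name `wav_meets_lpcm_requirements` is hypothetical, and the check only approximates the stated requirements; it is not how SpeechSense itself validates files.

```python
import wave


def wav_meets_lpcm_requirements(path: str) -> bool:
    """Return True if the WAV file is PCM-encoded, 16-bit, and sampled at 8-48 kHz."""
    try:
        with wave.open(path, "rb") as wav:
            sample_width_bytes = wav.getsampwidth()  # 2 bytes means 16-bit samples
            sample_rate_hz = wav.getframerate()
    except (wave.Error, EOFError):
        # Not a readable PCM WAV file (wrong encoding, bad header, or truncated data).
        return False
    return sample_width_bytes == 2 and 8000 <= sample_rate_hz <= 48000
```

Byte order and signedness are not checked separately here, because 16-bit PCM data in a WAV container is stored as signed little-endian integers by the format's definition.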
MP3
SpeechSense recognizes MP3 files with no restrictions on audio quality or file headers.
OggOpus
SpeechSense recognizes OggOpus files with no restrictions on audio quality or file headers.