Verification Models

Verification models analyze and compare the characteristics of a voice. They are used to perform verifications.

Version selection: the platform automatically selects the latest versions of verification models.

Identity Verification Models

Identity verification models measure the level of similarity between two voices by comparing a new voice sample with a previously created voiceprint. They are used to perform identity verifications.

Model selection: the platform automatically selects appropriate identity verification models for the chosen voiceprints. Every voiceprint model has a matching identity verification model.

Identity Model Descriptions

Model Name | Handle | Version | Status | Description
Digital | digital | v1.1 | ✅ Available | The digital identity verification model is designed and trained for higher-quality audio, with sample rates of 16 kHz or higher. English and Spanish make up a significant part of its training dataset.
Phone | phone | v1.1 | ✅ Available | The phone identity verification model is designed and trained for phone audio, with sample rates of 8 kHz or higher. The telephone channel and the English and Spanish languages make up a significant part of its training dataset.
Default | default | v1 | ⚠️ Deprecated | The default identity verification model is deprecated and will be removed in a future platform release.

Identity Model Requirements

Model Name | Audio Max Duration | Audio Min Voice Duration | Audio Min Frequency | Audio File State | Voiceprint Model | Voiceprint Model Version | Voiceprint State
Digital | 2 minutes | 1.5 seconds | 16000 Hz | Available | digital | v1 | Computed
Phone | 2 minutes | 1.5 seconds | 8000 Hz | Available | phone | v1 | Computed
Default | 2 minutes | 1.5 seconds | 16000 Hz | Available | default | v1 | Computed
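
The numeric requirements above can be checked before an audio file is submitted. The following is a minimal sketch, not part of the platform: the names `IdentityRequirements`, `IDENTITY_REQUIREMENTS` and `check_identity_audio` are hypothetical, "Audio Min Frequency" is read as the minimum sample rate, and the maximum duration is assumed to be inclusive. The file state and voiceprint state columns are not checked here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IdentityRequirements:
    max_duration_s: float        # Audio Max Duration
    min_voice_duration_s: float  # Audio Min Voice Duration
    min_sample_rate_hz: int      # Audio Min Frequency (read as sample rate)


# Values copied from the Identity Model Requirements table, keyed by model handle.
IDENTITY_REQUIREMENTS = {
    "digital": IdentityRequirements(120.0, 1.5, 16000),
    "phone": IdentityRequirements(120.0, 1.5, 8000),
    "default": IdentityRequirements(120.0, 1.5, 16000),  # deprecated model
}


def check_identity_audio(handle, duration_s, voice_duration_s, sample_rate_hz):
    """Return a list of problems; an empty list means the audio fits the table."""
    req = IDENTITY_REQUIREMENTS[handle]
    problems = []
    if duration_s > req.max_duration_s:  # assumption: the limit is inclusive
        problems.append(f"audio longer than {req.max_duration_s:g} s")
    if voice_duration_s < req.min_voice_duration_s:
        problems.append(f"less than {req.min_voice_duration_s:g} s of voice")
    if sample_rate_hz < req.min_sample_rate_hz:
        problems.append(f"sample rate below {req.min_sample_rate_hz} Hz")
    return problems
```

For example, `check_identity_audio("digital", 10.0, 3.0, 8000)` reports a sample rate problem, while the same audio passes for the `phone` handle.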

Authenticity Verification Models

Authenticity verification models analyze the authenticity and liveness of a voice. They are used to perform authenticity verifications.

Authenticity Model Descriptions

Model Name | Handle | Version | Status | Description
Digital | digital | v2 | ✅ Available | The digital authenticity verification model aims to determine whether the voice is real or has been generated or modified by AI (deepfake). It is the recommended model for every channel except the telephone channel. The digital channel and the English and Spanish languages make up a significant part of its training dataset.
Phone | phone | v2 | ✅ Available | The phone authenticity verification model aims to determine whether the voice is real or has been generated or modified by AI (deepfake). It is the recommended model for the telephone channel and is the only model that supports 8 kHz audio. English and Spanish make up a significant part of its training dataset.
Default | default | v10 | ⚠️ Deprecated | The default authenticity verification model is deprecated and will be removed in a future platform release.
Telephone | telephone | v6 | ⚠️ Deprecated | The telephone authenticity verification model is deprecated and will be removed in a future platform release.

Authenticity Model Requirements

Model Name | Chunk Duration | Audio Max Duration | Audio Min Frequency | File State | Audio Chunks Minimum Voice Duration
Digital | 4 seconds | 5 minutes | 16000 Hz | Available | 750 milliseconds
Phone | 4 seconds | 5 minutes | 8000 Hz | Available | 750 milliseconds
Default | 4 seconds | 5 minutes | 16000 Hz | Available | 750 milliseconds
Telephone | 4 seconds | 5 minutes | 8000 Hz | Available | 750 milliseconds
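
To get a feel for how the chunk figures interact, the sketch below estimates the chunk windows for an audio of a given duration. How the platform actually splits audio is not documented here, so this assumes consecutive, non-overlapping 4-second windows with a shorter final window kept. The names `plan_chunks` and `chunk_has_enough_voice` are hypothetical.

```python
import math

CHUNK_DURATION_S = 4.0        # Chunk Duration
MAX_AUDIO_DURATION_S = 300.0  # Audio Max Duration (5 minutes)
MIN_CHUNK_VOICE_MS = 750      # Audio Chunks Minimum Voice Duration


def plan_chunks(duration_s):
    """Split [0, duration_s] into consecutive windows of at most 4 seconds.

    Assumption: chunks do not overlap and a shorter final chunk is kept.
    """
    if duration_s <= 0 or duration_s > MAX_AUDIO_DURATION_S:
        raise ValueError("duration must be in (0, 300] seconds")
    count = math.ceil(duration_s / CHUNK_DURATION_S)
    return [
        (i * CHUNK_DURATION_S, min((i + 1) * CHUNK_DURATION_S, duration_s))
        for i in range(count)
    ]


def chunk_has_enough_voice(voice_ms):
    """True if a chunk holds at least the minimum voice duration from the table."""
    return voice_ms >= MIN_CHUNK_VOICE_MS
```

Under these assumptions a 10-second audio yields three windows, and the last one (2 seconds long) is only useful if it still contains at least 750 milliseconds of voice.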

Message Verification Models

Message verification models compare the content of a voice sample with an expected message to verify that the two match. They are used to perform message verifications and can serve as a voice captcha.

Message Model Descriptions

Model Name | Handle | Version | Status | Description
Digital | digital | v1 | ✅ Available | The digital message verification model is designed and trained for higher-quality audio, with sample rates of 16 kHz or higher. It supports a wide range of languages and expects a single phrase.
Phone | phone | v1 | ✅ Available | The phone message verification model is the recommended model for the telephone channel and is the only model that supports 8 kHz audio. It supports a wide range of languages and expects a single phrase.
Default | default | v2 | ⚠️ Deprecated | The default message verification model is deprecated and will be removed in a future platform release.

Message Model Requirements

Model Name | Audio Max Duration | Audio Minimum Voice Duration | Audio Min Frequency | File State | Message Maximum Length | Supports Digits | Language
Digital | 30 seconds | 2.5 seconds | 16000 Hz | Available | 200 characters | No | All languages supported
Phone | 30 seconds | 2.5 seconds | 8000 Hz | Available | 200 characters | No | All languages supported
Default | 30 seconds | 2.5 seconds | 16000 Hz | Available | 200 characters | No | All languages supported
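
The expected message can be checked against the table before a verification is requested. This is a minimal sketch with a hypothetical helper name, `check_expected_message`. It reads "Supports Digits: No" as meaning the expected message should not contain digit characters, which is an interpretation rather than documented behavior.

```python
MAX_MESSAGE_LENGTH = 200  # Message Maximum Length, in characters


def check_expected_message(message):
    """Return a list of problems; an empty list means the message fits the table."""
    problems = []
    if len(message) > MAX_MESSAGE_LENGTH:
        problems.append("message longer than 200 characters")
    # Assumption: "Supports Digits: No" means digits must be spelled out.
    # str.isdigit also matches non-ASCII digit characters.
    if any(ch.isdigit() for ch in message):
        problems.append("message contains digits")
    return problems
```

For example, `check_expected_message("call 911")` flags the digits, while a spelled-out phrase such as "my voice is my password" passes.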

Supported Languages

The language must be one of the following codes:
en, zh, de, es, ru, ko, fr, ja, pt, tr, pl, ca, nl, ar, sv, it, id, hi, fi, vi, he, uk, el, ms, cs, ro, da, hu, ta, no, th, ur, hr, bg, lt, la, mi, ml, cy, sk, te, fa, lv, bn, sr, az, sl, kn, et, mk, br, eu, is, hy, ne, mn, bs, kk, sq, sw, gl, mr, pa, si, km, sn, yo, so, af, oc, ka, be, tg, sd, gu, am, yi, lo, uz, fo, ht, ps, tk, nn, mt, sa, lb, my, bo, tl, mg, as, tt, haw, ln, ha, ba, jw, su or yue.
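
A language code can be validated against this list before it is sent. The sketch below copies the codes verbatim. The names `SUPPORTED_LANGUAGES` and `is_supported_language` are hypothetical, and the exact, case-sensitive match is an assumption.

```python
# Codes copied from the list above, whitespace-separated for readability.
SUPPORTED_LANGUAGES = frozenset("""
en zh de es ru ko fr ja pt tr pl ca nl ar sv it id hi fi vi
he uk el ms cs ro da hu ta no th ur hr bg lt la mi ml cy sk
te fa lv bn sr az sl kn et mk br eu is hy ne mn bs kk sq sw
gl mr pa si km sn yo so af oc ka be tg sd gu am yi lo uz fo
ht ps tk nn mt sa lb my bo tl mg as tt haw ln ha ba jw su yue
""".split())


def is_supported_language(code):
    """True if code appears in the list (assumption: exact, case-sensitive match)."""
    return code in SUPPORTED_LANGUAGES
```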