Get started
Create an Infobip account
To use any Infobip product, you need to sign up for a free account. See: Create an Account for more details.
If you already have an account and use our other solutions, contact sales or support to enable different Voice and Video capabilities based on your specific needs.
Free trial
Once you create an account, you are automatically enrolled in a free trial until you run out of allocated free credits.
Here are some specific details for Voice and Video to keep in mind during the free trial:
- You are entitled to:
  - 15 inbound calls from phones
  - 15 outbound calls to phones
  - 30 calls over webRTC
- Voice calls in the trial period can be directed only to a mobile phone number you verified during the signup process.
- Voice calls in the trial period are limited to a 5-minute duration.
- Among all possible voice add-ons, only the few listed below are enabled during the trial. Contact Sales if you need other options activated.
  - Calls API Conferencing and webRTC Rooms
  - Answering Machine Detection
- The Voice channel is available in more than 200 countries worldwide. Self-signup and trial users are limited to the countries specified in this list. If you are interested in a country that is not on this list, contact our Sales team.
Upgrade to a paying account
At any time, you can decide to finish the free trial and upgrade your account. Use the Upgrade options button on the Home page to view available billing options.
Numbers and senders
For each of our voice solutions, you can select a phone number to be used as the caller ID. This caller ID may or may not be displayed as-is on the end-users' phone depending on the voice connection used to reach the destination. If you need to ensure your caller ID is always displayed, reach out to our Support team.
To process inbound calls with our voice solutions, you need to use Infobip voice numbers, unless your inbound calls will reach the Infobip platform over SIP or webRTC. You can lease voice numbers from Infobip and set up the desired voice action on these at any time.
Refer to Voice Numbers for more details.
Inbound and outbound calls for self-signup customers
Outbound voice calls for self-signup users are available in 215 countries. Inbound voice calls for self-signup users are available in 30 countries worldwide.
Check out the list of supported countries.
If you are interested in one of the unsupported countries, contact our Sales team for assistance.
NOTE
Outbound calls to the United States are not enabled by default on self-signup accounts. To add the US as a destination, reach out to our Support or Sales team so that we can validate that your use case(s) comply with the FCC's robocalling restrictions.
Add-ons and options
You can combine different add-ons with the Voice and Video channel, depending on your business needs. To enable these add-ons, contact sales or support.
- Recording
- Answering Machine Detection
- SIP trunking
- Conferences
- Text-to-speech
- Speech capture
- Audio streaming
Recording
With the Recording add-on, you can record all voice and video communications, either as whole conversations or only parts of them.
You can activate Recording in different places depending on the voice and video solution you use:
- During voice action setup on a voice number: when configuring your Voice number, various voice actions (Forward to IVR, Forward to Phone, Forward to SIP, etc.) come with the ability to automatically record the inbound call and any child call connected to it.
- When creating a Broadcast over the voice channel.
- When a new SIP trunk has been created over the portal, clicking on that trunk allows activating the recording of all traffic going through it.
- When creating a new webRTC token for one of your webRTC users.
- When using Calls API, the recording of Calls, Dialogs and Conferences can be started and stopped at any time with the recording API methods.
- When using Click to Call, IVR, or Advanced Voice Messages APIs, as optional parameters for your requests.
The complete Recording facility is made of 3 complementary add-ons:
- Recording: whether your account is allowed to trigger voice and video call recordings.
- Recording Storage: required if your account does not use our SFTP facility.
- Video composition: required if you plan to have video conferences or rooms with multiple participants, and when all participants' recordings should be merged into a single media file.
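The three add-on rules above can be read as a small decision function. The following is only an illustrative sketch: the function and parameter names are hypothetical and not part of any Infobip API, and the actual activation is still done through sales or support.

```python
from __future__ import annotations


def required_recording_addons(uses_sftp: bool, merged_multiparty_video: bool) -> list[str]:
    """Return the add-ons needed for a setup, following the three rules listed above."""
    addons = ["Recording"]  # needed to trigger any voice or video recording
    if not uses_sftp:
        addons.append("Recording Storage")  # required when recordings are not pushed to SFTP
    if merged_multiparty_video:
        addons.append("Video composition")  # required to merge participants into one media file
    return addons


# Hypothetical setup: no SFTP server, video rooms merged into a single file.
print(required_recording_addons(uses_sftp=False, merged_multiparty_video=True))
```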
COUNTRY REGULATIONS AND COMPLIANCE
Certain types of Voice and Video calls and call recordings might be subject to specific country regulations. Before you set up and start using Voice and Video, make sure you've checked the relevant country's telecom regulations.
Infobip Cloud or SFTP
You may choose to have all your voice and video recordings stored on Infobip's own cloud storage, or immediately pushed to your SFTP server once the recording is complete.
Your SFTP server address and credentials can only be set up in the web interface, in the Settings section under the Recording tab of the Voice channel application.
INFO
If your SFTP server is unreachable, recording files will be discarded and not stored on the Infobip cloud storage.
Retrieve voice and video recordings from your account
You can find voice and video recordings in your account, under the Recordings section of the Voice channel application.
Recordings are split into 3 different categories:
Category | Description |
---|---|
Calls | Recordings of single-leg calls |
Conferences | Recordings of multi-party calls (2 participants or more) |
Dialogs | Recordings of 2-party calls |
INFO
For a limited period, you may find recordings performed over SIP trunks (Forward to IP action), Voice Messages, IVR, Number Masking, and Click to Call under the Analyze/Recordings section of your account.
We are gradually transitioning these recordings to a new Voice recording page.
Retrieve voice and video recordings via API
You can retrieve recordings in 2 ways, depending on the Voice API you are using:
- For recordings performed with Calls API, see our related product documentation.
- For recordings performed with IVR API, check our dedicated search and download methods.
Answering Machine Detection
Answering Machine Detection (AMD) is a feature that determines whether a machine (voicemail answering machine) or a human answered the call. AMD can be applied to automated outbound calls (outbound IVR, Text-to-Speech, pre-recorded automated calls, or click-to-call).
AMD can be used in the web interface (over Broadcast or Moments using Flow) or over the API.
Here is a diagram showing how Answering Machine Detection works as a feature on the Voice platform.
For this feature to work, you have to configure what should happen when an answering machine answers the call you initiated for your customer: either hang up the call, or continue, in which case your message might end up in the end-user's voicemail.
AMD was programmed with a detection time of 4 seconds minimum. If the AMD detects silence after the call has been answered, it will interpret it as if it was an answering machine. However, if any noise is detected once the call has been answered, AMD will interpret that as if a human answered the call.
Our AMD mechanism is 95% accurate for Spanish and Portuguese in countries such as Spain, Colombia, Mexico, Peru, and Brazil. For other markets, accuracy is around 80%, and we are constantly working on improving the model.
With AMD, you save money by not leaving voice messages on people's answering machines. For example, it wouldn't be a good idea to leave private voice messages (such as one-time PINs) on the end-user's voicemail.
On the other hand, if you want your end users to hear your message later in their voicemail, configure AMD to play the voice message anyway. In this case, when an answering machine is detected, we wait for the answering machine message to finish and then play your message, which is then saved in your end-user's voicemail.
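The silence/noise rule and the hang-up-or-continue choice described above can be sketched as follows. This is a simplified illustration only, not the actual detection model: the sample format, the noise threshold, and the action names are assumptions made for the example, and the 4-second minimum detection time is treated here as a fixed observation window.

```python
DETECTION_WINDOW_SECONDS = 4.0  # minimum detection time stated above, used here as a fixed window
NOISE_THRESHOLD = 0.1           # hypothetical energy level that counts as "noise"


def classify_answer(samples, window=DETECTION_WINDOW_SECONDS, threshold=NOISE_THRESHOLD):
    """Classify an answered call from (seconds_since_answer, energy) pairs.

    Any noise inside the window is treated as a human answering;
    silence for the whole window is treated as an answering machine.
    """
    for seconds, energy in samples:
        if seconds <= window and energy >= threshold:
            return "HUMAN"
    return "MACHINE"


def next_action(result, on_machine="HANG_UP"):
    """Pick the next step; on_machine is the configured choice: 'HANG_UP' or 'CONTINUE'."""
    if result == "HUMAN":
        return "PLAY_MESSAGE"
    if on_machine == "CONTINUE":
        return "WAIT_FOR_GREETING_THEN_PLAY_MESSAGE"
    return "HANG_UP"


silent_call = [(0.5, 0.0), (2.0, 0.02), (3.9, 0.01)]
print(next_action(classify_answer(silent_call), on_machine="CONTINUE"))
```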
SIP trunking
This add-on is required to use SIP trunks.
Conferences
The Conferencing add-on is required to use Calls API Conferences and WebRTC Rooms.
Text-to-speech
Text-to-speech (TTS) is used to convert a written message into an audio file. That file is then played to your customers over the voice & video product or API triggering its usage. You can use it for both promotional and transactional traffic. With this feature, you don't have to bother with pre-recorded audio. Additionally, you save time and are able to speed up your go-to-market strategy.
Text-to-speech languages
For text-to-speech conversion, we offer more than 100 languages and accents. A full list of supported languages is shown below.
Language | Abbreviation | Gender | Name | Character limit | Default voice | Voice provider |
---|---|---|---|---|---|---|
Arabic | ar | Female | Hoda | 1000 | | Azure - Standard |
Arabic | ar | Male | Naayf | 1000 | | Azure - Standard |
Arabic | ar | Female | Zeina | 3000 | | Amazon Polly - Standard |
Arabic | ar | Female | Aisha (beta) | 5000 | | Google - Standard |
Arabic | ar | Male | Farooq (beta) | 5000 | ✓ | Google - Standard |
Arabic | ar | Male | Hussein (beta) | 5000 | | Google - Standard |
Arabic | ar | Female | Amal (beta) | 5000 | | Google - Standard |
Bengali | bn | Male | Sayan (beta) | 5000 | ✓ | Google - Standard |
Bengali | bn | Female | Sushmita (beta) | 5000 | | Google - Standard |
Bulgarian | bg | Male | Ivan | 1000 | ✓ | Azure - Standard |
Catalan | ca | Female | Conchita | 1000 | | Amazon Polly - Standard |
Catalan | ca | Female | Herena | 1000 | | Azure - Standard |
Chinese (Simplified) | zh-cn | Female | Huihui | 1000 | | Azure - Standard |
Chinese (Simplified) | zh-cn | Male | Kangkang | 1000 | | Azure - Standard |
Chinese (Simplified) | zh-cn | Female | Yaoyao | 1000 | | Azure - Standard |
Chinese (Simplified) | zh-cn | Female | Zhiyu | 3000 | | Amazon Polly - Standard |
Chinese (Simplified) | zh-cn | Female | Akemi (beta) | 5000 | ✓ | Google - Standard |
Chinese (Simplified) | zh-cn | Male | Chen (beta) | 5000 | | Google - Standard |
Chinese (Simplified) | zh-cn | Male | Huang (beta) | 5000 | | Google - Standard |
Chinese (Simplified) | zh-cn | Female | Liu (beta) | 5000 | | Google - Standard |
Chinese (Simplified) | zh-cn | Male | Wang (beta) | 5000 | | Google - Standard |
Chinese (Simplified) | zh-cn | Male | Zhang (beta) | 5000 | | Google - Standard |
Chinese (Simplified) | zh-cn | Female | Lin (beta) | 5000 | | Google - Standard |
Chinese (Traditional) | zh-tw | Female | Hanhan | 1000 | | Azure - Standard |
Chinese (Traditional) | zh-tw | Female | Yating | 1000 | ✓ | Azure - Standard |
Chinese (Traditional) | zh-tw | Male | Zhiwei | 1000 | | Azure - Standard |
Croatian | hr | Male | Matej | 1000 | ✓ | Azure - Standard |
Czech | cs | Male | Jakub | 1000 | ✓ | Azure - Standard |
Danish | da | Female | Helle | 1000 | | Azure - Standard |
Danish | da | Male | Mads | 3000 | | Amazon Polly - Standard |
Danish | da | Female | Naja | 3000 | ✓ | Amazon Polly - Standard |
Dutch | nl | Female | Hanna | 1000 | | Azure - Standard |
Dutch | nl | Female | Lotte | 3000 | ✓ | Amazon Polly - Standard |
Dutch | nl | Male | Ruben | 3000 | | Amazon Polly - Standard |
English | en | Male | Benjamin | 1000 | | Azure - Standard |
English | en | Male | Guy | 1000 | | Azure - Standard |
English | en | Female | Ivy | 3000 | | Amazon Polly - Standard |
English | en | Female | Jane | 1000 | | Azure - Standard |
English | en | Female | Jessica | 1000 | | Azure - Standard |
English | en | Female | Joanna | 3000 | ✓ | Amazon Polly - Standard |
English | en | Male | Joey | 3000 | | Amazon Polly - Standard |
English | en | Male | Justin | 3000 | | Amazon Polly - Standard |
English | en | Female | Kendra | 3000 | | Amazon Polly - Standard |
English | en | Female | Kimberly | 3000 | | Amazon Polly - Standard |
English | en | Male | Matthew | 3000 | | Amazon Polly - Standard |
English | en | Female | Salli | 3000 | | Amazon Polly - Standard |
English | en | Female | Zira | 1000 | | Azure - Standard |
English (Australian) | en-au | Female | Catherine | 1000 | | Azure - Standard |
English (Australian) | en-au | Female | Hayley | 1000 | | Azure - Standard |
English (Australian) | en-au | Female | Nicole | 3000 | ✓ | Amazon Polly - Standard |
English (Australian) | en-au | Male | Russell | 3000 | | Amazon Polly - Standard |
English (British) | en-gb | Female | Amy | 3000 | ✓ | Amazon Polly - Standard |
English (British) | en-gb | Male | Brian | 3000 | | Amazon Polly - Standard |
English (British) | en-gb | Female | Emma | 3000 | | Amazon Polly - Standard |
English (British) | en-gb | Male | George | 1000 | | Azure - Standard |
English (British) | en-gb | Female | Hazel | 1000 | | Azure - Standard |
English (British) | en-gb | Female | Rosie | 1000 | | Azure - Standard |
English (Canadian) | en-ca | Female | Heather | 1000 | ✓ | Azure - Standard |
English (Canadian) | en-ca | Female | Alice | 1000 | | Azure - Standard |
English (Indian) | en-in | Female | Aditi | 3000 | | Amazon Polly - Standard |
English (Indian) | en-in | Female | Heera | 1000 | ✓ | Azure - Standard |
English (Indian) | en-in | Female | Priya | 1000 | | Azure - Standard |
English (Indian) | en-in | Female | Raveena | 3000 | | Amazon Polly - Standard |
English (Indian) | en-in | Male | Ravi | 1000 | | Azure - Standard |
English (Irish) | en-ie | Male | Sean | 1000 | ✓ | Azure - Standard |
English (Welsh) | en-gb-wls | Male | Geraint | 3000 | ✓ | Amazon Polly - Standard |
Finnish | fi | Female | Heidi | 1000 | | Azure - Standard |
Finnish | fi | Female | Evelin (beta) | 5000 | | Google - Standard |
French | fr | Female | Celine | 3000 | ✓ | Amazon Polly - Standard |
French | fr | Female | Hortense | 1000 | | Azure - Standard |
French | fr | Female | Juliette | 1000 | | Azure - Standard |
French | fr | Female | Lea | 3000 | | Amazon Polly - Standard |
French | fr | Male | Mathieu | 3000 | | Amazon Polly - Standard |
French | fr | Male | Picart | 1000 | | Azure - Standard |
French (Canadian) | fr-ca | Female | Caroline | 1000 | | Azure - Standard |
French (Canadian) | fr-ca | Female | Chantal | 3000 | ✓ | Amazon Polly - Standard |
French (Canadian) | fr-ca | Female | Harmonie | 1000 | | Azure - Standard |
French (Swiss) | fr-ch | Male | Guillaume | 1000 | ✓ | Azure - Standard |
German | de | Male | Hans | 3000 | | Amazon Polly - Standard |
German | de | Female | Hedda | 1000 | | Azure - Standard |
German | de | Female | Marlene | 3000 | ✓ | Amazon Polly - Standard |
German | de | Male | Stefan | 3000 | | Azure - Standard |
German | de | Female | Vicki | 3000 | | Amazon Polly - Standard |
German | de | Female | Angela | 1000 | | Azure - Standard |
German (Austria) | de-at | Male | Michael | 1000 | ✓ | Azure - Standard |
German (Swiss) | de-ch | Male | Karsten | 1000 | ✓ | Azure - Standard |
Greek | el | Male | Stefanos | 1000 | ✓ | Azure - Standard |
Greek | el | Female | Sophia (beta) | 5000 | | Google - Standard |
Gujarati | gu | Male | Dinesh (beta) | 5000 | ✓ | Google - Standard |
Gujarati | gu | Female | Leela (beta) | 5000 | | Google - Standard |
Hebrew | he | Male | Asaf | 1000 | ✓ | Azure - Standard |
Hindi | hi | Female | Aadita | 3000 | ✓ | Amazon Polly - Standard |
Hindi | hi | Male | Hemant | 1000 | | Azure - Standard |
Hindi | hi | Female | Kalpana | 1000 | | Azure - Standard |
Hindi | hi | Female | Aarushi (beta) | 5000 | | Google - Standard |
Hindi | hi | Male | Akash (beta) | 5000 | | Google - Standard |
Hindi | hi | Male | Daman (beta) | 5000 | | Google - Standard |
Hindi | hi | Female | Divya (beta) | 5000 | | Google - Standard |
Hungarian | hu | Male | Szabolcs | 1000 | ✓ | Azure - Standard |
Icelandic | is | Female | Dora | 3000 | ✓ | Amazon Polly - Standard |
Icelandic | is | Male | Karl | 3000 | | Amazon Polly - Standard |
Indonesian | id | Male | Andika | 1000 | ✓ | Azure - Standard |
Indonesian | id | Male | Arif (beta) | 1000 | | Google - Standard |
Indonesian | id | Female | Indah (beta) | 5000 | | Google - Standard |
Indonesian | id | Male | Reza (beta) | 5000 | | Google - Standard |
Indonesian | id | Female | Nurul (beta) | 5000 | | Google - Standard |
Italian | it | Female | Bianca | 3000 | | Amazon Polly - Standard |
Italian | it | Female | Carla | 3000 | ✓ | Amazon Polly - Standard |
Italian | it | Male | Cosimo | 1000 | | Azure - Standard |
Italian | it | Male | Giorgio | 3000 | | Amazon Polly - Standard |
Italian | it | Female | Lucia | 1000 | | Azure - Standard |
Italian | it | Female | Gianna (beta) | 5000 | | Google - Standard |
Japanese | ja | Female | Ayumi | 1000 | | Azure - Standard |
Japanese | ja | Female | Haruka | 1000 | | Azure - Standard |
Japanese | ja | Male | Ichiro | 1000 | | Azure - Standard |
Japanese | ja | Female | Mizuki | 3000 | ✓ | Amazon Polly - Standard |
Japanese | ja | Male | Takumi | 3000 | | Amazon Polly - Standard |
Kannada | kn | Male | Shashank (beta) | 5000 | ✓ | Google - Standard |
Kannada | kn | Female | Namratha (beta) | 5000 | | Google - Standard |
Korean | ko | Female | Heami | 1000 | | Azure - Standard |
Korean | ko | Female | Seoyeon | 3000 | | Amazon Polly - Standard |
Korean | ko | Female | Sumi (beta) | 5000 | ✓ | Google - Standard |
Korean | ko | Female | Jina (beta) | 5000 | | Google - Standard |
Korean | ko | Male | Himchan (beta) | 5000 | | Google - Standard |
Korean | ko | Male | Minho (beta) | 5000 | | Google - Standard |
Malay | ms | Male | Rizwan | 1000 | ✓ | Azure - Standard |
Malayalam | ml | Male | Vishnu (beta) | 5000 | ✓ | Google - Standard |
Malayalam | ml | Female | Kirti (beta) | 5000 | | Google - Standard |
Norwegian | no | Female | Hulda | 1000 | | Azure - Standard |
Norwegian | no | Female | Liv | 2000 | ✓ | Amazon Polly - Standard |
Polish | pl | Female | Ewa | 3000 | ✓ | Amazon Polly - Standard |
Polish | pl | Male | Jacek | 3000 | | Amazon Polly - Standard |
Polish | pl | Male | Jan | 3000 | | Amazon Polly - Standard |
Polish | pl | Female | Maja | 3000 | | Amazon Polly - Standard |
Polish | pl | Female | Paulina | 1000 | | Azure - Standard |
Portuguese | pt-pt | Male | Cristiano | 3000 | | Amazon Polly - Standard |
Portuguese | pt-pt | Female | Helia | 1000 | | Azure - Standard |
Portuguese | pt-pt | Female | Ines | 3000 | | Amazon Polly - Standard |
Portuguese | pt-pt | Female | Abrielle (beta) | 5000 | ✓ | Google - Standard |
Portuguese | pt-pt | Male | Henriques (beta) | 5000 | | Google - Standard |
Portuguese | pt-pt | Male | Jeraldo (beta) | 5000 | | Google - Standard |
Portuguese | pt-pt | Female | Jacinda (beta) | 5000 | | Google - Standard |
Portuguese (Brazilian) | pt-br | Female | Camila | 3000 | | Amazon Polly - Standard |
Portuguese (Brazilian) | pt-br | Male | Daniel | 1000 | | Azure - Standard |
Portuguese (Brazilian) | pt-br | Female | Heloisa | 1000 | | Azure - Standard |
Portuguese (Brazilian) | pt-br | Male | Ricardo | 3000 | | Amazon Polly - Standard |
Portuguese (Brazilian) | pt-br | Female | Vitoria | 3000 | ✓ | Amazon Polly - Standard |
Romanian | ro | Male | Andrei | 1000 | | Azure - Standard |
Romanian | ro | Female | Carmen | 3000 | ✓ | Amazon Polly - Standard |
Russian | ru | Female | Ekaterina | 1000 | | Azure - Standard |
Russian | ru | Female | Irina | 1000 | | Azure - Standard |
Russian | ru | Male | Maxim | 3000 | | Amazon Polly - Standard |
Russian | ru | Male | Pavel | 1000 | | Azure - Standard |
Russian | ru | Female | Tatyana | 3000 | ✓ | Amazon Polly - Standard |
Slovak | sk | Male | Filip | 1000 | ✓ | Azure - Standard |
Slovenian | sl | Male | Lado | 1000 | ✓ | Azure - Standard |
Spanish | es | Male | Enrique | 3000 | | Amazon Polly - Standard |
Spanish | es | Female | Juana | 1000 | | Azure - Standard |
Spanish | es | Female | Laura | 1000 | | Azure - Standard |
Spanish | es | Female | Linda | 3000 | | Amazon Polly - Standard |
Spanish | es | Male | Miguel | 3000 | | Amazon Polly - Standard |
Spanish | es | Male | Pablo | 1000 | | Azure - Standard |
Spanish | es | Female | Penelope | 3000 | ✓ | Amazon Polly - Standard |
Spanish | es | Female | Lupe | 3000 | | Amazon Polly - Standard |
Spanish | es | Female | Gabriela (beta) | 5000 | | Google - Standard |
Spanish (Mexican) | es-mx | Female | Hilda | 1000 | | Azure - Standard |
Spanish (Mexican) | es-mx | Female | Mia | 3000 | ✓ | Amazon Polly - Standard |
Spanish (Mexican) | es-mx | Male | Raul | 1000 | | Azure - Standard |
Swedish | sv | Female | Astrid | 3000 | ✓ | Amazon Polly - Standard |
Swedish | sv | Female | Hedvig | 1000 | | Azure - Standard |
Tamil | ta | Male | Valluvar | 1000 | ✓ | Azure - Standard |
Tamil | ta | Male | Ganesh (beta) | 5000 | | Google - Standard |
Tamil | ta | Female | Shruti (beta) | 5000 | | Google - Standard |
Telugu | te | Female | Chitra | 1000 | ✓ | Azure - Standard |
Telugu | te | Male | Vijay (beta) | 5000 | | Google - Standard |
Telugu | te | Female | Samantha (beta) | 5000 | | Google - Standard |
Thai | th | Male | Pattara | 1000 | | Azure - Standard |
Thai | th | Female | Natchaya (beta) | 5000 | | Google - Standard |
Turkish | tr | Female | Filiz | 3000 | ✓ | Amazon Polly - Standard |
Turkish | tr | Female | Seda | 1000 | | Azure - Standard |
Ukrainian | uk | Female | Ulyana | 5000 | ✓ | Google - Standard |
Vietnamese | vi | Male | An | 1000 | ✓ | Azure - Standard |
Vietnamese | vi | Female | Lien (beta) | 5000 | | Google - Standard |
Vietnamese | vi | Male | Quan (beta) | 5000 | | Google - Standard |
Vietnamese | vi | Female | Mai (beta) | 5000 | | Google - Standard |
Vietnamese | vi | Male | Thao (beta) | 5000 | | Google - Standard |
Welsh | wls | Female | Gwyneth | 3000 | ✓ | Amazon Polly - Standard |
NOTE
Please note that the languages and voices marked as beta are in a pre-production stage. Infobip does not have the technical support obligation for these languages.
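Because the character limit depends on the voice, it can be useful to check a message against the limit before sending it. The sketch below copies three rows from the table above into a hypothetical lookup dictionary; the helper name is illustrative, and it assumes the limit is counted in plain characters (how the platform counts markup or multi-byte characters is not stated here).

```python
# A few rows copied from the table above; extend as needed.
VOICES = {
    "Joanna": {"language": "en", "char_limit": 3000, "provider": "Amazon Polly - Standard"},
    "Zira": {"language": "en", "char_limit": 1000, "provider": "Azure - Standard"},
    "Ulyana": {"language": "uk", "char_limit": 5000, "provider": "Google - Standard"},
}


def fits_voice(text: str, voice_name: str) -> bool:
    """Return True if the text length is within the selected voice's character limit."""
    return len(text) <= VOICES[voice_name]["char_limit"]


message = "Your appointment is confirmed. " * 40  # 1240 characters
print(fits_voice(message, "Zira"))    # Azure voice, limit 1000
print(fits_voice(message, "Joanna"))  # Amazon Polly voice, limit 3000
```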
SSML support
Speech Synthesis Markup Language (SSML) serves as a powerful tool to finely tune the text-to-speech synthesis process. With SSML, you gain the ability to infuse synthesized speech with natural-sounding inflections, emphasis, pauses, and other speech characteristics. This control allows you to tailor the output to match your unique requirements and create more engaging and lifelike voice experiences for your users.
Infobip supports SSML for Google and Amazon Polly voices. Azure standard voices do not currently support SSML.
The following list shows the supported tags that can be used across Voice API products:
SSML supported tags | Description |
---|---|
<speak> | Identifying SSML-enhanced text |
<break> | Adding a pause |
<say-as> | Controlling how special types of words are spoken |
<p> | Adding a pause between paragraphs |
<s> | Adding a pause between sentences |
<emphasis> | Emphasizing words |
<sub> | Pronouncing acronyms and abbreviations |
<phoneme> | Using phonetic pronunciation |
<prosody> | Controlling volume, speaking rate, and pitch |
Google and Amazon offer other SSML tags, such as par, audio, seq, and so on, but they are not supported in this current implementation.
This is an example of SSML-enhanced text, wrapped in the <speak> tag, as used in a Voice Message API request:
"text": "<speak>Hello. This is a test message sent from Infobip Voice Message API. Your confirmation code is 12345.</speak>"
For more information about the complete syntax of these SSML tags, see:
- For Google standard voices: https://cloud.google.com/text-to-speech/docs/ssml?hl=en
- For Amazon Polly voices: https://docs.aws.amazon.com/polly/latest/dg/supportedtags.html
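Since only the tags in the table above are supported, a quick local check before sending a request can catch unsupported markup early. The following is a minimal sketch using Python's standard XML parser; the helper name is hypothetical, and only the "text" field is taken from the example above (a real request would carry other fields that are not shown here).

```python
import json
import xml.etree.ElementTree as ET

# Tags listed as supported in the table above.
SUPPORTED_TAGS = {"speak", "break", "say-as", "p", "s", "emphasis", "sub", "phoneme", "prosody"}


def unsupported_tags(ssml: str) -> list:
    """Return the sorted names of tags in the SSML that are not in the supported list."""
    root = ET.fromstring(ssml)  # raises ET.ParseError if the markup is malformed
    found = {element.tag for element in root.iter()}
    return sorted(found - SUPPORTED_TAGS)


ssml = (
    '<speak>Hello. <break time="500ms"/>Your confirmation code is '
    '<say-as interpret-as="digits">12345</say-as>.</speak>'
)

if not unsupported_tags(ssml):
    # Only the "text" field is shown; the rest of the request body is omitted.
    print(json.dumps({"text": ssml}))
```

Tags such as audio or par would be reported by `unsupported_tags`, which matches the note above that they are not supported in the current implementation.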
Speech capture
The Infobip Speech Capture feature collects the end-user's speech and returns the recognized content as text. It is currently only available via API in:
- IVR API scenarios, with the capture action type
- Calls API, with the Capture speech method
Supported languages for speech recognition
Check out the table to see which abbreviation you need to use when selecting a specific language in the API request.
Language | Abbreviation |
---|---|
Afrikaans (South Africa) | af-ZA |
Albanian (Albania) | sq-AL |
Amharic (Ethiopia) | am-ET |
Arabic (Algeria) | ar-DZ |
Arabic (Bahrain) | ar-BH |
Arabic (Egypt) | ar-EG |
Arabic (Iraq) | ar-IQ |
Arabic (Israel) | ar-IL |
Arabic (Jordan) | ar-JO |
Arabic (Kuwait) | ar-KW |
Arabic (Lebanon) | ar-LB |
Arabic (Libya) | ar-LY |
Arabic (Morocco) | ar-MA |
Arabic (Oman) | ar-OM |
Arabic (Qatar) | ar-QA |
Arabic (Saudi Arabia) | ar-SA |
Arabic (State of Palestine) | ar-PS |
Arabic (Syria) | ar-SY |
Arabic (Tunisia) | ar-TN |
Arabic (United Arab Emirates) | ar-AE |
Arabic (Yemen) | ar-YE |
Armenian (Armenia) | hy-AM |
Azerbaijani (Azerbaijan) | az-AZ |
Basque (Spain) | eu-ES |
Bengali (Bangladesh) | bn-BD |
Bengali (India) | bn-IN |
Bosnian (Bosnia and Herzegovina) | bs-BA |
Bulgarian (Bulgaria) | bg-BG |
Burmese (Myanmar) | my-MM |
Catalan (Spain) | ca-ES |
Chinese (Cantonese, Traditional) | zh-HK |
Chinese (Mandarin, Simplified) | zh-CN |
Chinese (Mandarin, Taiwan) | zh-TW |
Croatian (Croatia) | hr-HR |
Czech (Czech Republic) | cs-CZ |
Danish (Denmark) | da-DK |
Dutch (Belgium) | nl-BE |
Dutch (Netherlands) | nl-NL |
English (Australia) | en-AU |
English (Canada) | en-CA |
English (Ghana) | en-GH |
English (Great Britain) | en-GB |
English (Hong Kong) | en-HK |
English (India) | en-IN |
English (Ireland) | en-IE |
English (Kenya) | en-KE |
English (New Zealand) | en-NZ |
English (Nigeria) | en-NG |
English (Pakistan) | en-PK |
English (Philippines) | en-PH |
English (Singapore) | en-SG |
English (South Africa) | en-ZA |
English (Tanzania) | en-TZ |
English (US) | en-US |
Estonian (Estonia) | et-EE |
Filipino (Philippines) | fil-PH |
Finnish (Finland) | fi-FI |
French (Belgium) | fr-BE |
French (Canada) | fr-CA |
French (France) | fr-FR |
French (Switzerland) | fr-CH |
Galician (Spain) | gl-ES |
Georgian (Georgia) | ka-GE |
German (Austria) | de-AT |
German (Germany) | de-DE |
German (Switzerland) | de-CH |
Greek (Greece) | el-GR |
Gujarati (India) | gu-IN |
Hebrew (Israel) | he-IL |
Hindi (India) | hi-IN |
Hungarian (Hungary) | hu-HU |
Icelandic (Iceland) | is-IS |
Indonesian (Indonesia) | id-ID |
Irish (Ireland) | ga-IE |
Italian (Italy) | it-IT |
Italian (Switzerland) | it-CH |
Japanese (Japan) | ja-JP |
Javanese (Indonesia) | jv-ID |
Kannada (India) | kn-IN |
Kazakh (Kazakhstan) | kk-KZ |
Khmer (Cambodia) | km-KH |
Korean (South Korea) | ko-KR |
Lao (Laos) | lo-LA |
Latvian (Latvia) | lv-LV |
Lithuanian (Lithuania) | lt-LT |
Macedonian (North Macedonia) | mk-MK |
Malay (Malaysia) | ms-MY |
Malayalam (India) | ml-IN |
Maltese (Malta) | mt-MT |
Marathi (India) | mr-IN |
Mongolian (Mongolia) | mn-MN |
Nepali (Nepal) | ne-NP |
Norwegian Bokmål (Norway) | no-NO |
Persian (Iran) | fa-IR |
Polish (Poland) | pl-PL |
Portuguese (Brazil) | pt-BR |
Portuguese (Portugal) | pt-PT |
Punjabi (Gurmukhi India) | pa-Guru-IN |
Romanian (Romania) | ro-RO |
Russian (Russia) | ru-RU |
Serbian (Serbia) | sr-RS |
Sinhala (Sri Lanka) | si-LK |
Slovak (Slovakia) | sk-SK |
Slovenian (Slovenia) | sl-SI |
Spanish (Argentina) | es-AR |
Spanish (Bolivia) | es-BO |
Spanish (Chile) | es-CL |
Spanish (Colombia) | es-CO |
Spanish (Costa Rica) | es-CR |
Spanish (Cuba) | es-CU |
Spanish (Dominican Republic) | es-DO |
Spanish (Ecuador) | es-EC |
Spanish (El Salvador) | es-SV |
Spanish (Equatorial Guinea) | es-GQ |
Spanish (Guatemala) | es-GT |
Spanish (Honduras) | es-HN |
Spanish (Mexico) | es-MX |
Spanish (Nicaragua) | es-NI |
Spanish (Panama) | es-PA |
Spanish (Paraguay) | es-PY |
Spanish (Peru) | es-PE |
Spanish (Puerto Rico) | es-PR |
Spanish (Spain) | es-ES |
Spanish (USA) | es-US |
Spanish (Uruguay) | es-UY |
Spanish (Venezuela) | es-VE |
Sundanese (Indonesia) | su-ID |
Swahili (Kenya) | sw-KE |
Swahili (Tanzania) | sw-TZ |
Swedish (Sweden) | sv-SE |
Tamil (India) | ta-IN |
Tamil (Malaysia) | ta-MY |
Tamil (Singapore) | ta-SG |
Tamil (Sri Lanka) | ta-LK |
Telugu (India) | te-IN |
Thai (Thailand) | th-TH |
Turkish (Turkey) | tr-TR |
Ukrainian (Ukraine) | uk-UA |
Urdu (India) | ur-IN |
Urdu (Pakistan) | ur-PK |
Uzbek (Uzbekistan) | uz-UZ |
Vietnamese (Vietnam) | vi-VN |
Zulu (South Africa) | zu-ZA |
Enhance recognition for specific words or expressions
Depending on the API you use for speech recognition, you might be able to define key phrases (hints). Key phrases are used to match captured speech.
If the full captured text contains one of the specified phrases, that phrase is highlighted in the outcome of your speech recognition action. Each key phrase can contain up to 5 words, and the number of key phrases is unlimited.
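The key phrase rules above can be illustrated with a small local sketch. The function names are hypothetical, and the matching shown here (case-insensitive substring search) is an assumption for illustration; the platform's actual matching logic is not described in this section.

```python
MAX_WORDS_PER_PHRASE = 5  # limit stated above; the number of phrases is unlimited


def validate_key_phrases(phrases):
    """Raise ValueError if any key phrase has more than 5 words; otherwise return the list."""
    too_long = [p for p in phrases if len(p.split()) > MAX_WORDS_PER_PHRASE]
    if too_long:
        raise ValueError(f"Key phrases longer than {MAX_WORDS_PER_PHRASE} words: {too_long}")
    return list(phrases)


def matched_phrases(captured_text, phrases):
    """Return the key phrases contained in the full captured text (assumed case-insensitive)."""
    text = captured_text.lower()
    return [p for p in phrases if p.lower() in text]


phrases = validate_key_phrases(["account balance", "talk to an agent"])
print(matched_phrases("I want to check my account balance please", phrases))
```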
Audio streaming
Audio streaming is a feature of our Calls API platform that lets you duplicate (fork) the audio of a call to an external service of your choice, using WebSocket as the transport protocol.