Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.smartinterview.ai/llms.txt

Use this file to discover all available pages before exploring further.

Why a multilingual voice survey?

Voice surveys let respondents speak their answers instead of typing — capturing richer, more spontaneous verbatim, especially in markets where literacy or typing comfort varies. When you run the same study across multiple countries (FR, DE, US, UK, IN, BR, MX, JP, CN…), three challenges emerge:
  1. Respondents must be able to speak in their own language and dialect.
  2. The author needs one survey — not one per market — with translated question text.
  3. The analyst must read all responses in a single working language (typically English) for cross-market analysis.
Smartinterview handles all three. This tutorial walks you through it end-to-end. Moreover, in settings you are able to select the conversation mode with the survey and how the respondent interact with its own voice and the artificial voice.
  1. Auto - AI read question and mic input, by clicking.
  2. Manual - mic input only, no TTS for auto-speak.
  3. Full (conversational) - activate the microphone at the beginning and speak on the survey to trigger the cursor / multi-choice.
  4. Off - no text to speech model.
Capture D’écran 2026 05 05 À 20 40 54

1. Add a voice question

In the Survey Builder, add a new question and set its Response Mode to one of:
Response ModeBehaviorWhen to use
VocalMicrophone with a text fallback (respondent can switch)Default — most flexible
Vocal onlyMicrophone enforced, no typingPure voice studies, accessibility, low-literacy markets
TextText only, no microphoneSensitive contexts, noisy environments
VideoVideo recording with optional transcriptionQualitative, body-language studies
The Vocal modes activate Smartinterview’s transcription pipeline automatically. The respondent sees a microphone button, the transcript streams or appears once they stop, and the final text is saved as the answer.
Capture D’écran 2026 05 05 À 20 29 27

2. Pick the survey’s editing language

In Settings → Editing Language, choose the language you’ll author the survey in (e.g. English). This becomes the canonical version of every question. Translations are layered on top of it later.
💡 The Editing Language is shared across the builder. Switching it from English to Français re-renders the editor in French — but the canonical English text is preserved.
Capture D’écran 2026 05 05 À 20 29 50

3. Set the language per voice question (optional override)

By default, transcription auto-detects the language the respondent speaks. This works well in single-market studies. For multi-market studies, open the Question Language Panel (right-hand side of the builder) on each voice question and pick a specific language/dialect tag from the list:
en-US   English (United States)
en-GB   English (United Kingdom)
fr-FR   Français (France)
de-DE   Deutsch (Deutschland)
es-MX   Español (México)
es-ES   Español (España)
pt-BR   Português (Brasil)
pt-PT   Português (Portugal)
hi-IN   हिन्दी (भारत)
zh-CN   中文 (简体)
zh-TW   中文 (繁體)
ja-JP   日本語 (日本)
ar-SA   العربية (السعودية)

A specified tag is passed to Whisper as a language hint — meaning the model commits to that language instead of detecting it. This is critical when:
  • The respondent’s accent is similar to a neighboring language (Portuguese ↔ Spanish, Hindi ↔ Urdu, etc.)
  • Mixed-language phrases are common (Hinglish, Spanglish)
  • You want consistent, predictable behavior across waves
⚠️ Pick the dialect, not just the language. pt-BR and pt-PT produce noticeably different transcriptions on the same audio. Same for es-MX vs es-ES, and zh-CN vs zh-TW. Confirm the priority dialects with Sid (see action item above).
Capture D’écran 2026 05 05 À 20 30 12

4. Translate question text to each market language

For each market you’ll publish to, translate the question text:
  1. Open the question in the editor.
  2. In the Editing Language dropdown, switch to the target language (e.g. Português).
  3. Either type the translated text directly, or click ✨ Translate to auto-translate from the canonical version.
  4. Repeat for choices, button labels, and rich-text content.
Translations are stored under each question’s uischema.options.translations[lang]. At runtime, the renderer reads the URL locale (/fr/..., /en/..., /pt-BR/...) and serves the matching translation; if none exists, it falls back to the canonical text.
💡 One survey, many URLs. A single survey published at surveys/abc can be opened as /en/surveys/abc, /fr/surveys/abc, /de/surveys/abc, etc. The respondent sees questions in their language.
Capture D’écran 2026 05 05 À 20 33 49

5. Review responses — translate everything to English for multi-market analysis

Once responses are in, head to Dashboard → Analytics → Responses.

The “View Language” toggle

At the top of the responses table, the View Language dropdown lets you display every answer in a single language — typically English for multi-market studies. When you switch to English:
  1. Smartinterview translates all answer texts in batch via gpt-4o-mini (the original audio and original transcript are preserved).
  2. Translations are cached in survey.config.data_translations.en — subsequent loads are instant.
  3. The original-language text remains accessible (toggle back any time).

Why this matters for multi-market

A wave covering FR + DE + BR + IN + JP produces verbatims in 5 languages. Without unified translation:
  • Analysts can’t read 4 out of 5 markets.
  • Cross-market themes are invisible.
  • Quote selection for the readout becomes a per-market chore.
With View Language → English:
  • Every response reads in English in one table.
  • Open-ended answers can be classified together in the Pulse Classifier with a single English code-frame (instead of one per market).
  • Quotes can be lifted directly into the readout, with the original-language verbatim attached for verification.
💡 Always export both. When downloading, the Excel export includes the original transcript and the English translation as separate columns. Keep both — the original is your evidence trail.
Capture D’écran 2026 05 05 À 20 35 18
Capture D’écran 2026 05 05 À 20 35 40


Workflow recap

┌──────────────────────────────┐
│  1. ADD VOICE QUESTIONS                        │
│     Response Mode: Vocal / Vocal only / Video  │
└──────────────────────────────┘

┌──────────────────────────────┐
│  2. SET EDITING LANGUAGE                       │
│     The canonical version of every question    │
└──────────────────────────────┘

┌───────────────────────────────────┐
│  3. SET PER-QUESTION LANGUAGE/DIALECT                  │
│     pt-BR vs pt-PT · es-MX vs es-ES · zh-CN vs zh-TW   │
│     ⚠ Align dialect priorities with Sid                │
└───────────────────────────────────┘

┌───────────────────────────────────┐
│  4. TRANSLATE QUESTION TEXT FOR EACH MARKET            │
│     One survey, many URL locales                       │
└───────────────────────────────────┘

┌─────────────────────────────────────┐
│  6. ANALYZE — VIEW LANGUAGE → ENGLISH                    │
│     Unified analysis · classification · export both       │
└─────────────────────────────────────┘

Reference: supported languages

Smartinterview supports 57 languages for voice transcription and TTS, including: English (US/UK/AU/IN), French (FR/CA), German, Spanish (ES/MX/AR), Portuguese (PT/BR), Italian, Dutch, Arabic (SA/EG), Hindi, Mandarin (CN/TW), Japanese, Korean, Thai, Vietnamese, Indonesian, Malay, Turkish, Polish, Russian, Ukrainian, Greek, Czech, Hungarian, Romanian, Hebrew, Swahili, Bengali, Urdu, Tamil, Telugu, and 30+ others. Best of luck with your multi-market voice study! 🎙️🌍