Classifying verbatims

Classifying open-ended verbatim transcripts (levels 1 and 2) with SmartInterview

Once the data has been collected, whether via SmartInterview or an external file, the next step is to transform the open-ended responses into usable thematic codes. SmartInterview allows you to:
  • Define a coding plan with 1 or 2 levels of depth (a third level is coming soon)
  • Automatically generate themes using AI, with customized instructions
  • Precisely control the number of codes per respondent using a rules system
  • Import a training set to guide classification
  • Pre-classify a sample and correct the results
  • Run the full classification on all responses
  • Evaluate code quality with a MECE (mutually exclusive, collectively exhaustive) correlation matrix
  • Export results to Excel
  • Analyze results on the dashboard
This article explains, step by step, how to perform a 1 or 2-level classification in the platform.

1. Choose the data source

The classification accepts two data sources:

| Source | Usage | When to use it |
|---|---|---|
| SmartInterview survey | Select an existing survey, then an open-ended question | You collected responses via SmartInterview |
| Excel file | Import a file containing the transcripts | You have data from an external tool |

Excel file:

[Screenshot: Excel file import]

Survey:

[Screenshot: survey selection]

File import: column selection

When importing a file, you must tell the system:
  • The respondent column (unique identifier for each respondent)
    • If your file does not contain an identifier, choose “No column”: the system will automatically number the respondents from 1 to N
  • The column of responses to be classified (the verbatim comments)
_Tip: The system automatically detects common columns (Respondent_ID, Serial, Responses, Answer, etc.) from the file headers._
[Screenshot: selecting columns in the configuration]
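The header auto-detection described in the tip above can be approximated with a simple matching heuristic. This is a minimal, hypothetical sketch: the candidate name lists are assumptions for illustration, not the platform's actual lists.

```python
# Hypothetical header auto-detection: match well-known candidate
# names case-insensitively against the imported file's headers.
ID_CANDIDATES = {"respondent_id", "serial", "id", "respondent"}
ANSWER_CANDIDATES = {"responses", "answer", "response", "verbatim"}

def detect_columns(headers):
    """Return (id_column, answer_column); either may be None."""
    id_col = answer_col = None
    for h in headers:
        key = h.strip().lower()
        if id_col is None and key in ID_CANDIDATES:
            id_col = h
        if answer_col is None and key in ANSWER_CANDIDATES:
            answer_col = h
    return id_col, answer_col
```

When no identifier column is found (the "No column" case), the system falls back to numbering respondents from 1 to N.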

2. Choose the classification depth

Depth: 1 level (L1 only)

A flat list of main themes. Each answer is associated with one or more themes.
Use case: exploratory studies, initial rapid analysis, short verbatim transcripts.

Depth: 2 levels (L1 + L2)

Main themes (L1) with attached sub-themes (L2). The structure is hierarchical: each sub-theme belongs to a single parent theme.
Use case: in-depth studies requiring fine granularity, distinguishing nuances within the same theme, coding that conforms to market research standards.
In this example:
  • Ease of use → L1 (main theme)
  • Lines with an ID → L2 (sub-themes)

3. Define the coding plan

You have two ways to create your coding plan:
  • A - Import an Excel codeframe (as in the example)
  • B - Let the AI generate the themes

Option A: Import the codes via Excel

If you already have a codeframe, import it directly.

Format for 1 level

A file with at least one column containing the theme labels:
| ID | Label |
|---|---|
| 1 | Interface is intuitive |
| 2 | Ease of use |
| 3 | Performance is fast |
| 4 | App crashes |
| 5 | Nothing |

Format for 2 levels

The file must be structured hierarchically with L1 and L2. The system automatically detects the ID and Label columns from the headers.

Option 1: Separate level columns (in an Excel sheet):
| L1 | L2 |
|---|---|
| Ease of use | Interface is intuitive |
| Ease of use | Navigation is confusing |
| Ease of use | Easy to complete tasks |
| Ease of use | Sensation is smooth |
| Ease of use | The shape is nice |
Option 2: With identifiers and a Parent_ID column, for example:

| ID | Label | Parent_ID |
|---|---|---|
| 1 | Ease of use | |
| 1.1 | Interface is intuitive | 1 |
| 1.2 | Navigation is confusing | 1 |
💡 Tip: You must store the topics in a separate sheet of your Excel file (e.g., a “Topics” tab). The system will prompt you to select the sheet containing the codes.
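Both formats reduce to the same hierarchy: each L2 sub-theme attached to a single L1 parent. A minimal sketch of parsing "Option 1" rows, assuming each row is an (L1 label, L2 label) pair (the function name and shapes are illustrative):

```python
# Hypothetical sketch: build a hierarchy from rows of
# (L1 label, L2 label), attaching each sub-theme to its
# single parent theme and skipping duplicates.
def build_codeframe(rows):
    frame = {}  # L1 label -> ordered list of L2 labels
    for l1, l2 in rows:
        frame.setdefault(l1, [])
        if l2 and l2 not in frame[l1]:
            frame[l1].append(l2)
    return frame
```

An L1 theme appearing with an empty L2 cell simply gets an empty sub-theme list, matching the hierarchical rule that every sub-theme has exactly one parent.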

Preview and filtering

After the import, a preview of the code frame is displayed with:
  • The number of topics detected (updated automatically)
  • The ability to filter by column (useful for excluding certain categories)
  • The ability to manually exclude individual rows

Option B: Generate the codes using AI

If you do not have a pre-existing coding plan, the AI analyzes a sample of your responses and automatically discovers recurring themes.

How it works

  1. The system samples up to 400 responses from your file.
  2. AI identifies recurring themes and formulates them into clear labels.
  3. The themes are sorted by estimated frequency (the indicative number of respondents concerned).
  4. The themes are automatically numbered (sequential IDs).
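The bookkeeping around the generation step (the 400-response sampling cap, frequency ordering, and sequential numbering) can be sketched as follows. The AI theme-discovery step itself is external; all function names here are illustrative assumptions:

```python
import random

# Hypothetical sketch of the generation pipeline's bookkeeping.
def sample_responses(responses, limit=400, seed=0):
    """Sample up to `limit` responses; keep everything if fewer."""
    if len(responses) <= limit:
        return list(responses)
    return random.Random(seed).sample(responses, limit)

def number_themes(themes):
    """themes: list of (label, estimated_frequency) pairs.
    Sort by estimated frequency, then assign sequential IDs."""
    ranked = sorted(themes, key=lambda t: t[1], reverse=True)
    return [(i + 1, label, freq) for i, (label, freq) in enumerate(ranked)]
```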

Provide personalized instructions (guidelines)

You can guide the generation by providing text instructions in the “Guidelines” field. These instructions directly influence:
  • The vocabulary used for the labels
  • The level of granularity (more or fewer themes)
  • The analytical perspective (sensory, emotional, functional, etc.)
  • The language of the labels
⚠️ Important: These instructions are in Beta. They work well for guiding the generation process, but results may vary. Always check the generated themes.

Generation in 2-level mode

In 2-level mode, the process involves two steps:
  1. L1 generation: AI identifies the main themes
  2. Automatic L2 generation: For each L1 theme, AI automatically generates sub-themes based on the corresponding responses.
L1s that do not yet have sub-themes are automatically detected, and the system starts generating the missing L2s before starting the classification.

Theme editor

Whether imported or generated, themes appear in the theme editor (left sidebar), where you can:
| Action | Comment |
|---|---|
| Rename a theme | Click on the label and edit it directly |
| Delete a theme | Click on the trash can icon 🗑️ |
| Add a theme | Click the + button at the bottom of the list |
| Reorder the themes | Drag and drop using the handle ≡ |
| Unfold/fold the L2s | Click the arrow ▶ next to an L1 theme |
| Regenerate the themes | Click the ✨ button to restart AI generation |
| Regenerate the L2s of a parent | Click ✨ next to a specific L1 theme |
💡 The estimated frequencies (indicative number displayed next to each theme) are recalculated after each classification. Before the first classification, they come from the AI’s estimate during generation.

4. Configure the classification rules

The rules control how many codes can be assigned to each respondent. They are applied at three levels: during pre-classification, on imported examples, and during full classification.

Rules for level 1

| Setting | Description | Default |
|---|---|---|
| Max codes | Maximum number of themes per respondent | 0 (unlimited) |

_Example: With Max codes = 3, a respondent can receive at most 3 themes, even if their answer mentions more._

Rules for 2 levels

In 2-level mode, three additional parameters allow for fine control:
| Setting | Internal code | Description | Default |
|---|---|---|---|
| Max L1 | maxCodesL1 | Maximum number of main themes per respondent | 0 (unlimited) |
| Max L2 | maxCodesL2 | Maximum overall number of sub-themes per respondent | 0 (unlimited) |
| Max L2 per L1 | maxCodesL2PerL1 | Maximum number of sub-themes per parent theme | 0 (unlimited) |
Order of application of the rules:
  1. Max L1: limits the number of main themes (Pass 1)
  2. Max L2/L1: limits sub-themes per parent (Pass 2, one call per parent)
  3. Max L2 global: final ceiling after merging all sub-themes (post-processing)
💡 Tip: The Max L2/L1 setting is particularly useful when some L1 themes are very broad and might monopolize all the sub-themes. For example, with Max L2/L1 = 2, each parent theme can contribute at most 2 sub-themes, ensuring a balanced distribution.
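The three passes above can be sketched in a few lines. This is a minimal illustration, assuming each respondent's assignment is a mapping from L1 theme to its list of L2 sub-themes, and that a cap of 0 means unlimited; the function and data shapes are assumptions, not the platform's internals:

```python
# Hypothetical sketch of the three-pass rule application.
def apply_rules(assignment, max_l1=0, max_l2=0, max_l2_per_l1=0):
    """assignment: {l1_theme: [l2_themes]}; 0 means unlimited."""
    # Pass 1: limit the number of main themes.
    l1s = list(assignment)
    if max_l1:
        l1s = l1s[:max_l1]
    # Pass 2: limit sub-themes per parent.
    result = {}
    for l1 in l1s:
        l2s = assignment[l1]
        if max_l2_per_l1:
            l2s = l2s[:max_l2_per_l1]
        result[l1] = list(l2s)
    # Post-processing: global ceiling on the merged sub-theme list.
    if max_l2:
        budget = max_l2
        for l1 in result:
            take = min(len(result[l1]), budget)
            result[l1] = result[l1][:take]
            budget -= take
    return result
```

For example, with Max L1 = 2, Max L2 = 3, and Max L2/L1 = 2, a respondent with three sub-themes under one broad parent keeps only two of them, leaving room for sub-themes under another parent.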

5. Import a training set (past data) (optional)

Why import examples?

A training set (few-shot examples) lets you show the AI examples of already coded verbatims. These examples are sent as context with each classification batch. Importing one is recommended when:
  • The themes are nuanced or closely related.
  • You want continuity within a project or between several projects.
  • You have specific coding conventions (e.g., certain expressions must always be classified under a particular theme).
  • You want to replicate an existing classification on new data.
  • Pre-classification without examples yields unsatisfactory results.

Training file format

The Excel file should look like this:
| Answer | ANSW_1a | COMM1 | ANSW_2a | COMM2 | ANSW_3a | COMM3 | ANSW_4a | COMM4 | ANSW_5a | COMM5 |
|---|---|---|---|---|---|---|---|---|---|---|
| The interface remains fluid from beginning to end, very close to a premium application. | 21 | | | | | | | | | |
| The application is fine, nothing particularly remarkable. | 18 | | 207 | | | | | | | |
| Very smooth navigation, some pleasant animations | 18 | | 207 | | | | | | | |
| Sometimes a little choppy, and some sections seem poorly optimized. | 45 | | 212 | | 233 | | 240 | | | |
The system automatically detects columns containing codes by comparing them to the themes defined in your coding plan. Columns whose values correspond to known themes are identified automatically.
⚠️ Limit: Only 30 examples are kept. The codes in the imported file must match those defined in the initial coding plan.
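The code-column detection can be approximated as follows. A minimal sketch, assuming the file is available as a mapping from header to cell values and that the coding plan's theme IDs form a set (names and shapes are illustrative):

```python
# Hypothetical sketch: a column is treated as a code column when
# all of its non-empty values match IDs from the coding plan.
def detect_code_columns(columns, known_ids):
    """columns: {header: [cell values]}; known_ids: set of theme IDs."""
    code_cols = []
    for header, values in columns.items():
        cells = [v for v in values if v not in ("", None)]
        if cells and all(v in known_ids for v in cells):
            code_cols.append(header)
    return code_cols
```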

Verification and translation

Each imported example is displayed with:
  • The text of the response (verbatim)
  • The assigned theme badges (color-coded)
  • An individual translation button with language selection (French, English, German)
The translation allows you to check the content of the examples in your working language, without modifying the data sent to the classification.

6. Pre-classify a sample

What is pre-classification?

Before running the classification on the entire dataset, the system classifies the first 30 responses as a test. This is the most important step to validate the quality of your coding plan. The pre-classification uses the exact same algorithm as the full classification, but on a smaller sample to allow for quick verification.

What the pre-classification shows you

For each answer, you see:
  • The verbatim text (with keywords corresponding to the themes highlighted)
  • The assigned L1 badges (with color coding)
  • The assigned L2 badges (if depth = 2), grouped under their L1 parent badges
  • A summary: number of classified responses out of the total

Correct the results

The pre-classification is interactive; you can correct each line:
| Action | Gesture | Effect |
|---|---|---|
| Remove a theme | Click the × on the badge | The theme is removed from this response |
| Add an L1 theme | Click the + next to the L1 badges | Drop-down menu with all available L1 themes |
| Add an L2 theme | Click the + next to the L2 badges | Filtered drop-down menu: only sub-themes of already assigned L1 themes are offered |
| Search for a theme | Type in the menu's search field | Real-time filtering of available themes |
💡 Validation area: All lines between your first and last correction are considered validated. They are highlighted in blue and are automatically treated as validated for the full classification.

7. Launch the full classification

When to start the classification?

Start the full classification when:
  • The pre-classification themes match your expectations.
  • Any necessary corrections have been made to the first 30 lines.
  • Your training set, if any, has been imported.
  • The rules (Max codes) are correctly configured.

What’s happening in the background

  1. The responses are divided into batches.
  2. Each batch is sent to the AI with:
    • The list of available themes
    • The training examples (imported + pre-classification corrections)
    • The configured limit rules
  3. In 2-level mode:
    • Pass 1: L1 classification on all batches
    • Pass 2: For each assigned L1 theme, L2 classification by parent
    • Post-processing: Application of the global L2 ceiling (Max L2)
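The batching step above can be sketched as follows. The batch size and request shape are assumptions for illustration, not the platform's actual values:

```python
# Hypothetical sketch of the batching step.
def make_batches(responses, batch_size=50):
    """Split responses into fixed-size batches (last one may be short)."""
    return [responses[i:i + batch_size]
            for i in range(0, len(responses), batch_size)]

def build_request(batch, themes, examples, rules):
    # The same shared context accompanies every batch (assumed shape).
    return {"responses": batch, "themes": themes,
            "examples": examples, "rules": rules}
```

In 2-level mode, Pass 1 runs this loop once for L1, then Pass 2 runs it again per assigned parent theme before the global Max L2 ceiling is applied.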

Result

After classification, you see:
  • A success banner: “Classification complete: N classified responses.”
  • The first 30 responses with their assigned codes (editable)
  • The imported examples (expandable section, if a training set was used)
  • The correlation matrix (see next section)

8. Evaluate the results using the correlation matrix

The MECE principle

A quality coding plan must be MECE:
  • Mutually Exclusive: Each theme covers a distinct aspect. Two themes should not describe the same thing.
  • Collectively Exhaustive: Together, the themes cover all responses. No verbatim should remain without a relevant code.

Read the co-occurrence matrix

The matrix displays the percentage of respondents who received two themes simultaneously. The diagonal is always 100% (a theme is always correlated with itself).
| | Interface is intuitive (122) | App is fast (6) | Navigation is confusing (14) | App crashes or freezes (28) |
|---|---|---|---|---|
| Interface is intuitive (122) | 100% | 33% | 21% | 0% |
| App is fast (6) | 33% | 100% | 0% | 0% |
| Navigation is confusing (14) | 21% | 0% | 100% | 0% |
| App crashes or freezes (28) | 0% | 0% | 0% | 100% |
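The computation behind such a matrix can be sketched in a few lines. This assumes one plausible normalization, dividing each pair count by the row theme's respondent count, which makes the diagonal 100% as described above; the actual platform formula is not documented here:

```python
# Sketch of a co-occurrence matrix: for each (a, b) pair, the
# percentage of respondents holding theme a who also hold theme b.
def cooccurrence(assignments, themes):
    """assignments: list of sets of themes, one set per respondent."""
    counts = {t: sum(1 for s in assignments if t in s) for t in themes}
    matrix = {}
    for a in themes:
        for b in themes:
            both = sum(1 for s in assignments if a in s and b in s)
            matrix[(a, b)] = round(100 * both / counts[a]) if counts[a] else 0
    return matrix
```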

How to interpret the matrix

| Signal | Value | Meaning | Recommended action |
|---|---|---|---|
| 🔴 High correlation | > 50% | The two themes often overlap: possibly redundant | Merge the themes or rephrase their definitions |
| 🟠 Medium correlation | 20–50% | The themes are related but distinct: acceptable | Check a few answers to confirm |
| 🟢 Weak correlation | < 20% | The themes are properly mutually exclusive | Nothing to change |
| ⚪ Zero correlation | 0% | The themes never co-occur | Expected for antagonistic themes (e.g., “Nothing” vs. the others) |
| ⚠️ Low count | 1–2 | The theme concerns very few respondents | Perhaps too specific; consider merging it with a parent theme or removing it in a single-level coding |
💡 Highly correlated cells are highlighted in color to quickly identify problems.

Example analysis
In the matrix above:
  • App is fast × Interface is intuitive = 33% → These two themes are sometimes mentioned together. This is acceptable: the themes remain distinct.
  • Nothing × everything else = 0% → Perfect: respondents who have nothing to say are not categorized under other themes.
  • Interface is intuitive (122) is the dominant theme: 122 out of 232 respondents, or more than half.

Acting on the results

If the matrix reveals problems:
  1. Click “Back to codes” to return to the theme editor.
  2. Merge redundant themes or rewrite ambiguous definitions.
  3. Rerun the classification; the corrections made to the first 30 lines are saved as training examples (“Re-classify with corrections” button).
This iterative cycle (classify → evaluate → adjust → re-classify) lets you gradually converge on a robust, MECE coding plan.

9. Export the results

Once the classification is validated, click on “Download Excel” to obtain a structured file:
| Sheet | Content | Description |
|---|---|---|
| FilesQO | Classified data | Each respondent with their text and assigned codes (L1 and L2 columns if applicable) |
| Topics | Coding plan | The complete list of themes with their identifiers, organized hierarchically |
| Top Topics | Frequency summary | The most frequent themes with their counts and percentages |
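The contents of the three sheets can be sketched as plain data structures. This is a stdlib-only illustration of the payload shapes, not the platform's export code, which writes an actual .xlsx workbook; all names and the code-joining format are assumptions:

```python
from collections import Counter

# Hypothetical sketch of the three-sheet export payload.
def build_export(classified, themes):
    """classified: list of (respondent, text, [codes]);
    themes: {code: label}."""
    files_qo = [(r, t, ";".join(str(c) for c in codes))
                for r, t, codes in classified]
    counts = Counter(c for _, _, codes in classified for c in codes)
    total = len(classified)
    top = [(themes[c], n, round(100 * n / total))
           for c, n in counts.most_common()]
    return {"FilesQO": files_qo, "Topics": sorted(themes.items()),
            "Top Topics": top}
```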

Practical advice

How many themes should be defined?

| Number of responses | Recommended L1 themes | Recommended L2 themes |
|---|---|---|
| < 100 | 5 – 10 | 2 – 4 per L1 |
| 100 – 500 | 10 – 20 | 3 – 6 per L1 |
| > 500 | 15 – 30 | 5 – 10 per L1 |

When to use 1 level vs 2 levels?

| Criteria | 1 level | 2 levels |
|---|---|---|
| Rapid exploratory objective | ✓ | |
| Initial data analysis | ✓ | |
| Fine granularity required | | ✓ |
| Long, detailed transcripts | | ✓ |
| Short verbatims (< 20 words) | ✓ | |