T Rec G.108.1 200005 I!!pdf e
T Rec G.108.1 200005 I!!pdf e
ITU-T G.108.1
TELECOMMUNICATION (05/2000)
STANDARDIZATION SECTOR
OF ITU
Summary
This ITU-T Recommendation provides guidance for transmission planners on how to evaluate those
effects impacting end-to-end speech transmission performance which are not covered by the
E-model (ITU-T Recommendation G.107 [2] – The E-model, a computational model for use in
transmission planning) and its associated Planning Guide (ITU-T Recommendation G.108 [3] –
Application of the E-model – A planning guide). Procedures for informal subjective and objective
evaluations that can be used to complement the E-model are provided here.
Source
ITU-T Recommendation G.108.1 was prepared by ITU-T Study Group 12 (1997-2000) and
approved under the WTSC Resolution 1 procedure on 18 May 2000.
Keywords
Conversational impacts, E-model, speech transmission quality, voice quality.
NOTE
In this Recommendation, the expression "Administration" is used for conciseness to indicate both a
telecommunication administration and a recognized operating agency.
ã ITU 2001
All rights reserved. No part of this publication may be reproduced or utilized in any form or by any means,
electronic or mechanical, including photocopying and microfilm, without permission in writing from the ITU.
1 Scope
This ITU-T Recommendation is intended to provide guidance on the conversational impairments
which are not covered by the E-model and which are thus not included in ITU-T Recommendation
G.108 [3]. Those impairments have been investigated thoroughly by ITU-T during the study period
1997-2000.
2 References
The following ITU-T Recommendations and other references contain provisions which, through
reference in this text, constitute provisions of this Recommendation. At the time of publication, the
editions indicated were valid. All Recommendations and other references are subject to revision; all
users of this Recommendation are therefore encouraged to investigate the possibility of applying the
most recent edition of the Recommendations and other references listed below. A list of the currently
valid ITU-T Recommendations is regularly published.
[1] ITU-T Recommendation G.100 (1993), Definitions used in Recommendations on general
characteristics of international telephone connections and circuits.
[2] ITU-T Recommendation G.107 (2000), The E-model, a computational model for use in
transmission planning.
[3] ITU-T Recommendation G.108 (1999), Application of the E-model: a planning guide.
[4] ITU-T Recommendation G.167 (1993), Acoustic echo controllers.
[5] ITU-T Recommendation G.168 (2000), Digital network echo cancellers.
[6] ITU-T Recommendation P.50 (1999), Artificial voices.
[7] ITU-T Recommendation P.57 (1996), Artificial ears.
[8] ITU-T Recommendation P.58 (1996), Head and torso simulator for telephonometry.
[9] ITU-T Recommendation P.59 (1993), Artificial conversational speech.
[10] ITU-T Recommendation P.64 (1999), Determination of sensitivity/frequency characteristics
of local telephone systems.
[11] ITU-T Recommendation P.340 (2000), Transmission characteristics and speech quality
parameters of hands-free terminals.
[12] ITU-T Recommendation P.501 (2000), Test signals for use in telephonometry.
[13] ITU-T Recommendation P.502 (2000), Objective test methods for speech communication
systems using complex test signals.
[14] ITU-T Recommendation P.581 (2000), Use of head and torso simulator (HATS) for hands-
free terminal testing.
[15] ITU-T Recommendation P.800 (1996), Methods for subjective determination of
transmission quality.
[16] ITU-T Recommendation P.831 (1998), Subjective performance evaluation of network echo
cancellers.
3.1 Abbreviations
This ITU-T Recommendation uses the following abbreviations:
DCME Digital Circuit Multiplication Equipment
ECD Echo Cancelling Device
HATS Head and Torso Simulator
HEC Half Echo Canceller
ITU International Telecommunication Union
ITU-T International Telecommunication Union − Telecommunication Standardization Sector
(former CCITT)
NLP Non-Linear Processor
RCV Receive
SCT Short Conversational Test
SND Send
TCLw Terminal Coupling Loss weighted
VAD Voice Activity Detection
3.2 Definitions
This ITU-T Recommendation defines the following term:
The term "Lab Conditions" is used in this ITU-T Recommendation in order to describe a testing
environment which is well controllable and which allows for speech quality testing under defined
and reproducible conditions.
4 General considerations
When evaluating the end-to-end speech transmission quality, it may be found that networks and
terminals impact the speech quality of a telephone connection quite significantly: coding and
decoding processes, introduction of additional delay, packetization and signal processing techniques
as implemented, e.g. in echo cancelling devices or DCME are mainly deployed in the network
domain but can increasingly be found in terminal devices as well. The frequency response and
loudness ratings of a connection are mainly determined by the terminals, the background noise and
the background noise transmission are highly influenced by the terminal and the acoustical
environment the terminal is exposed to. The conversational properties which are the most important
ones in a conversation are determined by the terminal in combination with the network: double talk
capability, switching characteristics and delay are dominant impairments which are often introduced.
Realistic Conditions
"Human Factor" End-to-end speech
transmission quality
Conversational Tests
2 subjects involved
(one at the near end of
the telephone connection, Difficulty in
the other at the far end) communicating
Sound quality
Comparison of individual
parameters under defined
conditions
Listening-only
Tests Classification of
2 artificial heads used disturbances
(one at the near end of
Measurement Conditions the telephone connection,
(exactly defined and the other at the far end)
Database for
identical for each test 1 subject as "observer"
further tests
and reproducible)
T1212320-00
NOTE – The assignment of "near end" and "far end" is chosen according to the E-model (ITU-T Recommendation G.107).
Realistic Conditions
"Human Factor" End-to-end speech
transmission quality
Conversational Tests
2 subjects involved
(one at the near end of
the telephone connection, Difficulty in
the other at the far end) communicating
Sound quality
Comparison of individual
parameters under defined
conditions
Listening-only
Tests Classification of
2 artificial heads used disturbances
(one at the near end of
Measurement Conditions the telephone connection,
(exactly defined and the other at the far end)
Database for
identical for each tests 1 subject as "observer"
further tests
and reproducible)
T1212330-00
NOTE – The assignment of "near end" and "far end" is chosen according to the E-model (ITU-T Recommendation G.107).
Realistic Conditions
"Human Factor" End-to-end speech
transmission quality
Conversational Tests
2 subjects involved
(one at the near end of
the telephone connection, Difficulty in
the other at the far end) communicating
Sound quality
Comparison of individual
parameters under defined
conditions
Listening-only
Tests Classification of
2 artificial heads used disturbances
(one at the near end of
Measurement Conditions the telephone connection,
(exactly defined and the other at the far end)
Database for
identical for each tests 1 subject as "observer"
further tests
and reproducible)
T1212340-00
NOTE – The assignment of "near end" and "far end" is chosen according to the E-model (ITU-T Recommendation G.107).
These parameter correlations have been identified in tests with hands-free terminals and network
echo cancelling devices. It should be noted, that these parameters are of general nature, the type of
device which introduces such a degradation into a telephone connection is of minor importance for
the subject exposed to this degradation
greeting
inquiry
question
precision
offer
order
questions
information treating of order
farewell
T1212350-00
Due to their fixed structure, the dialogues during SCTs may be significantly shorter than during
more traditional types of conversational tests; while offering this advantage SCTs are still
maintaining the various dialogue sections required for concise conversational tests. Various possible
situations (e.g. inquiries with railway information, travel agency, pizza service, theatre reservation,
medical appointment, flight schedule, car rental, etc.) increase the probability that the conversations
carried out remain interesting for the test subjects, and that different type of vocabulary is used.
Pictograms, tables, etc., can be used as formal and quasi-standardized means of test instruction. As
none of the subjects involved in a conversational test has a priori knowledge which information the
other subject is requesting, they cannot shorten the conversation after some dialogues (as it happens
when the same scenario is being repeated several times).
These types of tests have been used by several Administrations with success, e.g. for the subjective
evaluations of hands-free terminals and echo cancellers and are for further study.
One Administration found a very useful multi-criteria approach which had been employed to collect
the opinion of the subjects involved in the test; this approach follows the suggestions in ITU-T
Recommendation P.831 [16]. The following two types of questionnaire had been prepared for
conducting this test:
• a first type of questionnaire for each of the two subjects involved in the conversation under
test;
• a second type of questionnaire for the observer who has the task to listen to the very same
conversation under test as a third party.
Series E Overall network operation, telephone service, service operation and human factors
Series F Non-telephone telecommunication services
Series M TMN and network maintenance: international transmission systems, telephone circuits,
telegraphy, facsimile and leased circuits
Geneva, 2001