ITU-T - P.808
Subjective evaluation of speech quality with a crowdsourcing approach
| Organization: | ITU-T |
| Publication Date: | 1 June 2021 |
| Status: | active |
| Page Count: | 36 |
scope:
This Recommendation contains advice to administrations on conducting subjective tests of speech quality with a crowdsourcing approach. It focuses on listening-only tests including the absolute category rating (ACR), degradation category rating (DCR), comparison category rating (CCR), and methods for evaluating the subjective quality of speech in noise. Other rating tasks as well as conversational tests in the crowd are still under study in ITU-T Study Group 12. The methods described here are to be seen as complementary to the recommended methods in [ITU-T P.800] and [ITU-T P.835]; those methods are carried out in a laboratory environment which is better controlled, whereas the crowdsourcing-based methods described here cover a wider range of realistic listening environments and devices and thus their external validity may be higher.
Crowdsourcing-based methods are not expected to replace laboratory testing, as there are fundamental differences between both methods regarding their conception, the participants and their motivation, as well as technical and environmental factors, as detailed in [b-ITU-T Technical]. As a consequence, the results from crowdsourcing-based methods can be expected to deviate to a certain extent from those of laboratory testing. Depending on the target of the evaluation, the appropriate method has to be selected.
Further guidance on the general approach of crowdsourcing-based testing can be found in [b-ITU-T Technical].
Document History