Labelling Rules
1. This round is for selecting a quality inspection supplier: you need to conduct quality
inspection on audio that has already been annotated.
2. The quality inspection standards are identical to the labeling specifications written in this
document. Quality inspectors must listen to each audio file in full. Isolated transcription
spelling or sentence segmentation errors should be corrected directly by the inspector. If the
annotator has clearly misunderstood the rules, return the entire audio file.
3. Annotators whose data has been returned must revise it. If any rules are unclear or standards
are inconsistent, contact the responsible person, who will collect the questions and send them
to Aopeng for evaluation and confirmation.
4. Because the quality inspection standards are identical to the annotation specifications in this
document, the quality inspection supplier must also annotate original audio during the trial
phase. The trial results determine whether the supplier is capable of executing the project.
First, read the 'Labeling Standards' below, then open the 'Key Questions and Answers' online
form and review it carefully.
The online form is very important!
[Labeling Standards]
Before annotating, the annotator must assess the audio quality:
Confirm that the audio is a two-person conversation in the corresponding language. If it is not,
mark the entire audio as invalid; no labeling is needed. (If a whole audio file is not a
two-person conversation, the data is invalid!)
Check the background sound. Any audio containing background music (BGM), laughter, or
applause is marked invalid as a whole; no labeling is needed.
Check the accent. Accented data does not need to be annotated:
1. For English data: if the speaker's accent is neither European nor American (that is, the
listener judges that it is not standard English pronunciation, for example Indian English), the
entire audio is judged invalid.
2. For Chinese data: if the speaker has a distinct regional accent, the entire audio is judged
invalid.
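The three pre-checks above amount to a simple decision rule, which the following minimal sketch writes out. The class and field names are hypothetical and are not part of the annotation platform. The judgments themselves still come from a human listener; the code only combines them.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AudioAssessment:
    """A listener's judgments for one audio file (hypothetical field names)."""
    is_two_person_conversation: bool
    is_target_language: bool
    has_bgm_laughter_or_applause: bool
    has_disqualifying_accent: bool  # non-European/American English, or regional Chinese


def invalid_reasons(a: AudioAssessment) -> List[str]:
    """Return every reason the audio is invalid; an empty list means it can be labeled."""
    reasons = []
    if not (a.is_two_person_conversation and a.is_target_language):
        reasons.append("not a two-person conversation in the corresponding language")
    if a.has_bgm_laughter_or_applause:
        reasons.append("background music, laughter, or applause")
    if a.has_disqualifying_accent:
        reasons.append("disqualifying accent")
    return reasons


def is_valid(a: AudioAssessment) -> bool:
    """The audio is valid only when no pre-check fails."""
    return not invalid_reasons(a)
```

A single failed check is enough to make the whole audio invalid, so `is_valid` is true only when the list of reasons is empty.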
Be careful! Regardless of whether the template is marked valid or invalid, never change the
"Valid Data" button at the bottom left of the page. Click directly to confirm completion and
load the next item.
This content is for internal use only
Be careful! If the entire audio is invalid, select "Invalid" and press Ctrl+Delete to clear all
audio segments, then select "none" for the line attribute and submit. (Deletion cannot be
undone, so use it with caution.)
Pre-transcription is configured in the Appen task: a model automatically recognizes the speech
and distinguishes the speaker roles.
However, the tool's recognition is not fully accurate. Your task is to manually move and
correct the boxes and text that the tool marked incorrectly, and to fix the machine's
segmentation errors.
Be careful! Do not let the pre-recognition results interfere with your judgment, whether in
identifying the speaker roles or in checking the completeness and accuracy of each transcribed
sentence. Manual annotation based on human hearing and judgment prevails, not the
pre-recognition results.
1. Handling of leading and trailing silence: if the silence at the start or end of the audio
exceeds 500 ms, mark it with the tag # none; silence within 500 ms does not need to be labeled.
2. Use the tag # none for silence in the middle of the audio.
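The two silence rules can be sketched as a small helper. This is only an illustration under stated assumptions: the function and constant names are hypothetical, a silence of exactly 500 ms is treated as "within 500 ms", and middle silence is always tagged because the rule above states no threshold for it.

```python
from typing import Optional

SILENCE_THRESHOLD_MS = 500  # threshold for leading/trailing silence, from the rule above
NONE_TAG = "# none"         # tag text as written in this document


def silence_tag(position: str, duration_ms: int) -> Optional[str]:
    """Return the tag for a silent section, or None when no label is required.

    position is one of "start", "middle", "end" (hypothetical values).
    """
    if position == "middle":
        # No threshold is given for middle silence, so it is always tagged.
        return NONE_TAG
    if position in ("start", "end"):
        # Assumption: exactly 500 ms counts as "within 500 ms" and stays unlabeled.
        return NONE_TAG if duration_ms > SILENCE_THRESHOLD_MS else None
    raise ValueError(f"unknown position: {position!r}")
```

For example, `silence_tag("start", 800)` returns the tag, while `silence_tag("end", 300)` returns `None` because the trailing silence is short enough to leave unlabeled.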
In the original audio, if the first or last sentence was cut off by the machine, frame the
incomplete sentence as a whole and tag it # none, so that the invalid data at the beginning
and end is cut off. Start selecting the corresponding speaker role and transcribing only from
the point where a complete sentence appears.
Other instructions
1. Clicking the "little robot" in the text box loads the pre-transcribed text, which saves time
and improves efficiency.
2. For platform function keys, click the "?" in the upper right corner to view the function guide.
3. When speaker 1 or speaker 2 is selected, the transcription content cannot be empty.
4. A valid segment cannot exceed 10 seconds.
5. Required attributes
6. The first 'none' label in the left column is the platform default and cannot be deleted or
changed. Because the platform is configured so that a tag must be selected, a line cannot be
submitted without a tag, which means that each line must have two tags before it can be
submitted for acceptance.
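The submission constraints above (non-empty transcription for speaker segments, the 10-second limit, and two tags per line) can be expressed as a pre-submission check. The sketch below is hypothetical: the `Segment` structure and the tag strings are assumptions and do not reflect the platform's real data model. It also assumes that a "valid segment" means a segment assigned to speaker 1 or speaker 2.

```python
from dataclasses import dataclass
from typing import List

MAX_SEGMENT_SECONDS = 10.0                   # limit stated in the rules above
REQUIRED_TAG_COUNT = 2                       # "each line must have two tags"
SPEAKER_TAGS = {"speaker 1", "speaker 2"}    # assumed tag strings


@dataclass
class Segment:
    """One annotated line (hypothetical structure, not the platform's schema)."""
    start_s: float
    end_s: float
    tags: List[str]
    text: str = ""


def segment_problems(seg: Segment) -> List[str]:
    """Return the rule violations that would block submission of this segment."""
    problems = []
    if len(seg.tags) != REQUIRED_TAG_COUNT:
        problems.append("line must have exactly two tags")
    if SPEAKER_TAGS.intersection(seg.tags):
        # Speaker segments need a transcription and must stay within 10 seconds.
        if not seg.text.strip():
            problems.append("transcription cannot be empty for a speaker segment")
        if seg.end_s - seg.start_s > MAX_SEGMENT_SECONDS:
            problems.append("segment exceeds 10 seconds")
    return problems
```

A segment such as `Segment(0.0, 4.2, ["none", "speaker 1"], "hello there")` yields an empty list, while a 12-second speaker segment with no text yields two problems.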