Ebu STL
Ebu STL
Contents
1. Introduction ....................................................................................................................................................3
2. Medium for exchange.....................................................................................................................................3
3. Operating system and filename ......................................................................................................................3
4. Structure of the datafile ..................................................................................................................................5
4.1. Basic structure of the datafile .................................................................................................................5
4.2. General Subtitle Information (GSI) block ..............................................................................................5
4.3. Text and Timing Information (TTI) block............................................................................................10
5. Character code tables....................................................................................................................................14
Appendix 1 Character code pages used in the GSI block .................................................................................15
Appendix 2 Character code tables used in TTI blocks ........................................................................................19
Appendix 3 Code table for languages used in TTI blocks..................................................................................25
Appendix 4 3-letter codes for use in the CO field ...............................................................................................27
2
EBU - Subtitling data exchange format tech 3264
1. Introduction
The importance of subtitling is increasing in the television and film industries. In many European
countries, subtitling is the most common method of conveying the content of foreign language dialogue to the
audience, and a broadcaster's audience may now include several major linguistic groups (notably a satellite-
broadcaster).
Subtitles are also provided increasingly by broadcasters to meet the needs of the significant numbers of
deaf and hard-of-hearing viewers.
A broadcaster may use two methods of providing subtitles to its audience. Subtitles may be inlaid into
the picture at source. This is known as in-vision or "open" subtitling. Subtitles may also be conveyed to the
audience via encoded data added to the transmitted signal (e.g. teletext), also known as "closed" subtitling.
Considering that:
- a broadcaster may wish to buy or sell programmes complete with subtitles already available in an
appropriate language;
- a broadcaster may use an external company for the supply of some or all its subtitles;
- broadcasters may wish to buy compatible subtitling equipment from a number of independent suppliers;
- broadcasters may wish to harmonize the storage of subtitle data for in-vision and encoded subtitles;
the EBU has standardised a data file format for use with a personal computer to enable the exchange of in-
vision and teletext subtitles. The format is intended for use by broadcasters at both national and international
levels and is described below.
3
tech 3264 EBU - Subtitling data exchange format
4
EBU - Subtitling data exchange format tech 3264
GSI TTI TTI TTI TTI TTI TTI ….. TTI TTI
.
1024 128 128 128 128 128 128 ….. 128 128
The GSI block carries general information such as display standard, language, programme title in
original and local language, etc.
A TTI block normally includes the information necessary to define one subtitle. Exceptionally,
additional TTI blocks, called Extension Blocks, may be used.
The GSI and TTI blocks each include user-definable areas.
1. Where a character code or a decimal value is described in hexadecimal notation, the suffix h is used:
e.g. FFh.
5
tech 3264 EBU - Subtitling data exchange format
6
EBU - Subtitling data exchange format tech 3264
7
tech 3264 EBU - Subtitling data exchange format
8
EBU - Subtitling data exchange format tech 3264
Publisher (PUB)
Name of the publisher of the subtitle list. 32 characters are available.
9
tech 3264 EBU - Subtitling data exchange format
4 1 Cumulative Status CS
13 1 Vertical Position VP
14 1 Justification Code JC
15 1 Comment Flag CF
10
EBU - Subtitling data exchange format tech 3264
TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI
SN: 149 150 151 152 152 152 153 154 155 156 156 157 158 159 160 …
EBN: FFh FFh FFh 00h 01h FFh FFh FFh FFh FFh FFh FFh FFh FFh FFh
TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI TTI
… 880 881 882 883 883 883 883 884 885 886 886 887 888 889 890 …
FFh FFh FFh 00h 01h FFh FFh FFh FFh FFh FFh FFh FFh FFh FFh
There may be one or more intermediate subtitles with CS code 02h (or none at all, if there are two
subtitles in the cumulative set).
11
tech 3264 EBU - Subtitling data exchange format
12
EBU - Subtitling data exchange format tech 3264
If JC is set to 00h, the text and box positioning are determined by all the spaces and control characters
which accompany the subtitle text in the Text Field..
If JC is set to 01h, 02h or 03h, all leading and trailing spaces in the Text Field are ignored. The overall
horizontal alignment of the subtitle display area will be determined by the users' equipment, and the text will be
left or right justified, or centred, within that display area according to the content of the JC code.
13
tech 3264 EBU - Subtitling data exchange format
14
EBU - Subtitling data exchange format tech 3264
Appendix 1
Character code pages used in the GSI block
This Appendix reproduces the five code pages accepted for used in the General Subtitle
Information (GSI) block, in the context of international subtitle exchanges. These are the five code
pages supported by MS-DOS/PC-DOS, version 3.3.
15
tech 3264 EBU - Subtitling data exchange format
16
EBU - Subtitling data exchange format tech 3264
17
tech 3264 EBU - Subtitling data exchange format
18
EBU - Subtitling data exchange format tech 3264
Appendix 2
First byte →
Second
byte 0 1 2 3 4 5 6 7 8 9 A B C D E F
↓ 0 SP 0 @ P ` p NBSP
1 ! 1 A Q a q
2 " 2 B R b r
3 # 3 C S c s
4 ¤ 4 D T d t ×
5 % 5 E U e u
6 & 6 F V f v
7 ' 7 G W g w •
8 ( 8 H X h x ÷
9 ) 9 I Y i y
A * : J Z j z ! " #
B + ; K [ k { $ % & '
C , < L \ l | ← ( ) *
D - = M ] m } ↑ +
E . > N ^ n ~ → ,
F / ? O _ o ↓ - SHY
Notes:
(1) The SPACE character is located on position 20h of the code table
(2) NBSP = no-break space
(3) SHY = soft hyphen
19
tech 3264 EBU - Subtitling data exchange format
First byte →
Second
byte 0 1 2 3 4 5 6 7 8 9 A B C D E F
↓ 0 SP 0 @ P p NBSP
N°
1 ! 1 A Q a q
2 " 2 B R b r
3 # 3 C S c s
4 $ 4 D T d t
5 % 5 E U e u !
7 ' 7 G W g w ( ) * + , -
8 ( 8 H X h x . / 0 1 2 3
9 ) 9 I Y i y 4 5 6 7 8 9
A * : J Z j z 4 : ; < = >
B + ; K [ k { ? @ A B C D
C , < L \ l ¦ E F G H I J
D - = M ] M } SHY K L M N §
E . > N ^ N ~ O P Q R S T
F / ? O _ o U V W X Y Z
Note: For the Ruthenian language, the characters in code positions Ah/5h [\]^_`]ah/5h][!\]^bc
bcde^fc`]gh]i]^_`]jk]bcldcfmnocehp
20
EBU - Subtitling data exchange format .tech 3264
First byte →
Second
byte 0 1 2 3 4 5 6 7 8 9 A B C D E F
↓ 0 SP 0 @ P p NBSP
1 ! 1 A Q a q
2 " 2
B R b r
3 # 3 C S c s
4 $ 4 D T d t
5 % 5 E U e u
6 & 6 F V f v
7 ' 7 G W g w ! "
8 ( 8 # H X h x $ % &
9 ) 9 ' I Y i y ( ) *
A * : J Z j z + , -
B + ; K [ k { . / 0
C , < L \ l ¦ 1 2 3
D - = M ] M } SHY 4 5
E . > N ^ N ~ 6 7
F / ? O _ o 8 9 :
21
tech 3264 . EBU - Subtitling data exchange format
First byte →
Second
byte 0 1 2 3 4 5 6 7 8 9 A B C D E F
↓ 0 SP 0 @ P ` p NBSP
1 ! 1 A Q a q ±
2 " 2 B R b r ²
3 # 3 C S c s £ ³
4 $ 4 D T d t
5 % 5 E U e u
6 & 6 F V f v ¦
7 ' 7 G W g w § ! "
8 ( 8 H X h x ¨ # $ % & '
9 ) 9 I Y i y © ( ) * + ,
A * : J Z j z - . / 0 1
B + ; K [ k { « » 2 3 4 5
C , < L \ l | ¬ 6 7 8 9 :
E . > N ^ n ~ ? @ A B C
F / ? O _ o - D E F G
22
EBU - Subtitling data exchange format .tech 3264
First byte →
Second
byte 0 1 2 3 4 5 6 7 8 9 A B C D E F
↓ 0 SP 0 @ P p NBSP °
1 ! 1 A Q a q ±
2 " 2 B R b r ¢ ²
3 # 3 C S c s £ ³
4 $ 4 D T d t ¤ ´
5 % 5 E U e u ¥ µ
6 & 6 F V f v ¦ ¶
7 ' 7 G W g w § ·
8 ( 8 H X h x ¨ ¸
9 ) 9 I Y i y © ¹
A * : J Z j z × ÷
B + ; K [ k { « »
C , < L \ l | ¬ ¼
D - = M ] m } SHY ½
E . > N ^ n ~ ® ¾
F / ? O _ o ¯
23
tech 3264 EBU - Subtitling data exchange format
Notes
(1) Presumed at the start of each display row
(2) Action "set at" other are "set after"
(3) Two consecutive codes are sent, actio takes place between them
(4) No action at level 1
(5) Reserved code
24
EBU - Subtitling data exchange format tech 3264
Appendix 3
Code table for languages used in TTI blocks
This Appendix lists the codes used in the Language Code (LC) field of the GSI block. These codes
indicate the language used in the Text Field of the Text and Timing Information (TTI) blocks. The codes are
identical to those used in the systems of the MAC/packet family, and are reproduced from the EBU
specification for these systems [3].
25
tech 3264 EBU - Subtitling data exchange format
Other languages
26
EBU - Subtitling data exchange format tech 3264
Appendix 4
3-letter codes for use in the CO field
The three-letter (•Alpha-3") codes given in the table below are for use in the Country o f Origin (CO)
field of the GSI block. The codes are those given in ISO Standard 3166-1988 [5]. The country names indicated
here are not necessarily the official names of the countries or territories. The information given here does not
imply, on the part of the European Broadcasting Union, any opinion regarding the political status of these
countries or territories.
ISO ISO
"Alpha-3" Country "Alpha-3" Country
code code
27
tech 3264 EBU - Subtitling data exchange format
28
EBU - Subtitling data exchange format tech 3264
ISO ISO
"Alpha-3" Country "Alpha-3" Country
code code
29
tech 3264 EBU - Subtitling data exchange format
Notes (relevant only in the context of the EBU subtitle data exchange format):
1. Codes shown in brackets were formerly attributed to certain countries. The new codes should be used
instead (e.g. UMI should always replace WAK).
2. Codes in the series AAA-AAZ, QMA-QZZ, XAA-XZZ, and ZZA-ZZZ should not be used in the CO field.
3. The escape code OOO should not be used in the CO field.
30
EBU - Subtitling data exchange format tech 3264
Bibliography
Editeur responsable : G.T. Waters, Ancienne Route 17A, CH-1218 Grand-Saconnex/Genève, Suisse.
31