A Brief History of Formats

The document discusses various data formats that can be used in SEGY files, including EBCDIC, ASCII, 16-bit integers, 32-bit integers, IBM floating point, and IEEE floating point. It also describes the structure of a standard SEGY file, which includes an EBCDIC header, binary header, trace header, and data samples for each trace. The binary header contains key parameters like sample interval, number of samples, and format code. Together, the headers and data samples allow seismic trace data to be exchanged in a standardized format.

Uploaded by

jifarina

A brief history of formats

Data can be stored in a computer in various formats. The SEGY format
itself can contain several formats within the same file depending on
what data is being represented.
A reminder:
There are 8 bits in a byte regardless of the computer.
1 Kilobyte is 1024 bytes or 2**10.
1 Megabyte is 1,048,576 bytes, or 1024 kilobytes or 2**20.
1 Gigabyte is 1,073,741,824 bytes, or 1024 megabytes, or 2**30.
The formats most likely to be encountered are:
EBCDIC Stands for "Extended Binary Coded Decimal Interchange Code".
A format for representing text data that was used at the time the
SEGY standard was developed. It has been largely supplanted by ASCII
(see below). Text data in SEGY files is still written in EBCDIC for
reasons of backward compatibility, although some PC based systems use
ASCII.
ASCII Stands for "American Standard Code for Information
Interchange". This is the standard format for text information on
almost all modern computers. ASCII uses 7 bits to represent the
letters of the English alphabet, the numbers 0-9, and the common
punctuation and control characters. The 8th bit of the byte is not
part of ASCII; it has been used as a parity bit or to select extended
characters. With 7 bits ASCII can represent only 128 characters, which
is enough for English but is inadequate for languages with accented
letters and for Asian languages with much larger character sets.
ASCII is now being replaced by Unicode, whose encodings (UTF-8, UTF-16
and UTF-32) can represent the characters of practically all the
world's writing systems. It is unlikely however that we will see
these character sets in SEGY for some time to come.
16 bit integer or short integer. A 16 bit (2 byte) integer. Short
integers are now largely obsolete. Modern computers deal with
numbers 32 bits (or more) at a time, and 16 bit arithmetic is
generally no faster than 32 bit arithmetic. A 16 bit integer can
represent a range of values from -32768 to +32767 (-2**15 to
2**15 - 1). The 16th bit is the sign bit. Obviously a 16 bit integer
cannot represent a UTM coordinate or a trace number in a large 3D
data volume. It was used primarily to save space and because the
most powerful computers in existence at the time worked with data in
16 bit chunks.
32 bit or long integer. The format used for integers in modern
computers. This format can represent values from -2,147,483,648 to
+2,147,483,647 (-2**31 to 2**31 - 1). 32 bit integers have been used
since the inception of the SEGY standard for large values such as UTM
coordinates.
IBM floating point. A 32 bit (4 byte) floating point format. This was
the standard floating point format at the time the SEGY standard was
set down. It is not the native floating point format of modern PCs
and workstations. Most SEGY data is still written in IBM float however
and any program dealing with SEGY must be able to make the conversion.
IEEE floating point. A 32 bit (4 byte) floating point format. The
modern standard for floating point values. Used internally by most
computers. Some SEGY data, particularly that which is written for a
PC based system contains IEEE floating point data.
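The IBM float conversion mentioned above can be sketched as follows. This is a minimal illustration, not the method of any particular program: it assumes the usual IBM single precision layout (1 sign bit, a 7 bit power-of-16 exponent biased by 64, and a 24 bit fraction) stored high order byte first. The function name ibm32_to_float is made up for this example.

```python
import struct

def ibm32_to_float(raw: bytes) -> float:
    """Convert 4 bytes (high order byte first) holding an IBM float."""
    (word,) = struct.unpack(">I", raw)
    sign = -1.0 if word >> 31 else 1.0
    exponent = (word >> 24) & 0x7F   # power of 16, biased by 64
    fraction = word & 0x00FFFFFF     # 24 bit fraction, no hidden bit
    return sign * (fraction / 2**24) * 16.0 ** (exponent - 64)

print(ibm32_to_float(bytes.fromhex("42640000")))  # 100.0
print(ibm32_to_float(bytes.fromhex("C276A000")))  # -118.625
```

A production reader would normally convert whole traces at once rather than one sample per call; the per-sample form is used here only to keep the bit layout visible.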
A further consideration is byte order. Personal computers use "little
endian" or "low order byte first". Sun and other workstations use "big
endian" or "high order byte first".
To understand the difference consider the number "1" written in
binary as a 16 bit (2 byte) integer. In low byte order it would appear
as
00000001 00000000
In high byte order it would appear as
00000000 00000001
Obviously reading a low order byte number as high order would result
in an error. Instead of "1" the number would be interpreted as "256".
Most SEGY data has been written with the high order byte first but
this is changing as PCs are used more and more. Any program
dealing with SEGY data should be able to work with either byte order.
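The byte order example above can be reproduced with Python's struct module. The helper guess_byte_order is a hypothetical heuristic added for illustration, not part of the SEGY standard or of SegyTool: it assumes that the 2 byte format code is a small number (1 to 16 here) and picks the byte order in which it reads as one.

```python
import struct

# The number 1 as a 16 bit integer in each byte order.
little = struct.pack("<h", 1)   # low order byte first:  01 00
big = struct.pack(">h", 1)      # high order byte first: 00 01

# Reading the low-order-first bytes as high-order-first gives 256.
misread = struct.unpack(">h", little)[0]
print(little.hex(), big.hex(), misread)  # 0100 0001 256

def guess_byte_order(format_field: bytes):
    """Guess the byte order from the 2 byte format code field.

    Returns ">" (high order byte first), "<" (low order byte first),
    or None if the field is not a plausible code in either order.
    """
    as_big = struct.unpack(">H", format_field)[0]
    as_little = struct.unpack("<H", format_field)[0]
    if 1 <= as_big <= 16:
        return ">"
    if 1 <= as_little <= 16:
        return "<"
    return None

print(guess_byte_order(b"\x00\x01"))  # >
print(guess_byte_order(b"\x01\x00"))  # <
```

The returned character can be used directly as the byte order prefix in later struct format strings.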

Segy format overview

The SEGY format has been adopted by the SEG as a standard for
trace sequential seismic data. The SEGY format is widely supported
and is in fact used almost exclusively for the exchange of
seismic data. All geophysical interpretation workstations read SEGY
and some even use SEGY as their internal format.
With a standard so widely used there are, of course, millions of tapes
and disk files in existence containing SEGY data.
SegyTool does not read SEGY data from tape so the procedures for
reading SEGY data from tape will not be covered here. The essential
layout of a SEGY data set is the same whether on disk or tape.
The SEGY standard is made up of:
1. An EBCDIC format header of exactly 3200 bytes. There is one
EBCDIC header per SEGY file. This header contains text which
(hopefully) describes the area name, line name, shotpoint range,
recording parameters, and processing history. Not all EBCDIC headers
are so informative but most do contain the area name and line name.
Information is usually written in 40 lines of 80 characters each.

2. Binary header of exactly 400 bytes. There is only one binary header
per file. This header contains the number of samples, sample interval,
and format code. The layout of the binary header, with bytes counted
from the start of the binary header, is as follows.

   Bytes 17-18  Sample interval in microseconds for this file.
   Bytes 19-20  Sample interval in microseconds as originally
                recorded in the field.
   Bytes 21-22  Number of samples for this file.
   Bytes 23-24  Number of samples as originally recorded in the
                field.
   Bytes 25-26  Format code.
                1 = IBM float
                2 = 32 bit integer
                3 = 16 bit integer
                6 = IEEE float (SEG-Y revision 1 assigns code 5 to
                    4 byte IEEE float, so expect either value)

There are many other data fields in the binary header but the above
represent the critical values for viewing and editing.

3. Trace header of exactly 240 bytes. There is one trace header per trace. The
header contains information about the trace such as shotpoint number, CDP,
and survey locations. The number of samples and sample rate for each
trace are also written in the header.

4. Data samples. Each trace consists of a trace header followed by n data
samples where n is the number of samples per trace as defined in the trace
header. Note that most programs that read SEGY disk files, including
SegyTool, set the number of samples by the value in the binary header and
assume a consistent number of bytes for each trace. The number of bytes
per sample is dependent upon the format of the data samples. Floating
point and 32 bit integers use 4 bytes per sample, 16 bit integer uses 2 bytes,
8 bit only one. The most common sample formats are IBM float and 16 bit
integer, although SeisX uses IEEE float instead of IBM. This is for
performance reasons as IEEE is the native floating point format for the
computers SeisX is run on. 32 bit and 8 bit integer samples are rarely, if
ever, seen. Note that the sample format has nothing to do with the format
of trace header values such as shotpoints or XY coordinates.
3 and 4 above are repeated for each trace in the file.
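Reading the two file headers described above might look like the sketch below. It rests on several assumptions that are not stated in the text: the text header uses EBCDIC code page 037 (Python codec "cp037"), the binary header is written high order byte first, and the five fields are read as unsigned 16 bit integers. The function name read_file_headers and the synthetic header contents are invented for the example.

```python
import struct

EBCDIC_LEN = 3200   # text header size in bytes
BINARY_LEN = 400    # binary header size in bytes

def read_file_headers(data: bytes, byte_order: str = ">") -> dict:
    """Decode the text header and the key binary header fields."""
    text = data[:EBCDIC_LEN].decode("cp037")   # assumed code page
    lines = [text[i:i + 80] for i in range(0, EBCDIC_LEN, 80)]
    binary = data[EBCDIC_LEN:EBCDIC_LEN + BINARY_LEN]
    # Bytes 17-26 hold five 2 byte fields; byte 17 is offset 16.
    fields = struct.unpack_from(byte_order + "5H", binary, 16)
    return {
        "text_lines": lines,
        "sample_interval_us": fields[0],
        "field_sample_interval_us": fields[1],
        "samples_per_trace": fields[2],
        "field_samples_per_trace": fields[3],
        "format_code": fields[4],
    }

# Build a synthetic 3600 byte header block to exercise the function.
card = "C 1 AREA: EXAMPLE  LINE: TEST-01".ljust(80)
ebcdic = (card + " " * 80 * 39).encode("cp037")
binary = bytearray(BINARY_LEN)
struct.pack_into(">5H", binary, 16, 4000, 4000, 1500, 1500, 1)

headers = read_file_headers(ebcdic + bytes(binary))
print(headers["text_lines"][0].rstrip())
print(headers["sample_interval_us"], headers["samples_per_trace"],
      headers["format_code"])
```

For a file whose text header was written in ASCII, decoding with "ascii" instead of "cp037" would be the corresponding change.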
The number of samples multiplied by the sample rate in milliseconds yields
the record length. The number of bytes per trace can be computed from the
number of samples multiplied by the bytes per sample plus 240 bytes for
the trace header. The overall size of the file will be exactly the number of
bytes per trace times the number of traces plus 400 bytes for the binary
header and 3200 bytes for the EBCDIC header.
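The size arithmetic above can be written out as a short sketch. The function names are invented for the example, and the bytes-per-sample table follows the format codes listed in this document, plus code 5, which SEG-Y revision 1 assigns to 4 byte IEEE float.

```python
FILE_HEADER_BYTES = 3200 + 400   # EBCDIC header plus binary header
TRACE_HEADER_BYTES = 240

# Bytes per sample for each format code.
BYTES_PER_SAMPLE = {
    1: 4,  # IBM float
    2: 4,  # 32 bit integer
    3: 2,  # 16 bit integer
    5: 4,  # IEEE float (SEG-Y revision 1 code)
    6: 4,  # IEEE float (as listed in this document)
}

def trace_length_bytes(nsamples: int, format_code: int) -> int:
    """Bytes per trace: 240 byte header plus the data samples."""
    return TRACE_HEADER_BYTES + nsamples * BYTES_PER_SAMPLE[format_code]

def expected_file_size(ntraces: int, nsamples: int, format_code: int) -> int:
    """Total file size for a file with fixed length traces."""
    return FILE_HEADER_BYTES + ntraces * trace_length_bytes(nsamples, format_code)

def trace_count(file_size: int, nsamples: int, format_code: int) -> int:
    """Number of traces implied by the file size; rejects a ragged size."""
    count, remainder = divmod(file_size - FILE_HEADER_BYTES,
                              trace_length_bytes(nsamples, format_code))
    if remainder:
        raise ValueError("file size is not a whole number of traces")
    return count

def record_length_ms(nsamples: int, sample_interval_us: int) -> float:
    """Record length in milliseconds from a microsecond sample interval."""
    return nsamples * sample_interval_us / 1000

print(trace_length_bytes(1500, 1))            # 6240
print(expected_file_size(1000, 1500, 1))      # 6243600
print(trace_count(6243600, 1500, 1))          # 1000
print(record_length_ms(1500, 4000))           # 6000.0
```

Comparing the actual file size with expected_file_size is a quick consistency check on the binary header values before any traces are read.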
