Man Page For Ffmpeg
FFMPEG(1)
NAME
ffmpeg - ffmpeg video converter
SYNOPSIS
ffmpeg [global options] [[infile options][-i infile]]... {[outfile
options] outfile}...
DESCRIPTION
ffmpeg is a very fast video and audio converter that can also grab from
a live audio/video source. It can also convert between arbitrary sample
rates and resize video on the fly with a high quality polyphase filter.
ffmpeg reads from an arbitrary number of input "files" (which can be
regular files, pipes, network streams, grabbing devices, etc.),
specified by the "-i" option, and writes to an arbitrary number of
output "files", which are specified by a plain output filename.
Anything found on the command line which cannot be interpreted as an
option is considered to be an output filename.
Each input or output file can in principle contain any number of
streams of different types (video/audio/subtitle/attachment/data).
The allowed number and/or types of streams can be limited by the
container format. Selecting which streams from which inputs go into
the output is done either automatically or with the "-map" option (see
the Stream selection chapter).
To refer to input files in options, you must use their indices
(0-based). E.g. the first input file is 0, the second is 1 etc.
Similarly, streams within a file are referred to by their indices. E.g.
"2:3" refers to the fourth stream in the third input file. See also the
Stream specifiers chapter.
As a general rule, options are applied to the next specified file.
Therefore, order is important, and you can have the same option on the
command line multiple times. Each occurrence is then applied to the
next input or output file. Exceptions to this rule are the global
options (e.g. verbosity level), which should be specified first.
Do not mix input and output files -- first specify all input files,
then all output files. Also do not mix options which belong to
different files. All options apply ONLY to the next input or output
file and are reset between files.
X:\Z_ELEMENTAL_TESTING\ffmpeg_man.txt
To force the frame rate of the input file (valid for raw formats
only) to 1 fps and the frame rate of the output file to 24 fps:
ffmpeg -r 1 -i input.m2v -r 24 output.avi
The format option may be needed for raw input files.
STREAM SELECTION
By default ffmpeg includes only one stream of each type (video, audio,
subtitle) present in the input files and adds them to each output file.
It picks the "best" of each based upon the following criteria: for
video it is the stream with the highest resolution, for audio the
stream with the most channels, and for subtitles it is the first
subtitle stream. When several streams of the same type rate equally,
the lowest numbered stream is chosen.
You can disable some of those defaults by using "-vn/-an/-sn" options.
For full manual control, use the "-map" option, which disables the
defaults just described.
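The default selection rule can be sketched in Python as follows. This is
only an illustration of the criteria stated above, not ffmpeg's own code.
The Stream record and its field names are hypothetical, "highest
resolution" is assumed to mean the largest pixel count, and all inputs
are flattened into a single list for simplicity.

```python
from dataclasses import dataclass


@dataclass
class Stream:
    index: int          # position of the stream within its input
    kind: str           # "video", "audio" or "subtitle"
    width: int = 0      # video only
    height: int = 0     # video only
    channels: int = 0   # audio only


def pick_defaults(streams):
    """Return one stream per type, following the criteria described above."""
    score = {
        "video": lambda s: s.width * s.height,  # highest resolution (assumed: pixel count)
        "audio": lambda s: s.channels,          # most channels
        "subtitle": lambda s: 0,                # all rate equally, so the first wins
    }
    chosen = {}
    # Visit streams in index order. A later stream replaces the current pick
    # only if it scores strictly higher, so ties go to the lowest index.
    for s in sorted(streams, key=lambda s: s.index):
        if s.kind not in score:
            continue
        best = chosen.get(s.kind)
        if best is None or score[s.kind](s) > score[s.kind](best):
            chosen[s.kind] = s
    return chosen
```

With two audio streams that both have two channels, the sketch keeps the
one with the lower index, which mirrors the tie-breaking rule.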
OPTIONS
All the numerical options, if not specified otherwise, accept as input
a string representing a number, which may contain one of the
International System number postfixes, for example K, M, G. If
i is appended after the postfix, powers of 2 are used instead of
powers of 10. The B postfix multiplies the value by 8, and can be
appended after another postfix or used alone. This allows using, for
example, KB, MiB, G and B as postfixes.
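As a rough illustration of the postfix rules, the following Python
sketch converts such strings to numbers. It is an assumption-laden model
rather than ffmpeg's parser: the function name parse_number is
hypothetical, and only the k, K, M and G postfixes are handled.

```python
import re

# (power-of-10 multiplier, power-of-2 multiplier) for each handled postfix
_POSTFIX = {
    "": (1, 1),
    "k": (10**3, 2**10),
    "K": (10**3, 2**10),
    "M": (10**6, 2**20),
    "G": (10**9, 2**30),
}


def parse_number(text):
    """Parse strings such as "128k", "1MiB" or "2B" into a float."""
    m = re.fullmatch(r"(\d+(?:\.\d+)?)([kKMG]?)(i?)(B?)", text)
    if m is None:
        raise ValueError(f"cannot parse {text!r}")
    number, postfix, binary, byte = m.groups()
    if binary and not postfix:
        raise ValueError("'i' must follow a postfix")
    decimal_mult, binary_mult = _POSTFIX[postfix]
    value = float(number) * (binary_mult if binary else decimal_mult)
    if byte:
        value *= 8  # the B postfix multiplies the value by 8
    return value
```

For instance, parse_number("1MiB") combines all three rules: M selects
mega, i switches to 2^20, and B multiplies the result by 8.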
Options which do not take arguments are boolean options, and set the
corresponding value to true. They can be set to false by prefixing the
option name with "no"; for example, using "-nofoo" on the command line
sets the boolean option named "foo" to false.
Stream specifiers
Some options are applied per-stream, e.g. bitrate or codec. Stream
specifiers are used to precisely specify which stream(s) a given
option belongs to.
A stream specifier is a string generally appended to the option name
and separated from it by a colon. E.g. the "-codec:a:1 ac3" option
contains the "a:1" stream specifier, which matches the second audio
stream. Therefore it would select the ac3 codec for the second audio
stream.
A stream specifier can match several streams, in which case the option
is applied to all of them. E.g. the stream specifier in "-b:a 128k"
matches all audio streams.
An empty stream specifier matches all streams, for example "-codec
copy" or "-codec: copy" would copy all the streams without reencoding.
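The way a per-stream option is matched against streams can be modelled
with a short Python sketch. It covers only the specifier forms used in
the examples above (empty, a stream type, and a type followed by an
index); the function names and the list-of-type-letters representation
are assumptions made for the illustration.

```python
def matching_streams(specifier, stream_types):
    """Indices of the streams matched by a specifier such as "", "a" or "a:1".

    stream_types holds one type letter per stream, e.g. ["v", "a", "a", "s"].
    """
    if specifier == "":
        return list(range(len(stream_types)))   # empty specifier: all streams
    kind, _, nth = specifier.partition(":")
    of_kind = [i for i, t in enumerate(stream_types) if t == kind]
    if nth == "":
        return of_kind                           # e.g. "a": every audio stream
    n = int(nth)
    return of_kind[n:n + 1]                      # e.g. "a:1": second audio stream


def apply_option(option, value, stream_types):
    """Split "-name[:specifier]" and return {stream index: (name, value)}."""
    name, _, specifier = option.lstrip("-").partition(":")
    return {i: (name, value) for i in matching_streams(specifier, stream_types)}
```

For a file with one video, two audio and one subtitle stream,
apply_option("-codec:a:1", "ac3", ["v", "a", "a", "s"]) touches only
stream 2, while "-b:a" touches streams 1 and 2.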
Show license.
Decoding available
Encoding available
-codecs
Show available codecs.
The fields preceding the codec names have the following meanings:
D       Decoding available
E       Encoding available
V/A/S   Video/audio/subtitle codec
S
-bsfs
Show available bitstream filters.
-protocols
Show available protocols.
-filters
Show available libavfilter filters.
-pix_fmts
Show available pixel formats.
-sample_fmts
Show available sample formats.
-loglevel loglevel | -v loglevel
Set the logging level used by the library. loglevel is a number or
a string containing one of the following values:
quiet
panic
fatal
error
warning
info
verbose
debug
By default the program logs to stderr. If coloring is supported by
the terminal, colors are used to mark errors and warnings. Log
coloring can be disabled by setting the environment variable
AV_LOG_FORCE_NOCOLOR or NO_COLOR, or can be forced by setting the
environment variable AV_LOG_FORCE_COLOR. The use of the
environment variable NO_COLOR is deprecated and will be dropped in
a future FFmpeg version.
-report
Dump full command line and console output to a file named
"program-YYYYMMDD-HHMMSS.log" in the current directory. This file
can be useful for bug reports. It also implies "-loglevel
verbose".
Note: setting the environment variable "FFREPORT" to any value has
the same effect.
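The report file name pattern can be reproduced with a few lines of
Python, for example to locate the newest report in a script. The helper
name report_filename is hypothetical, and the sketch assumes that
"program" stands for the name of the running tool and that the
timestamp is the start time.

```python
from datetime import datetime


def report_filename(program, when):
    """Build "program-YYYYMMDD-HHMMSS.log" for the given start time."""
    return f"{program}-{when:%Y%m%d-%H%M%S}.log"
```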
AVOptions
These options are provided directly by the libavformat, libavdevice and
libavcodec libraries. To see the list of available AVOptions, use the
-help option. They are separated into two categories:
generic
These options can be set for any container, codec or device.
Generic options are listed under AVFormatContext options for
containers/devices and under AVCodecContext options for codecs.
private
These options are specific to the given container, device or codec.
Private options are listed under their corresponding
containers/devices/codecs.
For example, to write an ID3v2.3 header instead of a default ID3v2.4 to
an MP3 file, use the id3v2_version private option of the MP3 muxer:
ffmpeg -i input.flac -id3v2_version 3 out.mp3
All codec AVOptions are per-stream, so the chapter on stream
specifiers applies to them.
Note: the -nooption syntax cannot be used for boolean AVOptions; use
-option 0/-option 1 instead.
Note 2: the old undocumented way of specifying per-stream AVOptions by
prepending v/a/s to the option name is now obsolete and will be
removed soon.
Main options
-f fmt (input/output)
Force input or output file format. The format is normally auto
detected for input files and guessed from file extension for output
files, so this option is not needed in most cases.
-i filename (input)
input file name
-y (global)
Overwrite output files without asking.
-n (global)
Do not overwrite output files but exit if file exists.
-c[:stream_specifier] codec (input/output,per-stream)
-codec[:stream_specifier] codec (input/output,per-stream)
Select an encoder (when used before an output file) or a decoder
(when used before an input file) for one or more streams. codec is
the name of a decoder/encoder or a special value "copy" (output
only) to indicate that the stream is not to be re-encoded.
For example
ffmpeg -i INPUT -map 0 -c:v libx264 -c:a copy OUTPUT
encodes all video streams with libx264 and copies all audio
streams.
now|([(YYYY-MM-DD|YYYYMMDD)[T|t| ]]((HH[:MM[:SS[.m...]]])|(HH[MM[SS[.m...]]]))[Z|z])
If the value is "now" it takes the current time. Time is local
time unless Z or z is appended, in which case it is interpreted
as UTC. If the year-month-day part is not specified it takes the
current year-month-day.
-metadata[:metadata_specifier] key=value (output,per-metadata)
Set a metadata key/value pair.
An optional metadata_specifier may be given to set metadata on
streams or chapters. See "-map_metadata" documentation for details.
This option overrides metadata set with "-map_metadata". It is also
possible to delete metadata by using an empty value.
For example, for setting the title in the output file:
ffmpeg -i in.avi -metadata title="my title" out.flv
The
qvga    320x240
vga     640x480
svga    800x600
xga     1024x768
uxga    1600x1200
qxga    2048x1536
sxga    1280x1024
qsxga   2560x2048
hsxga   5120x4096
wvga    852x480
wxga    1366x768
wsxga   1600x1024
wuxga   1920x1200
woxga   2560x1600
wqsxga  3200x2048
wquxga  3840x2400
whsxga  6400x4096
whuxga  7680x4800
cga     320x200
ega     640x350
hd480   852x480
hd720   1280x720
hd1080  1920x1080
-aspect[:stream_specifier] aspect (output,per-stream)
Set the video display aspect ratio specified by aspect.
aspect can be a floating point number string, or a string of the
form num:den, where num and den are the numerator and denominator
of the aspect ratio. For example "4:3", "16:9", "1.3333", and
"1.7777" are valid argument values.
-croptop size
-cropbottom size
-cropleft size
-cropright size
All the crop options have been removed. Use -vf
crop=width:height:x:y instead.
-padtop size
-padbottom size
-padleft size
-padright size
-padcolor hex_color
All the pad options have been removed. Use -vf
pad=width:height:x:y:color instead.
-vn (output)
Disable video recording.
-bt tolerance
Set video bitrate tolerance (in bits, default 4000k). Has a
minimum value of: (target_bitrate/target_framerate). In 1-pass
mode, bitrate tolerance specifies how far ratecontrol is willing to
deviate from the target average bitrate value. This is not related
to min/max bitrate. Lowering tolerance too much has an adverse
effect on quality.
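The lower bound on the tolerance is simply the average number of bits
available per frame. A minimal Python sketch of that arithmetic (the
function name is hypothetical):

```python
def min_bitrate_tolerance(target_bitrate, target_framerate):
    """Smallest tolerance allowed by the rule above, in bits."""
    return target_bitrate / target_framerate
```

For a 4000k target at 25 frames per second this gives 160000 bits, so
any smaller -bt value would fall below the documented minimum.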
-maxrate bitrate
Set max video bitrate (in bit/s).
-minrate bitrate
Set min video bitrate (in bit/s).
encode:
ffmpeg -i myfile.avi -b:v 4000k -minrate 4000k -maxrate 4000k -bufsize 1835k out.m2v
It is of little use otherwise.
-bufsize size
Set video buffer verifier buffer size (in bits).
-vcodec codec (output)
Set the video codec. This is an alias for "-codec:v".
-same_quant
Use same quantizer as source (implies VBR).
Note that this is NOT SAME QUALITY. Do not use this option unless
you know you need it.
-pass n
Select the pass number (1 or 2). It is used to do two-pass video
encoding. The statistics of the video are recorded in the first
pass into a log file (see also the option -passlogfile), and in the
second pass that log file is used to generate the video at the
exact requested bitrate. On pass 1, you may just deactivate audio
and set the output to null. Examples for Windows and Unix:
ffmpeg -i foo.mov -c:v libxvid -pass 1 -an -f rawvideo -y NUL
ffmpeg -i foo.mov -c:v libxvid -pass 1 -an -f rawvideo -y /dev/null
-passlogfile prefix (global)
Set two-pass log file name prefix to prefix. The default file name
prefix is ffmpeg2pass. The complete file name will be
PREFIX-N.log, where N is a number specific to the output stream.
Note that this option is overridden by a local option of the same
name when using "-vcodec libx264". That option maps to the x264
option stats, which has a different syntax.
-vlang code
Set the ISO 639 language code (3 letters) of the current video
stream.
-vf filter_graph (output)
filter_graph is a description of the filter graph to apply to the
input video. Use the option "-filters" to show all the available
filters (including also sources and sinks). This is an alias for
"-filter:v".
Advanced Video Options
-pix_fmt[:stream_specifier] format (input/output,per-stream)
Set pixel format. Use "-pix_fmts" to show all the supported pixel
formats.
-sws_flags flags (input/output)
-b_qoffset offset
qp offset between P- and B-frames
-i_qoffset offset
qp offset between P- and I-frames
-rc_eq equation
Set rate control equation (see section "Expression Evaluation")
(default = "tex^qComp").
When computing the rate control equation expression, besides the
standard functions defined in the section "Expression Evaluation",
the following functions are available:
bits2qp(bits)
qp2bits(qp)
and the following constants are available:
iTex
pTex
tex
mv
fCode
iCount
mcVar
var
isI
isP
isB
avgQP
qComp
avgIITex
avgPITex
avgPPTex
avgBPTex
avgTex
-rc_override[:stream_specifier] override (output,per-stream)
Rate control override for specific intervals, formatted as an
"int,int,int" list separated with slashes. The first two values are
the beginning and end frame numbers; the last one is the quantizer to
use if positive, or the quality factor if negative.
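To make the override format concrete, here is a small Python sketch
that splits such a string into intervals. It only models the syntax
described above; the function name and the dictionary keys are
hypothetical, and the handling of a zero value is an assumption because
the text does not cover it.

```python
def parse_rc_override(text):
    """Parse "start,end,q/start,end,q/..." into a list of dicts."""
    intervals = []
    for item in text.split("/"):
        start, end, q = (int(field) for field in item.split(","))
        if q > 0:
            # Positive third value: a fixed quantizer for the interval.
            intervals.append({"start": start, "end": end, "quantizer": q})
        else:
            # Negative third value: a quality factor. The sign is dropped here
            # for readability; zero is grouped with this case as an assumption.
            intervals.append({"start": start, "end": end, "quality_factor": -q})
    return intervals
```

parse_rc_override("0,100,20/101,200,-5") yields one interval with a
quantizer of 20 for frames 0 to 100 and one with a quality factor for
frames 101 to 200.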
-me_method method
Set motion estimation method to method.
(from lowest to best quality):
zero
Try just the (0, 0) vector.
phods
log
x1
hex
umh
epzs
(default method)
full
exhaustive search (slow and marginally better than epzs)
-dct_algo algo
Set DCT algorithm to algo. Available values are:
0
FF_DCT_AUTO (default)
FF_DCT_FASTINT
FF_DCT_INT
FF_DCT_MMX
FF_DCT_MLIB
FF_DCT_ALTIVEC
-idct_algo algo
Set IDCT algorithm to algo. Available values are:
0
FF_IDCT_AUTO (default)
FF_IDCT_INT
FF_IDCT_SIMPLE
FF_IDCT_SIMPLEMMX
FF_IDCT_LIBMPEG2MMX
FF_IDCT_PS2
FF_IDCT_MLIB
FF_IDCT_ARM
FF_IDCT_ALTIVEC
FF_IDCT_SH4
10
FF_IDCT_SIMPLEARM
-er n
Set error resilience to n.
1
FF_ER_CAREFUL (default)
FF_ER_COMPLIANT
FF_ER_AGGRESSIVE
FF_ER_VERY_AGGRESSIVE
-ec bit_mask
Set error concealment to bit_mask. bit_mask is a bit mask of the
following values:
1
-bf frames
Use frames B-frames (supported for MPEG-1, MPEG-2 and MPEG-4).
-mbd mode
macroblock decision
0
-4mv
Use four motion vectors per macroblock (MPEG-4 only).
-part
Use data partitioning (MPEG-4 only).
-bug param
Work around encoder bugs that are not auto-detected.
-strict strictness
How strictly to follow the standards.
-aic
Enable Advanced intra coding (h263+).
-umv
Enable Unlimited Motion Vector (h263+)
-deinterlace
Deinterlace pictures.
-ilme
Force interlacing support in encoder (MPEG-2 and MPEG-4 only). Use
this option if your input file is interlaced and you want to keep
-q:a.
-ac[:stream_specifier] channels (input/output,per-stream)
Set the number of audio channels. For output streams it is set by
default to the number of input audio channels. For input streams
this option only makes sense for audio grabbing devices and raw
demuxers and is mapped to the corresponding demuxer options.
-an (output)
Disable audio recording.
-acodec codec (input/output)
Set the audio codec. This is an alias for "-codec:a".
-sample_fmt[:stream_specifier] sample_fmt (output,per-stream)
Set the audio sample format. Use "-sample_fmts" to get a list of
supported sample formats.
Advanced Audio options:
-atag fourcc/tag (output)
Force audio tag/fourcc. This is an alias for "-tag:a".
-audio_service_type type
Set the type of service that the audio stream contains.
ma
ef  Effects
vi  Visually Impaired
hi  Hearing Impaired
di  Dialogue
co  Commentary
em  Emergency
vo  Voice Over
ka  Karaoke
-absf bitstream_filter
Deprecated, see -bsf
Subtitle options:
-slang code
Set the ISO 639 language code (3 letters) of the current subtitle
stream.
-scodec codec (input/output)
Set the subtitle codec. This is an alias for "-codec:s".
-sn (output)
Disable subtitle recording.
-sbsf bitstream_filter
Deprecated, see -bsf
Audio/Video grab options
-isync (global)
Synchronize read on input.
Advanced options
-map
[-]input_file_id[:stream_specifier][,sync_file_id[:stream_specifier]]
(output)
Designate one or more input streams as a source for the output
file. Each input stream is identified by the input file index
input_file_id and the input stream index input_stream_id within the
input file. Both indices start at 0. If specified,
sync_file_id:stream_specifier sets which input stream is used as a
presentation sync reference.
The first "-map" option on the command line specifies the source
for output stream 0, the second "-map" option specifies the source
for output stream 1, etc.
A "-" character before the stream identifier creates a "negative"
mapping. It disables matching streams from already created
mappings.
For example, to map ALL streams from the first input file to output:
ffmpeg -i INPUT -map 0 output
For example, if you have two audio streams in the first input file,
these streams are identified by "0:0" and "0:1". You can use "-map"
to select which streams to place in an output file. For example:
ffmpeg -i INPUT -map 0:1 out.wav
will map the input stream in INPUT identified by "0:1" to the
(single) output stream in out.wav.
For example, to select the stream with index 2 from input file
a.mov (specified by the identifier "0:2"), and stream with index 6
from input b.mov (specified by the identifier "1:6"), and copy them
to the output file out.mov:
ffmpeg -i a.mov -i b.mov -c copy -map 0:2 -map 1:6 out.mov
To select all video and the third audio stream from an input file:
ffmpeg -i INPUT -map 0:v -map 0:a:2 OUTPUT
To map all the streams except the second audio, use negative
mappings:
ffmpeg -i INPUT -map 0 -map -0:a:1 OUTPUT
Note that using this option disables the default mappings for this
output file.
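The interplay of ordered and negative mappings can be modelled with the
Python sketch below. It is a simplified illustration, not ffmpeg's
implementation: the function names are hypothetical, each input is
represented as a list of stream type letters, and the optional sync
reference after the comma is ignored.

```python
def match(spec, types):
    """Stream indices matched by a specifier such as "", "1", "a" or "a:2"."""
    if spec == "":
        return list(range(len(types)))
    if spec.isdigit():
        i = int(spec)
        return [i] if i < len(types) else []
    kind, _, nth = spec.partition(":")
    same = [i for i, t in enumerate(types) if t == kind]
    if nth == "":
        return same
    n = int(nth)
    return same[n:n + 1]


def resolve_maps(inputs, maps):
    """Return the (file, stream) pairs selected by a sequence of -map values."""
    selected = []
    for arg in maps:
        negative = arg.startswith("-")
        file_id, _, spec = arg.lstrip("-").partition(":")
        f = int(file_id)
        hits = [(f, i) for i in match(spec, inputs[f])]
        if negative:
            # A negative mapping removes matching streams already mapped.
            selected = [s for s in selected if s not in hits]
        else:
            # Output stream order follows the order of the -map options.
            selected.extend(hits)
    return selected
```

With one input holding a video, two audio and a subtitle stream,
resolve_maps([["v", "a", "a", "s"]], ["0", "-0:a:1"]) keeps streams 0, 1
and 3, which corresponds to the negative mapping example above.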
-map_channel
[input_file_id.stream_specifier.channel_id|-1][:output_file_id.stream_specifier]
Map an audio channel from a given input to an output. If
output_file_id.stream_specifier are not set, the audio channel will
be mapped on all the audio streams.
Using "-1" instead of input_file_id.stream_specifier.channel_id
will map a muted channel.
For example, assuming INPUT is a stereo audio file, you can switch
the two audio channels with the following command:
ffmpeg -i INPUT -map_channel 0.0.1 -map_channel 0.0.0 OUTPUT
If you want to mute the first channel and keep the second:
ffmpeg -i INPUT -map_channel -1 -map_channel 0.0.1 OUTPUT
The order of the "-map_channel" options specifies the order of the
channels in the output stream. The output channel layout is guessed
from the number of channels mapped (mono if one "-map_channel",
stereo if two, etc.). Using "-ac" in combination with "-map_channel"
causes the channel gain levels to be updated if the channel layouts
do not match (for instance two "-map_channel" options and "-ac 6").
You can also extract each channel of an INPUT to specific outputs;
the following command extracts each channel of the audio stream
(file 0, stream 0) to the respective OUTPUT_CH0 and OUTPUT_CH1:
ffmpeg -i INPUT -map_channel 0.0.0 OUTPUT_CH0 -map_channel 0.0.1 OUTPUT_CH1
The following example splits the channels of a stereo input into
streams:
ffmpeg -i stereo.wav -map 0:0 -map 0:0 -map_channel 0.0.0:0.0 -map_channel 0.0.1:0.1 -y out.ogg
Note that currently each output stream can only contain channels
from a single input stream; you cannot, for example, use
"-map_channel" to pick multiple input audio channels contained in
different streams (from the same or different files) and merge them
into a single output stream. It is therefore not currently
possible, for example, to turn two separate mono streams into a
single stereo stream. However splitting a stereo stream into two
s[:stream_spec]
per-stream metadata. stream_spec is a stream specifier as
described in the Stream specifiers chapter. In an input
metadata specifier, the first matching stream is copied from.
In an output metadata specifier, all matching streams are
copied to.
c:chapter_index
per-chapter metadata. chapter_index is the zero-based chapter
index.
p:program_index
per-program metadata. program_index is the zero-based program
index.
If metadata specifier is omitted, it defaults to global.
By default, global metadata is copied from the first input file,
per-stream and per-chapter metadata is copied along with
streams/chapters. These default mappings are disabled by creating
any mapping of the relevant type. A negative file index can be used
to create a dummy mapping that just disables automatic copying.
For example, to copy metadata from the first stream of the input
file to global metadata of the output file:
ffmpeg -i in.ogg -map_metadata 0:s:0 out.mp3
To do the reverse, i.e. copy global metadata to all audio streams:
ffmpeg -i in.mkv -map_metadata:s:a 0:g out.mkv
Note that a simple 0 would work as well in this example, since global
metadata is assumed by default.
-map_chapters input_file_index (output)
Copy chapters from input file with index input_file_index to the
next output file. If no chapter mapping is specified, then chapters
are copied from the first input file with at least one chapter. Use
a negative file index to disable any chapter copying.
-debug category
motion vector
pict         picture info
pts
qp           per-block quantization parameter (QP)
rc           rate control
skip
startcode
thread_ops   threading operations
vis_mb_type  visualize block types
vis_qp       visualize quantization parameter (QP), lower QP are tinted
             greener
-benchmark (global)
Show benchmarking information at the end of an encode. Shows CPU
time used and maximum memory consumption. Maximum memory
consumption is not supported on all systems; it will usually
display as 0 if not supported.
-timelimit duration (global)
Exit after ffmpeg has been running for duration seconds.
-dump (global)
Dump each input packet to stderr.
-hex (global)
When dumping packets, also dump the payload.
-ps size
Set RTP payload size in bytes.
-re (input)
Read input at native frame rate. Mainly used to simulate a grab
device.
-loop_input
Loop over the input stream. Currently it works only for image
streams. This option is used for automatic FFserver testing. This
option is deprecated, use -loop 1.
-loop_output number_of_times
Repeatedly loop output for formats that support looping such as
animated GIF (0 will loop the output infinitely). This option is
deprecated, use -loop.
-threads count
Thread count.
-vsync parameter
Video sync method.
0, passthrough
Each frame is passed with its timestamp from the demuxer to the
muxer.
1, cfr
Frames will be duplicated and dropped to achieve exactly the
requested constant framerate.
2, vfr
Frames are passed through with their timestamp or dropped so as
to prevent 2 frames from having the same timestamp.
-1, auto
Chooses between 1 and 2 depending on muxer capabilities. This
is the default method.
With -map you can select from which stream the timestamps should be
taken. You can leave either video or audio unchanged and sync the
remaining stream(s) to the unchanged one.
-async samples_per_second
Audio sync method. "Stretches/squeezes" the audio stream to match
the timestamps; the parameter is the maximum samples per second by
which the audio is changed. -async 1 is a special case where only
the start of the audio stream is corrected without any later
correction.
-copyts
Copy timestamps from input to output.
-copytb
Copy input stream time base from input to output when stream
copying.
-shortest
Finish encoding when the shortest input stream ends.
-dts_delta_threshold
Timestamp discontinuity delta threshold.
-muxdelay seconds (input)
Set the maximum demux-decode delay.
-muxpreload seconds (input)
Set the initial demux-decode delay.
-streamid output-stream-index:new-value (output)
Assign a new stream-id value to an output stream. This option
should be specified prior to the output filename to which it
applies. For the situation where multiple output files exist, a
streamid may be reassigned to a different value.
For example, to set the stream 0 PID to 33 and the stream 1 PID to
36 for an output mpegts file:
ffmpeg -i infile -streamid 0:33 -streamid 1:36 out.ts
-bsf[:stream_specifier] bitstream_filters (output,per-stream)
Set bitstream filters for matching streams. bitstream_filters is a
comma-separated list of bitstream filters. Use the "-bsfs" option
to get the list of bitstream filters.
ffmpeg -i h264.mp4 -c:v copy -vbsf h264_mp4toannexb -an out.h264
ffmpeg -i file.mov -an -vn -sbsf mov2textsub -c:s copy -f rawvideo sub.txt
-tag[:stream_specifier] codec_tag (per-stream)
Force a tag/fourcc for matching streams.
-timecode hh:mm:ssSEPff
Specify Timecode for writing. SEP is : for non drop timecode and
; (or .) for drop.
ffmpeg -i input.mpg -timecode 01:02:03.04 -r 30000/1001 -s ntsc output.mpg
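The role of the separator can be shown with a short Python sketch that
splits a timecode and reports whether it requests drop-frame counting.
The function name and the returned keys are hypothetical, and the sketch
assumes two-digit fields as in the example above.

```python
import re


def parse_timecode(text):
    """Split hh:mm:ssSEPff and report whether SEP selects drop-frame."""
    m = re.fullmatch(r"(\d\d):(\d\d):(\d\d)([:;.])(\d\d)", text)
    if m is None:
        raise ValueError(f"not a timecode: {text!r}")
    hh, mm, ss, sep, ff = m.groups()
    return {
        "hours": int(hh),
        "minutes": int(mm),
        "seconds": int(ss),
        "frames": int(ff),
        "drop_frame": sep in ";.",  # ":" means non-drop, ";" or "." means drop
    }
```

Under this reading, the value 01:02:03.04 used in the command above is
a drop-frame timecode, which fits the 30000/1001 frame rate it is
paired with.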
Preset files
A preset file contains a sequence of option=value pairs, one for each
line, specifying a sequence of options which would be awkward to
specify on the command line. Lines starting with the hash (#)
character are ignored and are used to provide comments. Check the
presets directory in the FFmpeg source tree for examples.
Preset files are specified with the "vpre", "apre", "spre", and "fpre"
options. The "fpre" option takes the filename of the preset instead of
a preset name as input and can be used for any kind of codec. For the
"vpre", "apre", and "spre" options, the options specified in a preset
file are applied to the currently selected codec of the same type as
the preset option.
The argument passed to the "vpre", "apre", and "spre" preset options
identifies the preset file to use according to the following rules:
First ffmpeg searches for a file named arg.ffpreset in the directories
$FFMPEG_DATADIR (if set), and $HOME/.ffmpeg, and in the datadir defined
at configuration time (usually PREFIX/share/ffmpeg) or in a ffpresets
folder alongside the executable on win32, in that order. For example, if
the argument is "libx264-max", it will search for the file
libx264-max.ffpreset.
If no such file is found, then ffmpeg will search for a file named
codec_name-arg.ffpreset in the above-mentioned directories, where
codec_name is the name of the codec to which the preset file options
will be applied. For example, if you select the video codec with
"-vcodec libx264" and use "-vpre max", then it will search for the file
libx264-max.ffpreset.
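The search order for "vpre", "apre" and "spre" arguments can be
summarised with the following Python sketch, which lists the candidate
paths in the order they would be tried. It is a model of the rules
above under stated assumptions: the function name is hypothetical, the
environment is passed in as a dictionary, and the win32 ffpresets folder
is left out.

```python
import posixpath


def preset_candidates(arg, codec_name, env, datadir):
    """List the preset files tried for a preset argument, in search order."""
    dirs = []
    if env.get("FFMPEG_DATADIR"):
        dirs.append(env["FFMPEG_DATADIR"])
    if env.get("HOME"):
        dirs.append(posixpath.join(env["HOME"], ".ffmpeg"))
    dirs.append(datadir)  # datadir chosen at configuration time
    names = [f"{arg}.ffpreset", f"{codec_name}-{arg}.ffpreset"]
    # Every directory is tried with the plain name first; the codec-prefixed
    # name is only tried if that first round finds nothing.
    return [posixpath.join(d, n) for n in names for d in dirs]
```

The first existing path in the returned list would be the preset that
gets loaded, so "-vcodec libx264 -vpre max" only reaches
libx264-max.ffpreset if no max.ffpreset exists in any of the directories.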
TIPS
For streaming at very low bitrates, use a low frame rate
and a small GOP size. This is especially true for RealVideo where
the Linux player does not seem to be very fast, so it can miss
frames. An example is:
ffmpeg -g 3 -r 3 -t 10 -b:v 50k -s qcif -f rv10 /tmp/b.rm
EXAMPLES
Preset files
A preset file contains a sequence of option=value pairs, one for each
line, specifying a sequence of options which can be specified also on
the command line. Lines starting with the hash (#) character are
ignored and are used to provide comments. Empty lines are also ignored.
Check the presets directory in the FFmpeg source tree for examples.
Preset files are specified with the "pre" option, which takes a
preset name as input. FFmpeg searches for a file named
preset_name.avpreset in the directories $AVCONV_DATADIR (if set), and
$HOME/.ffmpeg, and in the data directory defined at configuration time
(usually $PREFIX/share/ffmpeg) in that order. For example, if the
argument is "libx264-max", it will search for the file
libx264-max.avpreset.
Video and Audio grabbing
If you specify the input format and device then ffmpeg can grab video
and audio directly.
ffmpeg -f oss -i /dev/dsp -f video4linux2 -i /dev/video0 /tmp/out.mpg
Or with an ALSA audio source (mono input, card id 1) instead of OSS:
ffmpeg -f alsa -ac 1 -i hw:1 -f video4linux2 -i /dev/video0 /tmp/out.mpg
Note that you must activate the right video source and channel before
launching ffmpeg with any TV viewer such as
xawtv ("http://linux.bytesex.org/xawtv/") by Gerd Knorr. You also have
to set the audio recording levels correctly with a standard mixer.
X11 grabbing
Grab the X11 display with ffmpeg via
ffmpeg -f x11grab -s cif -r 25 -i :0.0 /tmp/out.mpg
0.0 is display.screen number of your X11 server, same as the DISPLAY
environment variable.
ffmpeg -f x11grab -s cif -r 25 -i :0.0+10,20 /tmp/out.mpg
0.0 is display.screen number of your X11 server, same as the DISPLAY
environment variable. 10 is the x-offset and 20 the y-offset for the
grabbing.
Video and Audio file format conversion
Any supported file format and protocol can serve as input to ffmpeg:
Examples:
Converts the audio file a.wav and the raw YUV video file a.yuv to
MPEG file a.mpg.
You can also do audio and video conversions at the same time:
ffmpeg -i /tmp/a.wav -ar 22050 /tmp/a.mp2
You can encode to several formats at the same time and define a
mapping from input stream to output streams:
ffmpeg -i /tmp/a.wav -map 0:a -b:a 64k /tmp/a.mp2 -map 0:a -b:a 128k /tmp/b.mp2
Converts a.wav to a.mp2 at 64 kbits and to b.mp2 at 128 kbits.
-map file:index specifies which input stream is used for each
output stream, in the order of the definition of output streams.
This is a typical DVD ripping example; the input is a VOB file, the
output an AVI file with MPEG-4 video and MP3 audio. Note that in
this command we use B-frames so the MPEG-4 stream is DivX5
compatible, and GOP size is 300 which means one intra frame every
10 seconds for 29.97fps input video. Furthermore, the audio stream
is MP3-encoded so you need to enable LAME support by passing
"--enable-libmp3lame" to configure. The mapping is particularly
useful for DVD transcoding to get the desired audio language.
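The "one intra frame every 10 seconds" figure follows from dividing the
GOP size by the frame rate. A quick Python check of that arithmetic,
taking 29.97 fps to be the usual shorthand for 30000/1001 (the function
name is hypothetical):

```python
def seconds_between_intra_frames(gop_size, frame_rate):
    """Interval between intra frames when one is forced every gop_size frames."""
    return gop_size / frame_rate


ntsc = 30000 / 1001  # the exact rate usually abbreviated as 29.97 fps
interval = seconds_between_intra_frames(300, ntsc)
```

The result is very close to 10.01 seconds, which the text rounds to 10.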
NOTE: To see the supported input formats, use "ffmpeg -formats".
You can extract images from a video, or create a video from many
images:
For extracting images from a video:
ffmpeg -i foo.avi -r 1 -s WxH -f image2 foo-%03d.jpeg
This will extract one video frame per second from the video and
will output them in files named foo-001.jpeg, foo-002.jpeg, etc.
Images will be rescaled to fit the new WxH values.
If you want to extract just a limited number of frames, you can use
the above command in combination with the -vframes or -t option, or
in combination with -ss to start extracting from a certain point in
time.
For creating a video from many images:
ffmpeg -f image2 -i foo-%03d.jpeg -r 12 -s WxH foo.avi
The syntax "foo-%03d.jpeg" specifies to use a decimal number
composed of three digits padded with zeroes to express the sequence
number. It is the same syntax supported by the C printf function,
but only formats accepting a normal integer are suitable.
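Python's % operator follows the same conventions as the C printf
function for %d, so the expansion of such a pattern can be previewed
with a short sketch. The function name is hypothetical, and numbering is
assumed to start at 1 as in the file names listed above.

```python
def sequence_names(pattern, count, start=1):
    """Expand a printf-style pattern such as "foo-%03d.jpeg"."""
    return [pattern % n for n in range(start, start + count)]
```

sequence_names("foo-%03d.jpeg", 3) gives foo-001.jpeg, foo-002.jpeg and
foo-003.jpeg.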
You can put many streams of the same type in the output:
ffmpeg -i test1.avi -i test2.avi -map 0.3 -map 0.2 -map 0.1 -map 0.0 -c copy test12.nut
if A then B else C
is equivalent to
if(A,B) + ifnot(A,C)
In your C code, you can extend the list of unary and binary functions,
and define recognized constants, so that they are available for your
expressions.
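The equivalence can be checked with a small Python model. The
definitions of if(A,B) and ifnot(A,C) are not given in this part of the
text, so the sketch assumes that if(A,B) yields B when A is non-zero and
0 otherwise, and that ifnot(A,C) yields C when A is zero and 0
otherwise. Python evaluates both arguments eagerly, which the evaluator
itself need not do.

```python
def if_(cond, value):
    """Assumed meaning of if(A,B): B when A is non-zero, otherwise 0."""
    return value if cond != 0 else 0


def ifnot(cond, value):
    """Assumed meaning of ifnot(A,C): C when A is zero, otherwise 0."""
    return value if cond == 0 else 0


def if_then_else(a, b, c):
    """Reference behaviour of "if A then B else C"."""
    return b if a != 0 else c
```

Because exactly one of the two terms is non-zero for any A, their sum
equals the selected branch.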
The evaluator also recognizes the International System number
postfixes. If i is appended after the postfix, powers of 2 are used
instead of powers of 10. The B postfix multiplies the value by 8,
and can be appended after another postfix or used alone. This allows
using, for example, KB, MiB, G and B as postfixes.
The list of available International System postfixes follows, with
the corresponding powers of 10 and of 2.
y   -24 / -80
z   -21 / -70
a   -18 / -60
f   -15 / -50
p   -12 / -40
n   -9 / -30
u   -6 / -20
m   -3 / -10
c   -2
d   -1
k   3 / 10
K   3 / 10
M   6 / 20
G   9 / 30
T   12 / 40
P   15 / 50
E   18 / 60
Z   21 / 70
Y   24 / 80
DECODERS
Decoders are configured elements in FFmpeg which allow the decoding of
multimedia streams.
When you configure your FFmpeg build, all the supported native decoders
are enabled by default. Decoders requiring an external library must be
enabled manually via the corresponding "--enable-lib" option. You can
list all available decoders using the configure option
"--list-decoders".
You can disable all the decoders with the configure option
"--disable-decoders" and selectively enable / disable single decoders
with the options "--enable-decoder=DECODER" /
"--disable-decoder=DECODER".
The option "-codecs" of the ff* tools will display the list of enabled
decoders.
VIDEO DECODERS
A description of some of the currently available video decoders
follows.
rawvideo
Raw video decoder.
This decoder decodes rawvideo streams.
Options
top top_field_first
Specify the assumed field type of the input video.
-1
the video is assumed to be progressive (default)
0
bottom-field-first is assumed
1
top-field-first is assumed
AUDIO DECODERS
ffwavesynth
Internal wave synthesizer.
This decoder generates wave patterns according to predefined sequences.
Its use is purely internal and the format of the data it accepts is not
publicly documented.
ENCODERS
Encoders are configured elements in FFmpeg which allow the encoding of
multimedia streams.
When you configure your FFmpeg build, all the supported native encoders
are enabled by default. Encoders requiring an external library must be
enabled manually via the corresponding "--enable-lib" option. You can
list all available encoders using the configure option
"--list-encoders".
You can disable all the encoders with the configure option
"--disable-encoders" and selectively enable / disable single encoders
with the options "--enable-encoder=ENCODER" /
"--disable-encoder=ENCODER".
The option "-codecs" of the ff* tools will display the list of enabled
encoders.
AUDIO ENCODERS
A description of some of the currently available audio encoders
follows.
ac3 and ac3_fixed
AC-3 audio encoders.
These encoders implement part of ATSC A/52:2010 and ETSI TS 102 366, as
well as the undocumented RealAudio 3 (a.k.a. dnet).
The ac3 encoder uses floating-point math, while the ac3_fixed encoder
only uses fixed-point integer math. This does not mean that one is
always faster, just that one or the other may be better suited to a
particular system. The floating-point encoder will generally produce
better quality audio for a given bitrate. The ac3_fixed encoder is not
the default codec for any of the output formats, so it must be
specified explicitly using the option "-acodec ac3_fixed" in order to
use it.
AC-3 Metadata
The AC-3 metadata options are used to set parameters that describe the
audio, but in most cases do not affect the audio encoding itself. Some
of the options do directly affect or influence the decoding and
playback of the resulting bitstream, while others are just for
informational purposes. A few of the options will add bits to the
output stream that could otherwise be used for audio data, and will
thus affect the quality of the output. Those will be indicated
accordingly with a note in the option list below.
These parameters are described in detail in several publicly-available
documents.
*<A/52:2010 - Digital Audio Compression (AC-3) (E-AC-3) Standard
("http://www.atsc.org/cms/standards/a_52-2010.pdf")>
Downmix Levels
-center_mixlev level
Center Mix Level. The amount of gain the decoder should apply to
the center channel when downmixing to stereo. This field will only
be written to the bitstream if a center channel is present. The
value is specified as a scale factor. There are 3 valid values:
0.707
Apply -3dB gain
0.595
Apply -4.5dB gain (default)
0.500
Apply -6dB gain
-surround_mixlev level
Surround Mix Level. The amount of gain the decoder should apply to
the surround channel(s) when downmixing to stereo. This field will
only be written to the bitstream if one or more surround channels
are present. The value is specified as a scale factor. There are 3
valid values:
0.707
Apply -3dB gain
0.500
Apply -6dB gain (default)
0.000
Silence Surround Channel(s)
Copyright Exists
-dialnorm value
Dialogue Normalization. Indicates how far the average dialogue
level of the program is below digital 100% full scale (0 dBFS).
This parameter determines a level shift during audio reproduction
that sets the average volume of the dialogue to a preset level. The
goal is to match volume level between program sources. A value of
-31dB will result in no volume level change, relative to the source
volume, during audio reproduction. Valid values are whole numbers
in the range -31 to -1, with -31 being the default.
-dsur_mode mode
Dolby Surround Mode. Specifies whether the stereo signal uses Dolby
Surround (Pro Logic). This field will only be written to the
bitstream if the audio stream is stereo. Using this option does NOT
mean the encoder will actually apply Dolby Surround processing.
0
notindicated
Not Indicated (default)
1
off Not Dolby Surround Encoded
2
on
-original boolean
Original Bit Stream Indicator. Specifies whether this audio is from
the original source and not a copy.
0
off Not Original Source
1
on
1
ltrt
Lt/Rt Downmix Preferred
2
loro
Lo/Ro Downmix Preferred
-ltrt_cmixlev level
Lt/Rt Center Mix Level. The amount of gain the decoder should apply
to the center channel when downmixing to stereo in Lt/Rt mode.
1.414
Apply +3dB gain
1.189
Apply +1.5dB gain
1.000
Apply 0dB gain
0.841
Apply -1.5dB gain
0.707
Apply -3.0dB gain
0.595
Apply -4.5dB gain (default)
0.500
Apply -6.0dB gain
0.000
Silence Center Channel
-ltrt_surmixlev level
Lt/Rt Surround Mix Level. The amount of gain the decoder should
apply to the surround channel(s) when downmixing to stereo in Lt/Rt
mode.
0.841
Apply -1.5dB gain
0.707
Apply -3.0dB gain
0.595
Apply -4.5dB gain
0.500
Apply -6.0dB gain (default)
0.000
Silence Surround Channel(s)
-loro_cmixlev level
Lo/Ro Center Mix Level. The amount of gain the decoder should apply
to the center channel when downmixing to stereo in Lo/Ro mode.
1.414
Apply +3dB gain
1.189
Apply +1.5dB gain
1.000
Apply 0dB gain
0.841
Apply -1.5dB gain
0.707
Apply -3.0dB gain
0.595
Apply -4.5dB gain (default)
0.500
Apply -6.0dB gain
0.000
Silence Center Channel
-loro_surmixlev level
Lo/Ro Surround Mix Level. The amount of gain the decoder should
apply to the surround channel(s) when downmixing to stereo in Lo/Ro
mode.
0.841
Apply -1.5dB gain
0.707
Apply -3.0dB gain
0.595
Apply -4.5dB gain
0.500
Apply -6.0dB gain (default)
0.000
Silence Surround Channel(s)
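The scale factors accepted by the mix level options and the dB gains
quoted next to them are related by gain_dB = 20 * log10(scale). The
following Python sketch makes the relation explicit; the helper name
scale_to_db is an assumption for this example, and a scale of 0.000
(silence) is mapped to negative infinity.

```python
import math


def scale_to_db(scale):
    """Convert a linear mix-level scale factor to a gain in dB."""
    if scale == 0:
        return float("-inf")  # 0.000 silences the channel
    return 20.0 * math.log10(scale)


if __name__ == "__main__":
    # Scale factors listed for the Lt/Rt and Lo/Ro center mix levels.
    for scale in (1.414, 1.189, 1.000, 0.841, 0.707, 0.595, 0.500):
        print("%.3f -> %+.1f dB" % (scale, scale_to_db(scale)))
```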
Extended Bitstream Information - Part 2
-dsurex_mode mode
Dolby Surround EX Mode. Indicates whether the stream uses Dolby
Surround EX (7.1 matrixed to 5.1). Using this option does NOT mean
the encoder will actually apply Dolby Surround EX processing.
0
notindicated
Not Indicated (default)
1
on
2
off Dolby Surround EX On
-dheadphone_mode mode
Dolby Headphone Mode. Indicates whether the stream uses Dolby
Headphone encoding (multi-channel matrixed to 2.0 for use with
headphones). Using this option does NOT mean the encoder will
actually apply Dolby Headphone processing.
0
notindicated
Not Indicated (default)
1
on
2
off Dolby Headphone On
-ad_conv_type type
A/D Converter Type. Indicates whether the audio has passed through
HDCD A/D conversion.
0
standard
Standard A/D Converter (default)
1
hdcd
HDCD A/D Converter
Other AC-3 Encoding Options
-stereo_rematrixing boolean
Stereo Rematrixing. Enables/Disables use of rematrixing for stereo
input. This is an optional AC-3 feature that increases quality by
selectively encoding the left/right channels as mid/side. This
option is enabled by default, and it is highly recommended that it
be left as enabled except for testing purposes.
-cpl_start_band number
Coupling Start Band. Sets the channel coupling start band, from 1
to 15. If a value higher than the bandwidth is used, it will be
reduced to 1 less than the coupling end band. If auto is used, the
start band will be determined by the encoder based on the bit rate,
sample rate, and channel layout. This option has no effect if
channel coupling is disabled.
-1
auto
Selected by Encoder (default)
VIDEO ENCODERS
A description of some of the currently available video encoders
follows.
libvpx
VP8 format supported through libvpx.
Requires the presence of the libvpx headers and library during
configuration. You need to explicitly configure the build with
"--enable-libvpx".
Options
Mapping from FFmpeg to libvpx options with conversion notes in
parentheses.
threads
g_threads
profile
g_profile
vb
rc_target_bitrate
g
kf_max_dist
keyint_min
kf_min_dist
qmin
rc_min_quantizer
qmax
rc_max_quantizer
bufsize, vb
rc_buf_sz "(bufsize * 1000 / vb)"
rc_buf_optimal_sz "(bufsize * 1000 / vb * 5 / 6)"
rc_init_occupancy, vb
rc_buf_initial_sz "(rc_init_occupancy * 1000 / vb)"
rc_buffer_aggressivity
rc_undershoot_pct
skip_threshold
rc_dropframe_thresh
qcomp
rc_2pass_vbr_bias_pct
maxrate, vb
rc_2pass_vbr_maxsection_pct "(maxrate * 100 / vb)"
minrate, vb
rc_2pass_vbr_minsection_pct "(minrate * 100 / vb)"
minrate, maxrate, vb
"VPX_CBR" "(minrate == maxrate == vb)"
crf "VPX_CQ", "VP8E_SET_CQ_LEVEL"
quality
best
"VPX_DL_BEST_QUALITY"
good
"VPX_DL_GOOD_QUALITY"
realtime
"VPX_DL_REALTIME"
speed
"VP8E_SET_CPUUSED"
nr
"VP8E_SET_NOISE_SENSITIVITY"
mb_threshold
"VP8E_SET_STATIC_THRESHOLD"
slices
"VP8E_SET_TOKEN_PARTITIONS"
Alternate reference frame related
vp8flags altref
"VP8E_SET_ENABLEAUTOALTREF"
arnr_max_frames
"VP8E_SET_ARNR_MAXFRAMES"
arnr_type
"VP8E_SET_ARNR_TYPE"
arnr_strength
"VP8E_SET_ARNR_STRENGTH"
rc_lookahead
g_lag_in_frames
vp8flags error_resilient
g_error_resilient
For more information about libvpx see: <http://www.webmproject.org/>
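The conversion notes given in parentheses in the mapping above can be
worked through numerically. The Python sketch below applies those formulas
as written; the function name is hypothetical, and the use of integer
division is an assumption about how the arithmetic is carried out.

```python
def libvpx_rate_control(vb, bufsize=None, rc_init_occupancy=None,
                        maxrate=None, minrate=None):
    """Apply the conversion notes from the mapping list to FFmpeg values.

    All inputs are in bits (bufsize, rc_init_occupancy) or bits per second
    (vb, maxrate, minrate). Integer division is assumed.
    """
    settings = {}
    if bufsize is not None:
        settings["rc_buf_sz"] = bufsize * 1000 // vb
        settings["rc_buf_optimal_sz"] = bufsize * 1000 // vb * 5 // 6
    if rc_init_occupancy is not None:
        settings["rc_buf_initial_sz"] = rc_init_occupancy * 1000 // vb
    if maxrate is not None:
        settings["rc_2pass_vbr_maxsection_pct"] = maxrate * 100 // vb
    if minrate is not None:
        settings["rc_2pass_vbr_minsection_pct"] = minrate * 100 // vb
    # VPX_CBR is selected when minrate == maxrate == vb.
    settings["cbr"] = minrate == maxrate == vb
    return settings
```

With vb = 1000000 and bufsize = 2000000, for example, the buffer size
works out to 2000 (milliseconds of data at the target bitrate).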
libx264
H.264 / AVC / MPEG-4 AVC / MPEG-4 part 10 format supported through
libx264.
Requires the presence of the libx264 headers and library during
configuration. You need to explicitly configure the build with
"--enable-libx264".
Options
preset preset_name
Set the encoding preset.
tune tune_name
Tune the encoding params.
fastfirstpass bool
Use fast settings when encoding first pass, default value is 1.
profile profile_name
Set profile restrictions.
level level
Specify level (as defined by Annex A). Deprecated in favor of
x264opts.
passlogfile filename
Specify filename for 2 pass stats. Deprecated in favor of
x264opts (see stats libx264 option).
wpredp wpred_type
Specify Weighted prediction for P-frames. Deprecated in favor of
x264opts (see weightp libx264 option).
x264opts options
Allows setting any x264 option; see x264 --fullhelp for a list.
options is a list of key=value pairs separated by ":".
For example to specify libx264 encoding options with ffmpeg:
ffmpeg -i foo.mpg -vcodec libx264 -x264opts keyint=123:min-keyint=20 -an out.mkv
For more information about libx264 and the supported options see:
<http://www.videolan.org/developers/x264.html>
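When the x264opts string is generated by a script, the key=value list can
be assembled as in the following Python sketch. The helper names are
assumptions for this example; the command line it builds is the one shown
above.

```python
def build_x264opts(options):
    """Join a mapping of x264 options into the key=value:key=value form."""
    return ":".join("%s=%s" % (key, value) for key, value in options.items())


def ffmpeg_x264_command(infile, outfile, options):
    """Return an argument list equivalent to the example command above."""
    return ["ffmpeg", "-i", infile, "-vcodec", "libx264",
            "-x264opts", build_x264opts(options), "-an", outfile]


if __name__ == "__main__":
    print(" ".join(ffmpeg_x264_command(
        "foo.mpg", "out.mkv", {"keyint": 123, "min-keyint": 20})))
```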
DEMUXERS
Demuxers are configured elements in FFmpeg which allow reading
multimedia streams from a particular type of file.
When you configure your FFmpeg build, all the supported demuxers are
enabled by default. You can list all available ones using the configure
option "--list-demuxers".
You can disable all the demuxers using the configure option
"--disable-demuxers", and selectively enable a single demuxer with the
option "--enable-demuxer=DEMUXER", or disable it with the option
"--disable-demuxer=DEMUXER".
The option "-formats" of the ff* tools will display the list of enabled
demuxers.
The description of some of the currently available demuxers follows.
image2
Image file demuxer.
This demuxer reads from a list of image files specified by a pattern.
The pattern may contain the string "%d" or "%0Nd", which specifies the
position of the characters representing a sequential number in each
filename matched by the pattern. If the form "%0Nd" is used, the
string representing the number in each filename is 0-padded and N is
the total number of 0-padded digits representing the number. The
literal character % can be specified in the pattern with the string
"%%".
If the pattern contains "%d" or "%0Nd", the first filename of the file
list specified by the pattern must contain a number between 0 and 4
inclusive, and all the following numbers must be sequential. This
limitation may be lifted in the future.
The pattern may contain a suffix which is used to automatically
determine the format of the images contained in the files.
For example the pattern "img-%03d.bmp" will match a sequence of
filenames of the form img-001.bmp, img-002.bmp, ..., img-010.bmp, etc.;
the pattern "i%%m%%g-%d.jpg" will match a sequence of filenames of the
form i%m%g-1.jpg, i%m%g-2.jpg, ..., i%m%g-10.jpg, etc.
The size, the pixel format, and the format of each image must be the
same for all the files in the sequence.
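The pattern rules described for this demuxer (a number field, "%%" for a
literal percent sign, a first number between 0 and 4, sequential numbering)
can be checked before invoking ffmpeg. The Python sketch below is one
possible pre-flight check under those stated rules; the function names are
hypothetical and it does not reproduce the demuxer's own matching code.

```python
import re


def pattern_to_regex(pattern):
    """Translate an image2-style pattern into a regex whose single capture
    group holds the sequence number."""
    parts = []
    pos = 0
    while pos < len(pattern):
        if pattern.startswith("%%", pos):
            parts.append(re.escape("%"))  # "%%" stands for a literal "%"
            pos += 2
            continue
        number = re.match(r"%(?:0(\d+))?d", pattern[pos:])
        if number:
            width = number.group(1)
            # "%0Nd" requires at least N digits, "%d" at least one.
            parts.append(r"(\d{%s,})" % width if width else r"(\d+)")
            pos += number.end()
        else:
            parts.append(re.escape(pattern[pos]))
            pos += 1
    return re.compile("".join(parts))


def sequence_is_valid(pattern, filenames):
    """True if the names match the pattern, start at 0..4 and are sequential."""
    regex = pattern_to_regex(pattern)
    numbers = []
    for name in filenames:
        match = regex.fullmatch(name)
        if match is None or match.lastindex is None:
            return False
        numbers.append(int(match.group(1)))
    if not numbers or not 0 <= numbers[0] <= 4:
        return False
    return all(b == a + 1 for a, b in zip(numbers, numbers[1:]))
```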
The following example shows how to use ffmpeg for creating a video from
the images in the file sequence img-001.jpeg, img-002.jpeg, ...,
assuming an input frame rate of 10 frames per second:
ffmpeg -i 'img-%03d.jpeg' -r 10 out.mkv
Note that the pattern need not contain "%d" or "%0Nd"; for example, to
convert a single image file img.jpeg you can use the command:
ffmpeg -i img.jpeg img.png
applehttp
Apple HTTP Live Streaming demuxer.
This demuxer presents all AVStreams from all variant streams. The id
field is set to the bitrate variant index number. By setting the
discard flags on AVStreams (by pressing a or v in ffplay), the
caller can decide which variant streams to actually receive. The total
bitrate of the variant that the stream belongs to is available in a
metadata key named "variant_bitrate".
sbg
SBaGen script demuxer.
This demuxer reads the script language used by SBaGen
<http://uazu.net/sbagen/> to generate binaural beats sessions. An SBG
script looks like this:
-SE
a: 300-2.5/3 440+4.5/0
b: 300-2.5/0 440+4.5/3
off: -
NOW      == a
+0:07:00 == b
+0:14:00 == a
+0:21:00 == b
+0:30:00    off
An SBG script can mix absolute and relative timestamps. If the script
uses either only absolute timestamps (including the script start time)
or only relative ones, then its layout is fixed, and the conversion is
straightforward. On the other hand, if the script mixes both kinds of
timestamps, then the NOW reference for relative timestamps will be
taken from the current time of day at the time the script is read, and
the script layout will be frozen according to that reference. That
means that if the script is directly played, the actual times will
match the absolute timestamps up to the sound controller's clock
accuracy, but if the user somehow pauses the playback or seeks, all
times will be shifted accordingly.
MUXERS
Muxers are configured elements in FFmpeg which allow writing multimedia
streams to a particular type of file.
When you configure your FFmpeg build, all the supported muxers are
enabled by default. You can list all available muxers using the
configure option "--list-muxers".
You can disable all the muxers with the configure option
"--disable-muxers" and selectively enable / disable single muxers with
the options "--enable-muxer=MUXER" / "--disable-muxer=MUXER".
The option "-formats" of the ff* tools will display the list of enabled
muxers.
A description of some of the currently available muxers follows.
crc
CRC (Cyclic Redundancy Check) testing format.
This muxer computes and prints the Adler-32 CRC of all the input audio
and video frames. By default audio frames are converted to signed
16-bit raw audio and video frames to raw video before computing the
CRC.
The output of the muxer consists of a single line of the form:
CRC=0xCRC, where CRC is a hexadecimal number 0-padded to 8 digits
containing the CRC for all the decoded input frames.
For example to compute the CRC of the input, and store it in the file
out.crc:
ffmpeg -i INPUT -f crc out.crc
You can print the CRC to stdout with the command:
ffmpeg -i INPUT -f crc -
You can select the output format of each frame with ffmpeg by
specifying the audio and video codec and format. For example to compute
the CRC of the input audio converted to PCM unsigned 8-bit and the
input video converted to MPEG-2 video, use the command:
ffmpeg -i INPUT -c:a pcm_u8 -c:v mpeg2video -f crc -
See also the framecrc muxer.
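To illustrate the output format, the following Python sketch computes a
running Adler-32 checksum over a sequence of byte chunks and formats it as
described above. It operates on arbitrary bytes rather than on frames
decoded by ffmpeg; the function name, the lowercase hexadecimal digits and
the starting value of 1 (the Adler-32 default) are assumptions of this
example.

```python
import zlib


def crc_line(chunks):
    """Return a 'CRC=0x........' line for the concatenation of all chunks."""
    crc = 1  # conventional Adler-32 starting value
    for chunk in chunks:
        crc = zlib.adler32(chunk, crc)  # continue the running checksum
    return "CRC=0x%08x" % (crc & 0xFFFFFFFF)


if __name__ == "__main__":
    # Two chunks checksum the same as their concatenation.
    print(crc_line([b"Wiki", b"pedia"]))
```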
framecrc
Per-frame CRC (Cyclic Redundancy Check) testing format.
This muxer computes and prints the Adler-32 CRC for each decoded audio
and video frame. By default audio frames are converted to signed 16-bit
raw audio and video frames to raw video before computing the CRC.
The output of the muxer consists of a line for each audio and video
frame of the form: stream_index, frame_dts, frame_size, 0xCRC, where
CRC is a hexadecimal number 0-padded to 8 digits containing the CRC of
the decoded frame.
For example to compute the CRC of each decoded frame in the input, and
store it in the file out.crc:
ffmpeg -i INPUT -f framecrc out.crc
You can print the CRC of each decoded frame to stdout with the command:
ffmpeg -i INPUT -f framecrc -
You can select the output format of each frame with ffmpeg by
specifying the audio and video codec and format. For example, to
compute the CRC of each decoded input audio frame converted to PCM
unsigned 8-bit and of each decoded input video frame converted to
MPEG-2 video, use the command:
ffmpeg -i INPUT -c:a pcm_u8 -c:v mpeg2video -f framecrc -
See also the crc muxer.
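Because each framecrc line has the form stream_index, frame_dts,
frame_size, 0xCRC, the output is easy to post-process. The Python sketch
below parses one such line; the type and function names are hypothetical,
and the sample line in the usage part is made up for illustration rather
than taken from a real run.

```python
from typing import NamedTuple


class FrameCrc(NamedTuple):
    stream_index: int
    frame_dts: int
    frame_size: int
    crc: int


def parse_framecrc_line(line):
    """Split one 'stream_index, frame_dts, frame_size, 0xCRC' line into fields."""
    fields = [field.strip() for field in line.split(",")]
    if len(fields) != 4:
        raise ValueError("expected 4 comma-separated fields: %r" % line)
    stream_index, frame_dts, frame_size, crc = fields
    return FrameCrc(int(stream_index), int(frame_dts),
                    int(frame_size), int(crc, 16))


if __name__ == "__main__":
    # Made-up sample line in the documented format.
    print(parse_framecrc_line("0, 3600, 4608, 0x0000abcd"))
```

Comparing the parsed records from two runs frame by frame pinpoints the
first frame at which two decodes diverge.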
image2
Image file muxer.
The image file muxer writes video frames to image files.
Note that the above command does not read or write the out.null file,
but specifying the output file is required by the ffmpeg syntax.
Alternatively you can write the command as:
ffmpeg -benchmark -i INPUT -f null -
matroska
Matroska container muxer.
This muxer implements the matroska and webm container specs.
The recognized metadata settings in this muxer are:
title=title name
Name provided to a single track
language=language name
Specifies the language of the track in the Matroska languages form
stereo_mode=mode
Stereo 3D video layout of two views in a single video track
mono
video is not stereo
left_right
Both views are arranged side by side, Left-eye view is on the
left
bottom_top
Both views are arranged in top-bottom orientation, Left-eye
view is at bottom
top_bottom
Both views are arranged in top-bottom orientation, Left-eye
view is on top
checkerboard_rl
Each view is arranged in a checkerboard interleaved pattern,
Left-eye view being first
checkerboard_lr
Each view is arranged in a checkerboard interleaved pattern,
Right-eye view being first
row_interleaved_rl
Each view is constituted by a row based interleaving, Right-eye
view is first row
row_interleaved_lr
Each view is constituted by a row based interleaving, Left-eye
view is first row
col_interleaved_rl
Both views are arranged in a column based interleaving manner,
Right-eye view is first column
col_interleaved_lr
Both views are arranged in a column based interleaving manner,
Left-eye view is first column
anaglyph_cyan_red
All frames are in anaglyph format viewable through red-cyan
filters
right_left
Both views are arranged side by side, Right-eye view is on the
left
anaglyph_green_magenta
All frames are in anaglyph format viewable through green-magenta
filters
block_lr
Both eyes laced in one Block, Left-eye view is first
block_rl
Both eyes laced in one Block, Right-eye view is first
For example a 3D WebM clip can be created using the following command
line:
ffmpeg -i sample_left_right_clip.mpg -an -c:v libvpx -metadata stereo_mode=left_right -y stereo_clip.webm
segment
Basic stream segmenter.
The segmenter muxer outputs streams to a number of separate files of
nearly fixed duration. Output filename pattern can be set in a fashion
similar to image2.
Every segment starts with a video keyframe, if a video stream is
present. The segment muxer works best with a single constant frame
rate video.
Optionally it can generate a flat list of the created segments, one
segment per line.
segment_format format
Override the inner container format, by default it is guessed by
the filename extension.
segment_time t
Set segment duration to t seconds.
segment_list name
Also generate a listfile named name.
segment_list_size size
Overwrite the listfile once it reaches size entries.
ffmpeg -i in.mkv -c copy -map 0 -f segment -segment_list out.list out%03d.nut
INPUT DEVICES
Input devices are configured elements in FFmpeg which allow access to
the data coming from a multimedia device attached to your system.
When you configure your FFmpeg build, all the supported input devices
are enabled by default. You can list all available ones using the
configure option "--list-indevs".
You can disable all the input devices using the configure option
"--disable-indevs", and selectively enable an input device using the
option "--enable-indev=INDEV", or you can disable a particular input
device using the option "--disable-indev=INDEV".
The option "-formats" of the ff* tools will display the list of
supported input devices (amongst the demuxers).
A description of the currently available input devices follows.
alsa
ALSA (Advanced Linux Sound Architecture) input device.
To enable this input device during configuration you need libasound
installed on your system.
This device allows capturing from an ALSA device. The name of the
device to capture has to be an ALSA card identifier.
An ALSA identifier has the syntax:
hw:<CARD>[,<DEV>[,<SUBDEV>]]
where the DEV and SUBDEV components are optional.
The three arguments (in order: CARD,DEV,SUBDEV) specify card number or
identifier, device number and subdevice number (-1 means any).
To see the list of cards currently recognized by your system check the
files /proc/asound/cards and /proc/asound/devices.
For example to capture with ffmpeg from an ALSA device with card id 0,
you may run the command:
ffmpeg -f alsa -i hw:0 alsaout.wav
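The identifier syntax hw:<CARD>[,<DEV>[,<SUBDEV>]] can be assembled
programmatically, for instance when a script iterates over several cards.
The following Python sketch follows the syntax given above; the function
name is an assumption for this example.

```python
def alsa_identifier(card, dev=None, subdev=None):
    """Build an ALSA identifier of the form hw:CARD[,DEV[,SUBDEV]]."""
    if subdev is not None and dev is None:
        raise ValueError("SUBDEV can only be given together with DEV")
    parts = [str(card)]
    if dev is not None:
        parts.append(str(dev))
    if subdev is not None:
        parts.append(str(subdev))  # -1 means any subdevice
    return "hw:" + ",".join(parts)


if __name__ == "__main__":
    # Argument list equivalent to the capture command shown above.
    print(["ffmpeg", "-f", "alsa", "-i", alsa_identifier(0), "alsaout.wav"])
```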
Set audio device number for devices with same name (starts at 0,
defaults to 0).
Examples
dv1394
Linux DV 1394 input device.
fbdev
Linux framebuffer input device.
The Linux framebuffer is a graphic hardware-independent abstraction
layer to show graphics on a computer monitor, typically on the console.
It is accessed through a file device node, usually /dev/fb0.
For more detailed information read the file
Documentation/fb/framebuffer.txt included in the Linux source tree.
To record from the framebuffer device /dev/fb0 with ffmpeg:
ffmpeg -f fbdev -r 10 -i /dev/fb0 out.avi
You can take a single screenshot image with the command:
ffmpeg -f fbdev -frames:v 1 -r 1 -i /dev/fb0 screenshot.jpeg
See also <http://linux-fbdev.sourceforge.net/>, and fbset(1).
jack
JACK input device.
To enable this input device during configuration you need libjack
installed on your system.
A JACK input device creates one or more JACK writable clients, one for
each audio channel, with name client_name:input_N, where client_name is
the name provided by the application, and N is a number which
identifies the channel. Each writable client will send the acquired
data to the FFmpeg input device.
Once you have created one or more JACK readable clients, you need to
connect them to one or more JACK writable clients.
To connect or disconnect JACK clients you can use the jack_connect and
jack_disconnect programs, or do it through a graphical interface, for
example with qjackctl.
To list the JACK clients and their properties you can invoke the
command jack_lsp.
The following example shows how to capture a JACK readable client
with ffmpeg.
# Create a JACK writable client with name "ffmpeg".
$ ffmpeg -f jack -i ffmpeg -y out.wav
# Start the sample jack_metro readable client.
$ jack_metro -b 120 -d 0.2 -f 4000
# List the current JACK clients.
$ jack_lsp -c
system:capture_1
system:capture_2
system:playback_1
system:playback_2
ffmpeg:input_1
metro:120_bpm
# Connect metro to the ffmpeg writable client.
$ jack_connect metro:120_bpm ffmpeg:input_1
For more information read: <http://jackaudio.org/>
lavfi
Libavfilter input virtual device.
This input device reads data from the open output pads of a libavfilter
filtergraph.
For each filtergraph open output, the input device will create a
corresponding stream which is mapped to the generated output. Currently
only video data is supported. The filtergraph is specified through the
option graph.
Options
graph
Specify the filtergraph to use as input. Each video open output
must be labelled by a unique string of the form "outN", where N is
a number starting from 0 corresponding to the mapped input stream
generated by the device. The first unlabelled output is
automatically assigned to the "out0" label, but all the others need
to be specified explicitly.
If not specified defaults to the filename specified for the input
device.
Examples
As the previous example, but use filename for specifying the graph
description, and omit the "out0" label:
ffplay -f lavfi color=pink
Create three different video test filtered sources and play them:
ffplay -f lavfi -graph "testsrc [out0]; testsrc,hflip [out1]; testsrc,negate [out2]" test3
Read an audio stream from a file using the amovie source and play
it back with ffplay:
ffplay -f lavfi "amovie=test.wav"
Read an audio stream and a video stream and play it back with
ffplay:
ffplay -f lavfi "movie=test.avi[out0];amovie=test.wav[out1]"
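Since every open output has to carry a label of the form "outN", a script
that builds graphs from a list of filter chains can add the labels
automatically. The Python sketch below does this for the graph option
described above; the function name is hypothetical.

```python
def label_outputs(chains):
    """Join filter chains into one graph description, labelling the open
    output of the N-th chain as [outN], with N starting from 0."""
    return "; ".join(
        "%s [out%d]" % (chain, index) for index, chain in enumerate(chains)
    )


if __name__ == "__main__":
    graph = label_outputs(["testsrc", "testsrc,hflip", "testsrc,negate"])
    # Argument list equivalent to the three-source ffplay example above.
    print(["ffplay", "-f", "lavfi", "-graph", graph, "test3"])
```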
libdc1394
IIDC1394 input device, based on libdc1394 and libraw1394.
openal
The OpenAL input device provides audio capture on all systems with a
working OpenAL 1.1 implementation.
To enable this input device during configuration, you need OpenAL
headers and libraries installed on your system, and need to configure
FFmpeg with "--enable-openal".
OpenAL headers and libraries should be provided as part of your OpenAL
implementation, or as an additional download (an SDK). Depending on
your installation you may need to specify additional flags via the
"--extra-cflags" and "--extra-ldflags" for allowing the build system to
locate the OpenAL headers and libraries.
See
OpenAL Soft
Portable, open source (LGPL) software implementation. Includes
backends for the most common sound APIs on the Windows, Linux,
Solaris, and BSD operating systems. See
<http://kcat.strangesoft.net/openal.html>.
Apple
OpenAL is part of Core Audio, the official Mac OS X Audio
interface. See
<http://developer.apple.com/technologies/mac/audio-and-video.html>
This device allows capturing from an audio input device handled
through OpenAL.
You need to specify the name of the device to capture in the provided
filename. If the empty string is provided, the device will
automatically select the default device. You can get the list of the
supported devices by using the option list_devices.
Options
channels
Set the number of channels in the captured audio. Only the values 1
(monaural) and 2 (stereo) are currently supported. Defaults to 2.
sample_size
Set the sample size (in bits) of the captured audio. Only the
values 8 and 16 are currently supported. Defaults to 16.
sample_rate
Set the sample rate (in Hz) of the captured audio. Defaults to
44.1k.
list_devices
If set to true, print a list of devices and exit. Defaults to
false.
Examples
Print the list of OpenAL supported devices and exit:
$ ffmpeg -list_devices true -f openal -i dummy out.ogg
Capture from the OpenAL device DR-BT101 via PulseAudio:
The filename passed as input is the capture driver number, ranging from
0 to 9. You may use "list" as filename to print a list of drivers. Any
other filename will be interpreted as device number 0.
x11grab
X11 video input device.
This device allows capturing a region of an X11 display.
The filename passed as input has the syntax:
[<hostname>]:<display_number>.<screen_number>[+<x_offset>,<y_offset>]
hostname:display_number.screen_number specifies the X11 display name of
the screen to grab from. hostname can be omitted, and defaults to
"localhost". The environment variable DISPLAY contains the default
display name.
x_offset and y_offset specify the offsets of the grabbed area with
respect to the top-left border of the X11 screen. They default to 0.
Check the X11 documentation (e.g. man X) for more detailed information.
Use the xdpyinfo program to get basic information about the
properties of your X11 display (e.g. grep for "name" or "dimensions").
For example to grab from :0.0 using ffmpeg:
ffmpeg -f x11grab -r 25 -s cif -i :0.0 out.mpg
# Grab at position 10,20.
ffmpeg -f x11grab -r 25 -s cif -i :0.0+10,20 out.mpg
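The input filename syntax can be generated from its components as in the
following Python sketch. It follows the syntax stated above; the function
name and parameter names are assumptions for this example.

```python
def x11grab_input(display_number, screen_number, hostname="",
                  x_offset=None, y_offset=None):
    """Build [hostname]:display_number.screen_number[+x_offset,y_offset]."""
    name = "%s:%d.%d" % (hostname, display_number, screen_number)
    if x_offset is not None or y_offset is not None:
        # A missing offset defaults to 0, as described above.
        name += "+%d,%d" % (x_offset or 0, y_offset or 0)
    return name


if __name__ == "__main__":
    # Argument list equivalent to the "grab at position 10,20" example.
    print(["ffmpeg", "-f", "x11grab", "-r", "25", "-s", "cif",
           "-i", x11grab_input(0, 0, x_offset=10, y_offset=20), "out.mpg"])
```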
follow_mouse AVOption
The syntax is:
-follow_mouse centered|<PIXELS>
When it is specified with "centered", the grabbing region follows the
mouse pointer and keeps the pointer at the center of the region;
otherwise, the region follows only when the mouse pointer comes within
PIXELS (greater than zero) of the edge of the region.
For example:
ffmpeg -f x11grab -follow_mouse centered -r 25 -s cif -i :0.0 out.mpg
# Follows only when the mouse pointer reaches within 100 pixels to edge
ffmpeg -f x11grab -follow_mouse 100 -r 25 -s cif -i :0.0 out.mpg
show_region AVOption
OUTPUT DEVICES
Output devices are configured elements in FFmpeg which allow writing
multimedia data to an output device attached to your system.
When you configure your FFmpeg build, all the supported output devices
are enabled by default. You can list all available ones using the
configure option "--list-outdevs".
You can disable all the output devices using the configure option
"--disable-outdevs", and selectively enable an output device using the
option "--enable-outdev=OUTDEV", or you can disable a particular output
device using the option "--disable-outdev=OUTDEV".
The option "-formats" of the ff* tools will display the list of enabled
output devices (amongst the muxers).
A description of the currently available output devices follows.
alsa
ALSA (Advanced Linux Sound Architecture) output device.
oss
OSS (Open Sound System) output device.
sdl
SDL (Simple DirectMedia Layer) output device.
This output device allows showing a video stream in an SDL window.
Only one SDL window is allowed per application, so you can have only
one instance of this output device in an application.
To enable this output device you need libsdl installed on your system
when configuring your build.
For more information about SDL, check: <http://www.libsdl.org/>
Options
window_title
Set the SDL window title; if not specified, it defaults to the filename
specified for the output device.
icon_title
Set the name of the iconified SDL window; if not specified, it is
set to the same value as window_title.
window_size
Set the SDL window size, can be a string of the form widthxheight
or a video size abbreviation. If not specified it defaults to the
size of the input video.
Examples
The following command shows the ffmpeg output in an SDL window, forcing
its size to the qcif format:
ffmpeg -i INPUT -vcodec rawvideo -pix_fmt yuv420p -window_size qcif -f sdl "SDL output"
sndio
sndio audio output device.
PROTOCOLS
Protocols are configured elements in FFmpeg that allow access to
resources which require the use of a particular protocol.
When you configure your FFmpeg build, all the supported protocols are
enabled by default. You can list all available ones using the configure
option "--list-protocols".
You can disable all the protocols using the configure option
"--disable-protocols", and selectively enable a protocol using the
option "--enable-protocol=PROTOCOL", or you can disable a particular
protocol using the option "--disable-protocol=PROTOCOL".
The option "-protocols" of the ff* tools will display the list of
supported protocols.
A description of the currently available protocols follows.
applehttp
Read Apple HTTP Live Streaming compliant segmented stream as a uniform
one. The M3U8 playlists describing the segments can be remote HTTP
resources or local files, accessed using the standard file protocol.
HTTP is the default; a specific protocol can be declared by specifying
"+proto" after the applehttp URI scheme name, where proto is either
"file" or "http".
applehttp://host/path/to/remote/resource.m3u8
applehttp+http://host/path/to/remote/resource.m3u8
applehttp+file://path/to/local/resource.m3u8
concat
Physical concatenation protocol.
Read and seek from many resources in sequence as if they were a
single resource.
A URL accepted by this protocol has the syntax:
concat:<URL1>|<URL2>|...|<URLN>
where URL1, URL2, ..., URLN are the URLs of the resources to be
concatenated, each one possibly specifying a distinct protocol.
For example to read a sequence of files split1.mpeg, split2.mpeg,
split3.mpeg with ffplay use the command:
ffplay concat:split1.mpeg\|split2.mpeg\|split3.mpeg
Note that you may need to escape the character "|" which is special for
many shells.
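As a rough illustration of the syntax above, the following Python sketch
joins a list of resource names into a concat URL and quotes it for a
POSIX shell. The helper name build_concat_url is hypothetical and not
part of FFmpeg, and quoting the whole URL with shlex.quote is used here
as an alternative to backslash-escaping each "|".

```python
import shlex


def build_concat_url(urls):
    """Join resource URLs into one concat: URL (hypothetical helper)."""
    if not urls:
        raise ValueError("at least one URL is required")
    return "concat:" + "|".join(urls)


url = build_concat_url(["split1.mpeg", "split2.mpeg", "split3.mpeg"])
# Quote the whole URL so the shell does not treat "|" as a pipe.
command = "ffplay " + shlex.quote(url)
print(command)
```

The printed command can be pasted into a shell; when the arguments are
passed as a list to a process-spawning API instead, no quoting is needed.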
file
File access protocol.
Read from or write to a file.
For example to read from a file input.mpeg with ffmpeg use the command:
ffmpeg -i file:input.mpeg output.mpeg
The ff* tools default to the file protocol, that is a resource
specified with the name "FILE.mpeg" is interpreted as the URL
"file:FILE.mpeg".
gopher
Gopher protocol.
http
HTTP (Hyper Text Transfer Protocol).
mmst
MMS (Microsoft Media Server) protocol over TCP.
mmsh
MMS (Microsoft Media Server) protocol over HTTP.
The required syntax is:
mmsh://<server>[:<port>][/<app>][/<playpath>]
md5
MD5 output protocol.
Computes the MD5 hash of the data to be written, and on close writes
this to the designated output or stdout if none is specified. It can be
used to test muxers without writing an actual file.
Some examples follow.
# Write the MD5 hash of the encoded AVI file to the file output.avi.md5.
ffmpeg -i input.flv -f avi -y md5:output.avi.md5
# Write the MD5 hash of the encoded AVI file to stdout.
ffmpeg -i input.flv -f avi -y md5:
Note that some formats (typically MOV) require the output protocol to
be seekable, so they will fail with the MD5 output protocol.
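The behaviour described above can be pictured with a small Python
sketch: a write-only sink that hashes whatever is written to it and
reports the digest on close. The class name Md5Sink and its interface
are assumptions made for illustration; this is not FFmpeg code and it
does not reproduce the exact text that ffmpeg writes.

```python
import hashlib


class Md5Sink:
    """Accumulate an MD5 digest of written data instead of storing it."""

    def __init__(self):
        self._md5 = hashlib.md5()

    def write(self, data: bytes) -> int:
        # Data is hashed and discarded; nothing can be read back or rewritten.
        self._md5.update(data)
        return len(data)

    def close(self) -> str:
        # The digest is only meaningful once all data has been written.
        return self._md5.hexdigest()


sink = Md5Sink()
sink.write(b"hello ")
sink.write(b"world")
digest = sink.close()
print(digest)
```

Because the sink offers no way to seek back and patch earlier bytes, it
also hints at why formats that need a seekable output cannot be used
with it.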
pipe
UNIX pipe access protocol.
Read from and write to UNIX pipes.
The accepted syntax is:
pipe:[<number>]
number is the number corresponding to the file descriptor of the pipe
(e.g. 0 for stdin, 1 for stdout, 2 for stderr). If number is not
specified, by default the stdout file descriptor will be used for
writing, stdin for reading.
For example to read from stdin with ffmpeg:
cat test.wav | ffmpeg -i pipe:0
# ...this is the same as...
cat test.wav | ffmpeg -i pipe:
For writing to stdout with ffmpeg:
ffmpeg -i test.wav -f avi pipe:1 | cat > test.avi
# ...this is the same as...
ffmpeg -i test.wav -f avi pipe: | cat > test.avi
Note that some formats (typically MOV), require the output protocol to
be seekable, so they will fail with the pipe output protocol.
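The default rule for the file descriptor number can be sketched in
Python as follows. The function pipe_fd is a hypothetical helper written
for this illustration; it only mirrors the rule stated above and is not
how ffmpeg itself parses the URL.

```python
def pipe_fd(url: str, writing: bool) -> int:
    """Return the file descriptor number selected by a pipe: URL."""
    prefix = "pipe:"
    if not url.startswith(prefix):
        raise ValueError("not a pipe URL: " + url)
    number = url[len(prefix):]
    if number == "":
        # No number given: stdout (1) when writing, stdin (0) when reading.
        return 1 if writing else 0
    return int(number)


print(pipe_fd("pipe:", writing=False))
print(pipe_fd("pipe:", writing=True))
```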
rtmp
Real-Time Messaging Protocol.
The Real-Time Messaging Protocol (RTMP) is used for streaming
multimedia content across a TCP/IP network.
The required syntax is:
rtmp://<server>[:<port>][/<app>][/<playpath>]
The accepted parameters are:
server
The address of the RTMP server.
port
The number of the TCP port to use (default is 1935).
app
The name of the application to access. It usually corresponds
to the path where the application is installed on the RTMP server
(e.g. /ondemand/, /flash/live/, etc.).
playpath
The path or name of the resource to play with reference to
the application specified in app; it may be prefixed by "mp4:".
For example to read with ffplay a multimedia resource named "sample"
from the application "vod" from an RTMP server "myserver":
ffplay rtmp://myserver/vod/sample
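To make the URL layout concrete, here is a minimal Python sketch that
assembles an RTMP URL from the parameters described above. The function
build_rtmp_url is hypothetical; it only concatenates the parts and does
no validation or escaping.

```python
def build_rtmp_url(server, port=None, app=None, playpath=None):
    """Assemble rtmp://<server>[:<port>][/<app>][/<playpath>]."""
    url = "rtmp://" + server
    if port is not None:
        url += ":" + str(port)
    if app:
        url += "/" + app.strip("/")
    if playpath:
        url += "/" + playpath.lstrip("/")
    return url


print(build_rtmp_url("myserver", app="vod", playpath="sample"))
```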
rtmp, rtmpe, rtmps, rtmpt, rtmpte
Real-Time Messaging Protocol and its variants supported through
librtmp.
Requires the presence of the librtmp headers and library during
configuration. You need to explicitly configure the build with
"--enable-librtmp". If enabled this will replace the native RTMP
protocol.
This protocol provides most client functions and a few server functions
needed to support RTMP, RTMP tunneled in HTTP (RTMPT), encrypted RTMP
(RTMPE), RTMP over SSL/TLS (RTMPS) and tunneled variants of these
encrypted types (RTMPTE, RTMPTS).
The required syntax is:
<rtmp_proto>://<server>[:<port>][/<app>][/<playpath>] <options>
where rtmp_proto is one of the strings "rtmp", "rtmpt", "rtmpe",
"rtmps", "rtmpte", "rtmpts" corresponding to each RTMP variant, and
server, port, app and playpath have the same meaning as specified for
the RTMP native protocol. options contains a list of space-separated
options of the form key=val.
See the librtmp manual page (man 3 librtmp) for more information.
For example, to stream a file in real-time to an RTMP server using
ffmpeg:
ffmpeg -re -i myfile -f flv rtmp://myserver/live/mystream
AUDIO FILTERS
When you configure your FFmpeg build, you can disable any of the
amerge
Merge two audio streams into a single multi-channel stream.
This filter does not need any argument.
If the channel layouts of the inputs are disjoint, and therefore
compatible, the channel layout of the output will be set accordingly
and the channels will be reordered as necessary. If the channel layouts
of the inputs are not disjoint, the output will have all the channels
of the first input then all the channels of the second input, in that
order, and the channel layout of the output will be the default value
corresponding to the total number of channels.
For example, if the first input is in 2.1 (FL+FR+LF) and the second
input is FC+BL+BR, then the output will be in 5.1, with the channels in
the following order: a1, a2, b1, a3, b2, b3 (a1 is the first channel of
the first input, b1 is the first channel of the second input).
On the other hand, if both inputs are in stereo, the output channels
will be in the default order: a1, a2, b1, b2, and the channel layout
will be arbitrarily set to 4.0, which may or may not be the expected
value.
Both inputs must have the same sample rate, format and packing.
If inputs do not have the same duration, the output will stop with the
shortest.
Example: merge two mono files into a stereo stream:
amovie=left.wav [l] ; amovie=right.mp3 [r] ; [l] [r] amerge
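The channel ordering rules described for amerge can be sketched in
Python. This is an illustration only: the function amerge_order and the
table ASSUMED_ORDER are hypothetical, and the table covers just the six
channel names used in the 2.1 plus FC+BL+BR example, in the order that
example implies.

```python
# Assumed layout order for the six channels of the example in the text;
# "LF" is the low-frequency channel name as written in this section.
ASSUMED_ORDER = ["FL", "FR", "FC", "LF", "BL", "BR"]


def amerge_order(first, second):
    """Return labels (a1, b1, ...) in output order for two input layouts."""
    labelled = [(ch, "a%d" % (i + 1)) for i, ch in enumerate(first)]
    labelled += [(ch, "b%d" % (i + 1)) for i, ch in enumerate(second)]
    if set(first) & set(second):
        # Overlapping layouts: all channels of the first input, then the second.
        return [label for _, label in labelled]
    # Disjoint layouts: reorder the channels to follow the assumed layout order.
    ordered = sorted(labelled, key=lambda pair: ASSUMED_ORDER.index(pair[0]))
    return [label for _, label in ordered]


print(amerge_order(["FL", "FR", "LF"], ["FC", "BL", "BR"]))
print(amerge_order(["FL", "FR"], ["FL", "FR"]))
```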
anull
Pass the audio source unchanged to the output.
aresample
Resample the input audio to the specified sample rate.
The filter accepts exactly one parameter, the output sample rate. If
not specified then the filter will automatically convert between its
input and output sample rates.
For example, to resample the input audio to 44100Hz:
aresample=44100
ashowinfo
Show a line containing various information for each input audio frame.
The input audio is not modified.
The shown line contains a sequence of key/value pairs of the form
key:value.
outdef
output channel specification, of the form:
"out_name=[gain*]in_name[+[gain*]in_name...]"
out_name
output channel to define, either a channel name (FL, FR, etc.) or a
channel number (c0, c1, etc.)
gain
multiplicative coefficient for the channel, 1 leaving the volume
unchanged
in_name
input channel to use, see out_name for details; it is not possible
to mix named and numbered input channels
If the = in a channel specification is replaced by <, then the
gains for that specification will be renormalized so that the total is
1, thus avoiding clipping noise.
Mixing examples
For example, if you want to down-mix from stereo to mono, but with a
bigger factor for the left channel:
pan=1:c0=0.9*c0+0.1*c1
A customized down-mix to stereo that works automatically for 3-, 4-, 5- and 7-channel surround:
pan=stereo: FL < FL + 0.5*FC + 0.6*BL + 0.6*SL : FR < FR + 0.5*FC + 0.6*BR + 0.6*SR
Note that ffmpeg integrates a default down-mix (and up-mix) system that
should be preferred (see "-ac" option) unless you have very specific
needs.
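The effect of writing "<" instead of "=" in a channel specification can
be pictured with a short Python sketch that rescales a set of gains so
that they sum to 1. The function renormalize is hypothetical and only
models the arithmetic stated above; how the filter treats channels that
are absent from the input is not modelled.

```python
def renormalize(gains):
    """Scale gains so that their total is 1."""
    total = sum(gains.values())
    if total == 0:
        # Nothing to scale; return the gains unchanged.
        return dict(gains)
    return {name: gain / total for name, gain in gains.items()}


# Gains of the FL output in the customized stereo down-mix above.
front_left = renormalize({"FL": 1.0, "FC": 0.5, "BL": 0.6, "SL": 0.6})
print(front_left)
```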
Remapping examples
The channel remapping will be effective if, and only if:
* gain coefficients are zeroes or ones,
* only one input per channel output,
* the number of output channels is supported by libswresample (16 at
the moment)
If all these conditions are satisfied, the filter will notify the user
("Pure channel mapping detected"), and use an optimized and lossless
method to do the remapping.
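The three conditions can be expressed as a small Python check. This is a
sketch under assumptions: the function is_pure_remap and its input
format (a mapping from output channel to a list of (gain, input) pairs)
are invented for illustration and do not reflect the filter's internal
data structures.

```python
MAX_OUTPUT_CHANNELS = 16  # limit quoted in the text above


def is_pure_remap(outputs):
    """Return True if every output is a plain copy (or mute) of one input."""
    if len(outputs) > MAX_OUTPUT_CHANNELS:
        return False
    for terms in outputs.values():
        if len(terms) > 1:
            # More than one input feeds this output channel.
            return False
        if any(gain not in (0, 1) for gain, _ in terms):
            # A gain other than zero or one means real mixing is needed.
            return False
    return True


print(is_pure_remap({"c0": [(1, "FL")], "c1": [(1, "FR")]}))
print(is_pure_remap({"c0": [(0.9, "c0"), (0.1, "c1")]}))
```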
For example, if you have a 5.1 source and want a stereo audio stream by
dropping the extra channels:
pan="stereo: c0=FL : c1=FR"
Given the same source, you can also switch front left and front right
channels and keep the input channel layout:
pan="5.1: c0=c1 : c1=c0 : c2=c2 : c3=c3 : c4=c4 : c5=c5"
If the input is a stereo audio stream, you can mute the front left
channel (and still keep the stereo channel layout) with:
pan="stereo:c1=c1"
Still with a stereo audio stream input, you can copy the right channel
in both front left and right:
pan="stereo: c0=FR : c1=FR"
silencedetect
Detect silence in an audio stream.
This filter logs a message when it detects that the input audio volume
is less than or equal to a noise tolerance value for a duration greater
than or equal to the minimum detected noise duration.
The printed times and duration are expressed in seconds.
duration, d
Set silence duration until notification (default is 2 seconds).
noise, n
Set noise tolerance. Can be specified in dB (in case "dB" is
appended to the specified value) or amplitude ratio. Default is
-60dB, or 0.001.
Detect 5 seconds of silence with -50dB noise tolerance:
silencedetect=n=-50dB:d=5
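The relation between the two noise notations, and the detection rule
itself, can be sketched in Python. The dB conversion follows from the
stated default (-60dB corresponds to 0.001). The function find_silence
is a hypothetical per-sample model written for illustration; how the
filter actually measures volume is not described in this section.

```python
def db_to_amplitude(db):
    """Convert a dB value to an amplitude ratio (-60 dB gives 0.001)."""
    return 10 ** (db / 20)


def find_silence(samples, sample_rate, noise=0.001, duration=2.0):
    """Return (start, end) times in seconds of sufficiently long quiet runs."""
    runs = []
    start = None
    for i, value in enumerate(samples):
        if abs(value) <= noise:
            if start is None:
                start = i  # a quiet run begins here
        else:
            if start is not None and (i - start) / sample_rate >= duration:
                runs.append((start / sample_rate, i / sample_rate))
            start = None
    # Handle a quiet run that lasts until the end of the input.
    if start is not None and (len(samples) - start) / sample_rate >= duration:
        runs.append((start / sample_rate, len(samples) / sample_rate))
    return runs


# One second of signal, three seconds of silence, one second of signal.
samples = [0.5] * 10 + [0.0] * 30 + [0.5] * 10
print(find_silence(samples, sample_rate=10))
```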
Complete example with ffmpeg to detect silence with 0.0001 noise
tolerance in silence.mp3:
ffmpeg -f lavfi -i amovie=silence.mp3,silencedetect=noise=0.0001 -f null -
volume
Adjust the input audio volume.
The filter accepts exactly one parameter vol, which expresses how the
audio volume will be increased or decreased.
Output values are clipped to the maximum value.
If vol is expressed as a decimal number, the output audio volume is
given by the relation:
<output_volume> = <vol> * <input_volume>
If vol is expressed as a decimal number followed by the string "dB",
the value represents the requested change in decibels of the input
audio power, and the output audio volume is given by the relation:
AUDIO SOURCES
Below is a description of the currently available audio sources.
abuffer
Buffer audio frames, and make them available to the filter chain.
This source is mainly intended for a programmatic use, in particular
through the interface defined in libavfilter/asrc_abuffer.h.
It accepts the following mandatory parameters:
sample_rate:sample_fmt:channel_layout:packing
sample_rate
The sample rate of the incoming audio buffers.
sample_fmt
The sample format of the incoming audio buffers. Either a sample
format name or its corresponding integer representation from the
enum AVSampleFormat in libavutil/samplefmt.h
channel_layout
The channel layout of the incoming audio buffers. Either a channel
layout name from channel_layout_map in libavutil/audioconvert.c or
its corresponding integer representation from the AV_CH_LAYOUT_*
macros in libavutil/audioconvert.h
packing
Either "packed" or "planar", or their integer representation: 0 or
1 respectively.
For example:
abuffer=44100:s16:stereo:planar
will instruct the source to accept planar 16bit signed stereo at
44100Hz. Since the sample format with name "s16" corresponds to the
number 1 and the "stereo" channel layout corresponds to the value 3,
this is equivalent to:
abuffer=44100:1:3:1
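The equivalence between the named and the numeric form can be sketched
in Python. The lookup tables below contain only the values quoted in the
text above (s16 is 1, stereo is 3, packed is 0, planar is 1); the
function abuffer_args is a hypothetical helper, not part of libavfilter.

```python
# Only the values mentioned in this section; real builds define many more.
SAMPLE_FMT_IDS = {"s16": 1}
CHANNEL_LAYOUT_IDS = {"stereo": 3}
PACKING_IDS = {"packed": 0, "planar": 1}


def abuffer_args(sample_rate, sample_fmt, channel_layout, packing, numeric=False):
    """Build the abuffer argument string, by name or by integer value."""
    if numeric:
        sample_fmt = SAMPLE_FMT_IDS[sample_fmt]
        channel_layout = CHANNEL_LAYOUT_IDS[channel_layout]
        packing = PACKING_IDS[packing]
    return "abuffer=%s:%s:%s:%s" % (sample_rate, sample_fmt, channel_layout, packing)


print(abuffer_args(44100, "s16", "stereo", "planar"))
print(abuffer_args(44100, "s16", "stereo", "planar", numeric=True))
```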
aevalsrc
Generate an audio signal specified by an expression.
This source accepts as input one or more expressions (one for each
channel), which are evaluated and used to generate a corresponding
audio signal.
It accepts the syntax: exprs[::options]. exprs is a list of
expressions separated by ":", one for each separate channel. The output
channel layout depends on the number of provided expressions, up to 8
channels are supported.
options is an optional sequence of key=value pairs, separated by ":".
The description of the accepted options follows.
duration, d
Set the minimum duration of the sourced audio. See the function
"av_parse_time()" for the accepted format. Note that the resulting
duration may be greater than the specified duration, as the
generated audio is always cut at the end of a complete frame.
If not specified, or the expressed duration is negative, the audio
is supposed to be generated forever.
nb_samples, n
Set the number of samples per channel per each output frame,
default to 1024.
sample_rate, s
Specify the sample rate, default to 44100.
Each expression in exprs can contain the following constants:
n
sample rate
Examples
Generate silence:
aevalsrc=0
Generate a sine signal with a frequency of 440 Hz, and set the sample
rate to 8000 Hz:
aevalsrc="sin(440*2*PI*t)::s=8000"
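What the sine example computes can be approximated in plain Python. The
sketch assumes that t is the time of each sample in seconds, that is,
the sample index divided by the sample rate; the function eval_source is
hypothetical and evaluates a Python callable rather than an FFmpeg
expression string.

```python
import math


def eval_source(expr, sample_rate=44100, nb_samples=1024):
    """Evaluate expr(t) for one frame of nb_samples samples."""
    # Assumption: t is the sample index divided by the sample rate.
    return [expr(n / sample_rate) for n in range(nb_samples)]


# Same expression as sin(440*2*PI*t) with s=8000, first 8 samples only.
tone = eval_source(lambda t: math.sin(440 * 2 * math.pi * t),
                   sample_rate=8000, nb_samples=8)
print(tone)
```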
amovie
Read an audio stream from a movie container.
It accepts the syntax: movie_name[:options] where movie_name is the
name of the resource to read (not necessarily a file but also a device
or a stream accessed through some protocol), and options is an optional
sequence of key=value pairs, separated by ":".
The description of the accepted options follows.
format_name, f
Specify the format assumed for the movie to read, and can be either
the name of a container or an input device. If not specified the
format is guessed from movie_name or by probing.
seek_point, sp
Specify the seek point in seconds. The frames will be output
starting from this seek point. The parameter is evaluated with
"av_strtod", so the numerical value may be suffixed by an SI
postfix. Default value is "0".
stream_index, si
Specify the index of the audio stream to read. If the value is -1,
the best suited audio stream will be automatically selected.
Default value is "-1".
anullsrc
Null audio source, return unprocessed audio frames. It is mainly useful
as a template and to be employed in analysis / debugging tools, or as
the source for filters which ignore the input data (for example the sox
synth filter).
It accepts an optional sequence of key=value pairs, separated by ":".
Apply a boxblur filter with luma, chroma, and alpha radius set to 2:
boxblur=2:1
copy
Copy the input source unchanged to the output. Mainly useful for
testing purposes.
crop
Crop the input video to out_w:out_h:x:y.
The parameters are expressions containing the following constants:
x, y
the computed values for x and y. They are evaluated for each new
frame.
in_w, in_h
the input width and height
iw, ih
same as in_w and in_h
out_w, out_h
the output (cropped) width and height
ow, oh
same as out_w and out_h
a
same as iw / ih
dar
input display aspect ratio, it is the same as (iw / ih) * sar
hsub, vsub
horizontal and vertical chroma subsample values. For example for
the pixel format "yuv422p" hsub is 2 and vsub is 1.
n
pos
the position in the file of the input frame, NAN if unknown
t
timestamp expressed in seconds, NAN if the input timestamp is
unknown
The out_w and out_h parameters specify the expressions for the width
and height of the output (cropped) video. They are evaluated just at
the configuration of the filter.
The default value of out_w is "in_w", and the default value of out_h is
"in_h".
The expression for out_w may depend on the value of out_h, and the
expression for out_h may depend on out_w, but they cannot depend on x
and y, as x and y are evaluated after out_w and out_h.
The x and y parameters specify the expressions for the position of the
top-left corner of the output (non-cropped) area. They are evaluated
for each frame. If the evaluated value is not valid, it is approximated
to the nearest valid value.
The default value of x is "(in_w-out_w)/2", and the default value for y
is "(in_h-out_h)/2", which set the cropped area at the center of the
input image.
The expression for x may depend on y, and the expression for y may
depend on x.
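The default values can be checked with a few lines of Python. The
function crop_rect is a hypothetical helper that works on plain numbers
rather than expressions, and the clamping step is only one possible
reading of "approximated to the nearest valid value".

```python
def crop_rect(in_w, in_h, out_w=None, out_h=None, x=None, y=None):
    """Return (out_w, out_h, x, y) with the documented defaults applied."""
    out_w = in_w if out_w is None else out_w
    out_h = in_h if out_h is None else out_h
    # Default position: centre the cropped area in the input image.
    x = (in_w - out_w) / 2 if x is None else x
    y = (in_h - out_h) / 2 if y is None else y
    # Assumption: invalid positions are clamped into the input frame.
    x = min(max(x, 0), in_w - out_w)
    y = min(max(y, 0), in_h - out_h)
    return out_w, out_h, x, y


print(crop_rect(640, 480, 100, 100))
```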
Some examples follow:
# crop the central input area with size 100x100
crop=100:100
# crop the central input area with size 2/3 of the input video
"crop=2/3*in_w:2/3*in_h"
# crop the input video central square
crop=in_h
# delimit the rectangle with the top-left corner placed at position
# 100:100 and the right-bottom corner corresponding to the right-bottom
# corner of the input image.
crop=in_w-100:in_h-100:100:100
# crop 10 pixels from the left and right borders, and 20 pixels from
(and sometimes something even uglier appears - your mileage may vary).
The filter accepts parameters as a string of the form "x:y:w:h:band",
or as a list of key=value pairs, separated by ":".
The description of the accepted parameters follows.
x, y
Specify the top left corner coordinates of the logo. They must be
specified.
w, h
Specify the width and height of the logo to clear. They must be
specified.
band, t
Specify the thickness of the fuzzy edge of the rectangle (added to
w and h). The default value is 4.
show
When set to 1, a green rectangle is drawn on the screen to simplify
finding the right x, y, w, h parameters, and band is set to 4. The
default value is 0.
Some examples follow.
Set a rectangle covering the area with top left corner coordinates
0,0 and size 100x77, setting a band of size 10:
delogo=0:0:100:77:10
deshake
Attempt to fix small changes in horizontal and/or vertical shift. This
filter helps remove camera shake from hand-holding a camera, bumping a
tripod, moving on a vehicle, etc.
The filter accepts parameters as a string of the form
"x:y:w:h:rx:ry:edge:blocksize:contrast:search:filename"
A description of the accepted parameters follows.
x, y, w, h
Optionally limit the search for motion vectors to a rectangular
area of the frame defined by its top left corner, width and
height. These parameters have the same meaning as in the drawbox
filter, which can be used to visualise the position of the
bounding box.
within the video frame. They are relative to the top/left border of
the output image.
The default value of x and y is "0".
See below for the list of accepted constants.
fontsize
The font size to be used for drawing text. The default value of
fontsize is 16.
fontcolor
The color to be used for drawing fonts. Either a string (e.g.
"red") or in 0xRRGGBB[AA] format (e.g. "0xff000033"), possibly
followed by an alpha specifier. The default value of fontcolor is
"black".
boxcolor
The color to be used for drawing box around text. Either a string
(e.g. "yellow") or in 0xRRGGBB[AA] format (e.g. "0xff00ff"),
possibly followed by an alpha specifier. The default value of
boxcolor is "white".
box
Used to draw a box around text using background color. Value
should be either 1 (enable) or 0 (disable). The default value of
box is 0.
shadowx, shadowy
The x and y offsets for the text shadow position with respect to
the position of the text. They can be either positive or negative
values. Default value for both is "0".
shadowcolor
The color to be used for drawing a shadow behind the drawn text.
It can be a color name (e.g. "yellow") or a string in the
0xRRGGBB[AA] form (e.g. "0xff00ff"), possibly followed by an alpha
specifier. The default value of shadowcolor is "black".
ft_load_flags
Flags to be used for loading the fonts.
The flags map the corresponding flags supported by libfreetype, and
are a combination of the following values:
default
no_scale
no_hinting
render
no_bitmap
vertical_layout
force_autohint
crop_bitmap
pedantic
ignore_global_advance_width
no_recurse
ignore_transform
monochrome
linear_design
no_autohint
Default value is "render".
For more information consult the documentation for the FT_LOAD_*
libfreetype flags.
tabsize
The size in number of spaces to use for rendering the tab. Default
value is 4.
t
timestamp expressed in seconds, NAN if the input timestamp is
unknown
timecode
initial timecode representation in "hh:mm:ss[:;.]ff" format. It can
be used with or without the text parameter. The rate option must be
specified. Note that timecode options are not effective if FFmpeg
is built with "--disable-avcodec".
r, rate
frame rate (timecode only)
Some examples follow.
Draw "Test Text" with font FreeSerif, using the default values for
the optional parameters.
drawtext="fontfile=/usr/share/fonts/truetype/freefont/FreeSerif.ttf: text='Test Text'"
Show a text line sliding from right to left in the last row of the
video frame. The file LONG_LINE is assumed to contain a single line
with no newlines.
drawtext=fontsize=15:fontfile=FreeSerif.ttf:textfile=LONG_LINE:y=h-line_h:x=-50*t
Show the content of file CREDITS off the bottom of the frame and
scroll up.
drawtext=fontsize=20:fontfile=FreeSerif.ttf:textfile=CREDITS:y=h-20*t
Draw a single green letter "g", at the center of the input video.
The glyph baseline is placed at half screen height.
drawtext=fontsize=60:fontfile=FreeSerif.ttf:fontcolor=green:text=g:x=(w-max_glyph_w)/2:y=h/2-ascent
For more information about libfreetype, check:
<http://www.freetype.org/>.
fade
Apply fade-in/out effect to input video.
It accepts the parameters: type:start_frame:nb_frames[:options]
type specifies the effect type, which can be either "in" for a fade-in
or "out" for a fade-out effect.
start_frame specifies the number of the start frame for starting to
apply the fade effect.
nb_frames specifies the number of frames for which the fade effect has
to last. At the end of the fade-in effect the output video will have
the same intensity as the input video, at the end of the fade-out
transition the output video will be completely black.
options is an optional sequence of key=value pairs, separated by ":".
The description of the accepted options follows.
type, t
See type.
start_frame, s
See start_frame.
nb_frames, n
See nb_frames.
alpha
If set to 1, fade only alpha channel, if one exists on the input.
Default value is 0.
A few usage examples follow, usable too as test scenarios.
# fade in first 30 frames of video
fade=in:0:30
# fade out last 45 frames of a 200-frame video
fade=out:155:45
# fade in first 25 frames and fade out last 25 frames of a 1000-frame video
fade=in:0:25, fade=out:975:25
# make first 5 frames black, then fade in from frame 5-24
fade=in:5:20
# fade in alpha over first 25 frames of video
fade=in:0:25:alpha=1
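The start frame of a fade-out that ends on the last frame follows from
simple arithmetic, sketched here in Python. The function fade_out_last
is a hypothetical helper that only builds the filter string; it assumes
the total number of frames is known in advance.

```python
def fade_out_last(total_frames, nb_frames):
    """Build a fade-out covering the last nb_frames of the video."""
    start_frame = total_frames - nb_frames
    if start_frame < 0:
        raise ValueError("fade is longer than the video")
    return "fade=out:%d:%d" % (start_frame, nb_frames)


print(fade_out_last(200, 45))
print(fade_out_last(1000, 25))
```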
fieldorder
Transform the field order of the input video.
It accepts one parameter which specifies the required field order that
the input interlaced video will be transformed to. The parameter can
assume one of the following values:
0 or bff
output bottom field first
1 or tff
output top field first
Default value is "tff".
Transformation is achieved by shifting the picture content up or down
by one line, and filling the remaining line with appropriate picture
content. This method is consistent with most broadcast field order
converters.
If the input video is not flagged as being interlaced, or it is already
flagged as being of the required output field order then this filter
does not alter the incoming video.
This filter is very useful when converting to or from PAL DV material,
which is bottom field first.
For example:
ffmpeg -i in.vob -vf "fieldorder=bff" out.dv
fifo
Buffer input images and send them when they are requested.
This filter is mainly useful when auto-inserted by the libavfilter
framework.
The filter does not take parameters.
format
Convert the input video to one of the specified pixel formats.
Libavfilter will try to pick one that is supported for the input to the
next filter.
The filter accepts a list of pixel format names, separated by ":", for
example "yuv420p:monow:rgb24".
Some examples follow:
# convert the input video to the format "yuv420p"
format=yuv420p
# convert the input video to any of the formats in the list
format=yuv420p:yuv444p:yuv410p
frei0r
Apply a frei0r effect to the input video.
To enable compilation of this filter you need to install the frei0r
header and configure FFmpeg with "--enable-frei0r".
The filter supports the syntax:
<filter_name>[{:|=}<param1>:<param2>:...:<paramN>]
filter_name is the name of the frei0r effect to load. If the
environment variable FREI0R_PATH is defined, the frei0r effect is
searched in each one of the directories specified by the colon
separated list in FREI0R_PATH, otherwise in the standard frei0r paths,
which are in this order: HOME/.frei0r-1/lib/, /usr/local/lib/frei0r-1/,
/usr/lib/frei0r-1/.
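The search order can be sketched in Python as follows. The function
frei0r_search_dirs is hypothetical and only mirrors the rule stated
above; treating an empty FREI0R_PATH like an undefined one is an
assumption of this sketch.

```python
import os


def frei0r_search_dirs(environ=None, home="HOME"):
    """Return the directories searched for a frei0r effect, in order."""
    environ = os.environ if environ is None else environ
    path = environ.get("FREI0R_PATH")
    if path:
        # Colon-separated list, searched in the order given.
        return path.split(":")
    return [
        home + "/.frei0r-1/lib/",
        "/usr/local/lib/frei0r-1/",
        "/usr/lib/frei0r-1/",
    ]


print(frei0r_search_dirs({"FREI0R_PATH": "/opt/frei0r:/tmp/effects"}))
```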
param1, param2, ... , paramN specify the parameters for the frei0r
effect.
A frei0r effect parameter can be a boolean (whose values are specified
with "y" and "n"), a double, a color (specified by the syntax R/G/B, R,
G, and B being float numbers from 0.0 to 1.0, or by an
"av_parse_color()" color description), a position (specified by the
syntax X/Y, X and Y being float numbers) and a string.
The number and kind of parameters depend on the loaded effect. If an
effect parameter is not specified the default value is set.
Some examples follow:
# apply the distort0r effect, set the first two double parameters
frei0r=distort0r:0.5:0.01
# apply the colordistance effect, takes a color as first parameter
frei0r=colordistance:0.2/0.3/0.4
frei0r=colordistance:violet
frei0r=colordistance:0x112233
# apply the perspective effect, specify the top left and top right
# image positions
frei0r=perspective:0.2/0.2:0.8/0.2
For more information see: <http://piksel.org/frei0r>
gradfun
Fix the banding artifacts that are sometimes introduced into nearly
flat regions by truncation to 8bit color depth. Interpolate the
gradients that should go where the bands are, and dither them.
This filter is designed for playback only. Do not use it prior to
lossy compression, because compression tends to lose the dither and
bring back the bands.
The filter takes two optional parameters, separated by ":":
strength:radius
strength is the maximum amount by which the filter will change any one
pixel. Also the threshold for detecting nearly flat regions. Acceptable
values range from .51 to 255, default value is 1.2, out-of-range values
will be clipped to the valid range.
radius is the neighborhood to fit the gradient to. A larger radius
makes for smoother gradients, but also prevents the filter from
modifying the pixels near detailed regions. Acceptable values are 8-32,
default value is 16, out-of-range values will be clipped to the valid
range.
# default parameters
gradfun=1.2:16
# omitting radius
gradfun=1.2
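The clipping of out-of-range values can be illustrated with a short
Python sketch. The function gradfun_params is a hypothetical helper that
applies the ranges and defaults quoted above; it does not perform any
filtering.

```python
def gradfun_params(strength=1.2, radius=16):
    """Clip strength to [0.51, 255] and radius to [8, 32]."""
    strength = min(max(strength, 0.51), 255)
    radius = min(max(radius, 8), 32)
    return strength, radius


print(gradfun_params())
print(gradfun_params(0.1, 100))
```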
hflip
Flip the input video horizontally.
For example to horizontally flip the input video with ffmpeg:
ffmpeg -i in.avi -vf "hflip" out.avi
hqdn3d
High precision/quality 3d denoise filter. This filter aims to reduce
image noise producing smooth images and making still images really
still. It should enhance compressibility.
It accepts the following optional parameters:
luma_spatial:chroma_spatial:luma_tmp:chroma_tmp
luma_spatial
a non-negative float number which specifies spatial luma strength,
defaults to 4.0
chroma_spatial
c0
first pixel component
c1
second pixel component
c2
third pixel component
c3
fourth pixel component
r
red component
g
green component
b
blue component
a
alpha component
The lutyuv filter requires YUV pixel formats in input, and accepts the
options:
y
Y/luminance component
u
U/Cb component
v
V/Cr component
a
alpha component
lutyuv="y=2*val"
# remove green and blue components
lutrgb="g=0:b=0"
# set a constant alpha channel value on input
format=rgba,lutrgb=a="maxval-minval/2"
# correct luminance gamma by a 0.5 factor
lutyuv=y=gammaval(0.5)
mp
Apply an MPlayer filter to the input video.
This filter provides a wrapper around most of the filters of
MPlayer/MEncoder.
This wrapper is considered experimental. Some of the wrapped filters
may not work properly and we may drop support for them, as they will be
implemented natively into FFmpeg. Thus you should avoid depending on
them when writing portable scripts.
The filter accepts the parameters: filter_name[:=]filter_params
filter_name is the name of a supported MPlayer filter, filter_params is
a string containing the parameters accepted by the named filter.
The list of the currently supported filters follows:
2xsai
decimate
denoise3d
detc
dint
divtc
down3dright
dsize
eq2
eq
field
fil
fixpts
framestep
fspp
geq
harddup
hqdn3d
hue
il
ilpack
ivtc
kerndeint
mcdeint
mirror
noise
ow
palette
perspective
phase
pp7
pullup
qp
rectangle
remove-logo
rotate
sab
screenshot
smartblur
softpulldown
softskip
spp
swapuv
telecine
tile
tinterlace
unsharp
uspp
yuvcsp
yvu9
The parameter syntax and behavior for the listed filters are the same
as those of the corresponding MPlayer filters. For detailed
instructions check the "VIDEO FILTERS" section in the MPlayer manual.
Some examples follow:
# remove a logo by interpolating the surrounding pixels
mp=delogo=200:200:80:20:1
# adjust gamma, brightness, contrast
mp=eq2=1.0:2:0.5
# tweak hue and saturation
mp=hue=100:-10
See also mplayer(1), <http://www.mplayerhq.hu/>.
negate
Negate input video.
This filter accepts an integer as input; if non-zero, it negates the
alpha component (if available). The default value is 0.
noformat
Force libavfilter not to use any of the specified pixel formats for the
input to the next filter.
The filter accepts a list of pixel format names, separated by ":", for
example "yuv420p:monow:rgb24".
Some examples follow:
# force libavfilter to use a format different from "yuv420p" for the
# input to the vflip filter
noformat=yuv420p,vflip
# convert the input video to any of the formats not contained in the list
noformat=yuv420p:yuv444p:yuv410p
null
Pass the video source unchanged to the output.
ocv
Apply video transform using libopencv.
To enable this filter install libopencv library and headers and
configure FFmpeg with "--enable-libopencv".
The filter takes the parameters: filter_name{:=}filter_params.
filter_name is the name of the libopencv filter to apply.
filter_params specifies the parameters to pass to the libopencv filter.
If not specified the default values are assumed.
Refer to the official libopencv documentation for more precise
information:
<http://opencv.willowgarage.com/documentation/c/image_filtering.html>
The list of supported libopencv filters follows.
dilate
Dilate an image by using a specific structuring element.
This filter corresponds to the libopencv function "cvDilate".
to a bright pixel. When a custom shape is used, cols and rows are
ignored; the number of columns and rows of the read file is assumed
instead.
The default value for struct_el is "3x3+0x0/rect".
nb_iterations specifies the number of times the transform is applied to
the image, and defaults to 1.
Some examples follow:
# use the default values
ocv=dilate
# dilate using a structuring element with a 5x5 cross, iterate two times
ocv=dilate=5x5+2x2/cross:2
# read the shape from the file diamond.shape, iterate two times
# the file diamond.shape may contain a pattern of characters like this:
#   *
#  ***
# *****
#  ***
#   *
# the specified cols and rows are ignored (but not the anchor point coordinates)
ocv=dilate=0x0+2x2/custom=diamond.shape:2
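The struct_el strings in the examples above can be read as a column and
row count, an anchor point, and a shape. The Python sketch below shows
that reading. The function name parse_struct_el is hypothetical, the
field layout is inferred only from the default value and the examples in
this section, and only the shapes that appear here (rect, cross,
custom=filename) are handled. It is not the parser used by libavfilter.

```python
import re

# Layout inferred from the examples: COLSxROWS+ANCHOR_XxANCHOR_Y/SHAPE
_STRUCT_EL = re.compile(r"^(\d+)x(\d+)\+(\d+)x(\d+)/(rect|cross|custom=(.+))$")


def parse_struct_el(text="3x3+0x0/rect"):
    """Split a struct_el string into its parts (illustrative only)."""
    match = _STRUCT_EL.match(text)
    if match is None:
        raise ValueError("unrecognized struct_el: %r" % text)
    cols, rows, anchor_x, anchor_y = (int(g) for g in match.group(1, 2, 3, 4))
    filename = match.group(6)  # set only for custom=FILENAME
    shape = "custom" if filename is not None else match.group(5)
    return {
        "cols": cols,
        "rows": rows,
        "anchor": (anchor_x, anchor_y),
        "shape": shape,
        "filename": filename,
    }


print(parse_struct_el("5x5+2x2/cross"))
print(parse_struct_el("0x0+2x2/custom=diamond.shape"))
```

For a custom shape the parsed cols and rows are kept as written, although
the text above says the filter takes them from the shape file instead.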
erode
Erode an image by using a specific structuring element.
This filter corresponds to the libopencv function "cvErode".
movie=logo2.png [logo2];
[in][logo1] overlay=10:H-h-10 [in+logo1];
[in+logo1][logo2] overlay=W-w-10:H-h-10 [out]
# add a transparent color layer on top of the main video,
# WxH specifies the size of the main input to the overlay filter
color=red@.3:WxH [over]; [in][over] overlay [out]
You can chain together more overlays, but the efficiency of such an
approach is yet to be tested.
pad
Add padding to the input image, and place the original input at the
given coordinates x, y.
It accepts the following parameters: width:height:x:y:color.
The parameters width, height, x, and y are expressions containing the
following constants:
in_w, in_h
the input video width and height
iw, ih
same as in_w and in_h
out_w, out_h
the output width and height, that is the size of the padded area as
specified by the width and height expressions
ow, oh
same as out_w and out_h
x, y
x and y offsets as specified by the x and y expressions, or NAN if
not yet specified
a
same as iw / ih
The width expression can reference the value set by the height
expression, and vice versa.
The default value of width and height is 0.
x, y
Specify the offsets where to place the input image in the padded
area with respect to the top/left border of the output image.
The x expression can reference the value set by the y expression,
and vice versa.
The default value of x and y is 0.
color
Specify the color of the padded area, it can be the name of a color
(case insensitive match) or a 0xRRGGBB[AA] sequence.
The default value of color is "black".
Some examples follow:
# Add paddings with color "violet" to the input video. Output video
# size is 640x480, the top-left corner of the input video is placed at
# column 0, row 40.
pad=640:480:0:40:violet
# pad the input to get an output with dimensions increased by 3/2,
# and put the input video at the center of the padded area
pad="3/2*iw:3/2*ih:(ow-iw)/2:(oh-ih)/2"
# pad the input to get a squared output with size equal to the maximum
# value between the input width and height, and put the input video at
# the center of the padded area
pad="max(iw,ih):ow:(ow-iw)/2:(oh-ih)/2"
# pad the input to get a final w/h ratio of 16:9
pad="ih*16/9:ih:(ow-iw)/2:(oh-ih)/2"
# for anamorphic video, in order to set the output display aspect ratio,
# it is necessary to use sar in the expression, according to the relation:
# (ih * X / ih) * sar = output_dar
# X = output_dar / sar
pad="ih*16/9/sar:ih:(ow-iw)/2:(oh-ih)/2"
# double output size and put the input video in the bottom-right
# corner of the output padded area
pad="2*iw:2*ih:ow-iw:oh-ih"
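To see what the centering expressions above work out to for a concrete
input, the following Python sketch evaluates the 16:9 example
"ih*16/9:ih:(ow-iw)/2:(oh-ih)/2" by hand. The function name
pad_to_aspect is hypothetical, and truncating fractional results with
integer division is an assumption made for this illustration; the
filter's own rounding may differ.

```python
def pad_to_aspect(iw, ih, num=16, den=9):
    """Return (ow, oh, x, y) for pad="ih*num/den:ih:(ow-iw)/2:(oh-ih)/2".

    Integer division is an assumption of this sketch, not a statement
    about how the pad filter rounds.
    """
    ow = ih * num // den   # width expression
    oh = ih                # height expression
    x = (ow - iw) // 2     # horizontal offset that centers the input
    y = (oh - ih) // 2     # vertical offset that centers the input
    return ow, oh, x, y


print(pad_to_aspect(640, 480))  # (853, 480, 106, 0)
print(pad_to_aspect(720, 540))  # (960, 540, 120, 0)
```

Because the output height equals the input height, y is always 0 here
and all of the padding is split between the left and right borders.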
pixdesctest
Pixel format descriptor test filter, mainly useful for internal
testing. The output video should be equal to the input video.
For example:
format=monow, pixdesctest
can be used to test the monowhite pixel format descriptor definition.
scale
Scale the input video to width:height[:interl={1|-1}] and/or convert
the image format.
The parameters width and height are expressions containing the
following constants:
in_w, in_h
the input width and height
iw, ih
same as in_w and in_h
out_w, out_h
the output (cropped) width and height
ow, oh
same as out_w and out_h
a
same as iw / ih
selected_n
the sequential number of the selected frame, starting from 0
prev_selected_n
the sequential number of the last selected frame, NAN if undefined
TB
t
the PTS (Presentation TimeStamp) of the filtered video frame,
expressed in seconds, NAN if undefined
prev_pts
the PTS of the previously filtered video frame, NAN if undefined
prev_selected_pts
the PTS of the last previously selected video frame, NAN if
undefined
prev_selected_t
the time of the last previously selected video frame, NAN if
undefined
start_pts
the PTS of the first video frame in the video, NAN if undefined
start_t
the time of the first video frame in the video, NAN if undefined
pict_type
the type of the filtered frame, can assume one of the following
values:
I
P
B
S
SI
SP
BI
interlace_type
the frame interlace type, can assume one of the following values:
PROGRESSIVE
the frame is progressive (not interlaced)
TOPFIRST
the frame is top-field-first
BOTTOMFIRST
the frame is bottom-field-first
key 1 if the filtered frame is a key-frame, 0 otherwise
pos the position in the file of the filtered frame, -1 if the
information is not available (e.g. for synthetic video)
The default value of the select expression is "1".
Some examples follow:
Accept as input an expression evaluated through the eval API, which can
contain the following constants:
PTS the presentation timestamp in input
N
STARTPTS
the PTS of the first video frame
INTERLACED
tell if the current frame is interlaced
POS original position in the file of the frame, or undefined if
undefined for the current frame
PREV_INPTS
previous input PTS
PREV_OUTPTS
previous output PTS
Some examples follow:
# start counting PTS from zero
setpts=PTS-STARTPTS
# fast motion
setpts=0.5*PTS
# slow motion
setpts=2.0*PTS
# fixed rate 25 fps
setpts=N/(25*TB)
# fixed rate 25 fps with some jitter
setpts='1/(25*TB) * (N + 0.05 * sin(N*2*PI/25))'
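The fixed-rate expression N/(25*TB) can be checked by hand. The Python
sketch below evaluates it with exact fractions, assuming that TB is the
timebase in seconds per timestamp tick and that N counts frames from 0.
The function name fixed_rate_pts and the 1/90000 timebase are
hypothetical values chosen for the illustration only.

```python
from fractions import Fraction


def fixed_rate_pts(n, fps=25, tb=Fraction(1, 90000)):
    """Evaluate N/(fps*TB) exactly; tb is seconds per timestamp tick."""
    return Fraction(n) / (fps * tb)


for n in range(3):
    # With tb = 1/90000 each frame advances the PTS by 3600 ticks.
    print(n, fixed_rate_pts(n))
```

Multiplying the result by the timebase gives the time in seconds, so
frame 25 lands at exactly one second under these assumptions.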
setsar
Set the Sample (aka Pixel) Aspect Ratio for the filter output video.
Note that as a consequence of the application of this filter, the
output display aspect ratio will change according to the following
equation: DAR = HORIZONTAL_RESOLUTION / VERTICAL_RESOLUTION * SAR
Keep in mind that the sample aspect ratio set by this filter may be
changed by later filters in the filterchain, e.g. if another "setsar"
or a "setdar" filter is applied.
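The equation above is easy to evaluate exactly. The following Python
sketch does so with rational arithmetic; the function name
display_aspect_ratio and the sample sizes are hypothetical and serve
only to illustrate the relation.

```python
from fractions import Fraction


def display_aspect_ratio(width, height, sar):
    """DAR = HORIZONTAL_RESOLUTION / VERTICAL_RESOLUTION * SAR."""
    return Fraction(width, height) * Fraction(sar)


# A 720x576 frame with a sample aspect ratio of 16:15 displays as 4:3.
print(display_aspect_ratio(720, 576, Fraction(16, 15)))  # 4/3
# The same frame with a sample aspect ratio of 64:45 displays as 16:9.
print(display_aspect_ratio(720, 576, Fraction(64, 45)))  # 16/9
```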
The filter accepts a parameter string which represents the wanted
sample aspect ratio. The parameter can be a floating point number
string, or an expression of the form num:den, where num and den are the
It is mainly
i
interlaced mode ("P" for "progressive", "T" for top field first,
"B" for bottom field first)
iskey
1 if the frame is a key frame, 0 otherwise
type
picture type of the input frame ("I" for an I-frame, "P" for a
P-frame, "B" for a B-frame, "?" for unknown type). Check also the
documentation of the "AVPictureType" enum and of the
"av_get_picture_type_char" function defined in libavutil/avutil.h.
checksum
Adler-32 checksum (printed in hexadecimal) of all the planes of the
input frame
plane_checksum
Adler-32 checksum (printed in hexadecimal) of each plane of the
input frame, expressed in the form "[c0 c1 c2 c3]"
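As a rough illustration of the two checksum fields, the Python sketch
below computes Adler-32 values with the standard zlib module and prints
them in hexadecimal using the bracketed form described above. The
function names are hypothetical. zlib seeds Adler-32 with 1, and the
seed, the handling of line padding, and the exact hexadecimal formatting
used by the filter are not stated here, so the values are not guaranteed
to match the filter's output.

```python
import zlib


def adler32_hex(data):
    """Adler-32 of a bytes object as 8 uppercase hex digits (zlib seed)."""
    return "%08X" % (zlib.adler32(data) & 0xFFFFFFFF)


def format_plane_checksums(planes):
    """Render one checksum per plane in the form "[c0 c1 c2 c3]"."""
    return "[" + " ".join(adler32_hex(p) for p in planes) + "]"


# Stand-in byte strings, not real video planes.
planes = [b"Wikipedia", b""]
print(adler32_hex(planes[0]))          # 11E60398
print(format_plane_checksums(planes))  # [11E60398 00000001]
```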
slicify
Pass the images of input video on to next video filter as multiple
slices.
ffmpeg -i in.avi -vf "slicify=32" out.avi
The filter accepts the slice height as parameter. If the parameter is
not specified it will use the default value of 16.
Adding this in the beginning of filter chains should make filtering
faster due to better use of the memory cache.
split
Pass on the input video to two outputs. Both outputs are identical to
the input video.
For example:
[in] split [splitout1][splitout2];
[splitout1] crop=100:100:0:0 [cropout];
[splitout2] pad=200:200:100:100 [padout];
will create two separate outputs from the same input, one cropped and
one padded.
swapuv
Swap U & V plane.
thumbnail
Select the most representative frame in a given sequence of consecutive
frames.
It accepts as argument the frames batch size to analyze (default
N=100); in a set of N frames, the filter will pick one of them, and
then handle the next batch of N frames until the end.
Since the filter keeps track of the whole frames sequence, a bigger N
value will result in a higher memory usage, so a high value is not
recommended.
The following example extracts one picture every 50 frames:
thumbnail=50
Complete example of a thumbnail creation with ffmpeg:
ffmpeg -i in.avi -vf thumbnail,scale=300:200 -frames:v 1 out.png
tinterlace
Perform various types of temporal field interlacing.
Frames are counted starting from 1, so the first input frame is
considered odd.
This filter accepts a single parameter specifying the mode. Available
modes are:
0
Move odd frames into the upper field, even into the lower field,
generating a double height frame at half framerate.
1
Only output even frames, odd frames are dropped, generating a frame
with unchanged height at half framerate.
2
Only output odd frames, even frames are dropped, generating a frame
with unchanged height at half framerate.
3
Expand each frame to full height, but pad alternate lines with
black, generating a frame with double height at the same input
framerate.
4
Interleave the upper field from odd frames with the lower field
from even frames, generating a frame with unchanged height at half
framerate.
5
Interleave the lower field from odd frames with the upper field
unsharp
Sharpen or blur the input video.
It accepts the following parameters:
luma_msize_x:luma_msize_y:luma_amount:chroma_msize_x:chroma_msize_y:chroma_amount
Negative values for the amount will blur the input video, while
positive values will sharpen. All parameters are optional and default
to the equivalent of the string 5:5:1.0:5:5:0.0.
luma_msize_x
Set the luma matrix horizontal size. It can be an integer between 3
and 13, default value is 5.
luma_msize_y
Set the luma matrix vertical size. It can be an integer between 3
and 13, default value is 5.
luma_amount
Set the luma effect strength. It can be a float number between -2.0
and 5.0, default value is 1.0.
chroma_msize_x
Set the chroma matrix horizontal size. It can be an integer between
3 and 13, default value is 5.
chroma_msize_y
Set the chroma matrix vertical size. It can be an integer between 3
and 13, default value is 5.
chroma_amount
Set the chroma effect strength. It can be a float number between
-2.0 and 5.0, default value is 0.0.
# Strong luma sharpen effect parameters
unsharp=7:7:2.5
# Strong blur of both luma and chroma parameters
unsharp=7:7:-2:7:7:-2
# Use the default values with ffmpeg
ffmpeg -i in.avi -vf "unsharp" out.mp4
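The parameter ranges listed above can be checked before a filter string
is built. The Python sketch below is one hypothetical way to do that.
The function name unsharp_args is an assumption, and the checks encode
only the ranges stated in this section (matrix sizes from 3 to 13,
amounts from -2.0 to 5.0), not any further constraint the filter itself
may impose.

```python
def unsharp_args(luma_msize_x=5, luma_msize_y=5, luma_amount=1.0,
                 chroma_msize_x=5, chroma_msize_y=5, chroma_amount=0.0):
    """Build an "unsharp=..." string after checking the documented ranges."""
    sizes = {
        "luma_msize_x": luma_msize_x,
        "luma_msize_y": luma_msize_y,
        "chroma_msize_x": chroma_msize_x,
        "chroma_msize_y": chroma_msize_y,
    }
    for name, value in sizes.items():
        if not isinstance(value, int) or not 3 <= value <= 13:
            raise ValueError("%s must be an integer between 3 and 13" % name)
    amounts = {"luma_amount": luma_amount, "chroma_amount": chroma_amount}
    for name, value in amounts.items():
        if not -2.0 <= value <= 5.0:
            raise ValueError("%s must be between -2.0 and 5.0" % name)
    return "unsharp=%d:%d:%s:%d:%d:%s" % (
        luma_msize_x, luma_msize_y, luma_amount,
        chroma_msize_x, chroma_msize_y, chroma_amount,
    )


print(unsharp_args())                # unsharp=5:5:1.0:5:5:0.0
print(unsharp_args(7, 7, 2.5))       # unsharp=7:7:2.5:5:5:0.0
```

Calling the function with no arguments reproduces the default string
given above.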
vflip
Flip the input video vertically.
ffmpeg -i in.avi -vf "vflip" out.avi
yadif
Deinterlace the input video ("yadif" means "yet another deinterlacing
filter").
It accepts the optional parameters: mode:parity:auto.
mode specifies the interlacing mode to adopt, accepts one of the
following values:
0
Default value is 0.
parity specifies the picture field parity assumed for the input
interlaced video, accepts one of the following values:
-1
Default value is 0.
VIDEO SOURCES
Below is a description of the currently available video sources.
buffer
Buffer video frames, and make them available to the filter chain.
This source is mainly intended for a programmatic use, in particular
through the interface defined in libavfilter/vsrc_buffer.h.
It accepts the following parameters:
width:height:pix_fmt_string:timebase_num:timebase_den:sample_aspect_ratio_num:sample_aspect_ratio.den:scale_params
All the parameters but scale_params need to be explicitly defined.
The list of accepted parameters follows.
width, height
Specify the width and height of the buffered video frames.
pix_fmt_string
A string representing the pixel format of the buffered video
frames. It may be a number corresponding to a pixel format, or a
pixel format name.
timebase_num, timebase_den
Specify the numerator and denominator of the timebase assumed by the
timestamps of the buffered frames.
sample_aspect_ratio.num, sample_aspect_ratio.den
Specify numerator and denominator of the sample aspect ratio
assumed by the video frames.
scale_params
This is
Examples
Read the initial state from pattern, and specify an output of size
200x400.
cellauto=f=pattern:s=200x400
color
Provide a uniformly colored input.
It accepts the following parameters: color:frame_size:frame_rate
The description of the accepted parameters follows.
color
Specify the color of the source. It can be the name of a color
(case insensitive match) or a 0xRRGGBB[AA] sequence, possibly
followed by an alpha specifier. The default value is "black".
frame_size
Specify the size of the sourced video, it may be a string of the
form widthxheight, or the name of a size abbreviation. The default
value is "320x240".
frame_rate
Specify the frame rate of the sourced video, as the number of
frames generated per second. It has to be a string in the format
frame_rate_num/frame_rate_den, an integer number, a float number or
a valid video frame rate abbreviation. The default value is "25".
For example, the following graph description will generate a red source
with an opacity of 0.2, with size "qcif" and a frame rate of 10 frames
per second, which will be overlaid over the source connected to the
pad with identifier "in".
"color=red@0.2:qcif:10 [color]; [in][color] overlay [out]"
movie
Read a video stream from a movie container.
It accepts the syntax: movie_name[:options] where movie_name is the
name of the resource to read (not necessarily a file but also a device
or a stream accessed through some protocol), and options is an optional
sequence of key=value pairs, separated by ":".
The description of the accepted options follows.
format_name, f
Specifies the format assumed for the movie to read, and can be
either the name of a container or an input device. If not specified
This source
[-]HH[:MM[:SS[.m...]]]
[-]S+[.m...]
See also the function "av_parse_time()".
If not specified, or the expressed duration is negative, the video
is supposed to be generated forever.
test, t
Set the number or the name of the test to perform. Supported tests
are:
dc_luma
dc_chroma
freq_luma
freq_chroma
amp_luma
amp_chroma
cbp
mv
ring1
ring2
all
Default value is "all", which will cycle through the list of all
tests.
For example the following:
testsrc=t=dc_luma
will generate a "dc_luma" test pattern.
frei0r_src
Provide a frei0r source.
To enable compilation of this filter you need to install the frei0r
header and configure FFmpeg with "--enable-frei0r".
The source supports the syntax:
<size>:<rate>:<src_name>[{=|:}<param1>:<param2>:...:<paramN>]
size is the size of the video to generate, may be a string of the form
widthxheight or a frame size abbreviation. rate is the rate of the
video to generate, may be a string of the form num/den or a frame rate
abbreviation. src_name is the name of the frei0r source to load. For
more information regarding frei0r and how to set the parameters read
the section frei0r in the description of the video filters.
Some examples follow:
# generate a frei0r partik0l source with size 200x200 and frame rate 10
life=ratio=2/3:s=200x200
decimals, n
Set the number of decimals to show in the timestamp, only used in
the "testsrc" source.
The displayed timestamp value will correspond to the original
timestamp value multiplied by the power of 10 of the specified
value. Default value is 0.
For example the following:
testsrc=duration=5.3:size=qcif:rate=10
will generate a video with a duration of 5.3 seconds, with size 176x144
and a frame rate of 10 frames per second.
If the input content is to be ignored, "nullsrc" can be used. The
following command generates noise in the luminance plane by employing
the "mp=geq" filter:
nullsrc=s=256x256, mp=geq=random(1)*255:128:128
VIDEO SINKS
Below is a description of the currently available video sinks.
buffersink
Buffer video frames, and make them available to the end of the filter
graph.
This sink is mainly intended for a programmatic use, in particular
through the interface defined in libavfilter/buffersink.h.
It does not require a string parameter in input, but you need to
specify a pointer to a list of supported pixel formats terminated by -1
in the opaque parameter provided to "avfilter_init_filter" when
initializing this sink.
nullsink
Null video sink, do absolutely nothing with the input video. It is
mainly useful as a template and to be employed in analysis / debugging
tools.
METADATA
FFmpeg is able to dump metadata from media files into a simple
UTF-8-encoded INI-like text file and then load it back using the
metadata muxer/demuxer.
The file format is as follows:
1. A file consists of a header and a number of metadata tags divided
into sections, each on its own line.
2. The header is a ;FFMETADATA string, followed by a version number
(now 1).
3.
4.
2014-08-30
FFMPEG(1)