The two tricky bits are the semicolon at the end of the line and the \n, which adds a newline (line feed). If you have
a relatively new version of perl, you can use say instead of print to have the newline added automatically:
Version ≥ 5.10.0
use feature 'say';
say "Hello World";
The say feature is also enabled automatically with a use v5.10 (or higher) declaration:
use v5.10;
It's pretty common to just use perl on the command line using the -e option:
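A typical invocation looks like this (a minimal sketch):
perl -e 'print "Hello World\n"'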
You can also, of course, save the script in a file. Just remove the -e command line option and use the filename of
the script: perl script.pl. For programs longer than a line, it's wise to turn on a couple of options:
use strict;
use warnings;
There's no real disadvantage other than making the code slightly longer. In exchange, the strict pragma prevents
you from using code that is potentially unsafe and warnings notifies you of many common errors.
Notice the line-ending semicolon is optional for the last line, but is a good idea in case you later add to the end of
your code.
For more options how to run Perl, see perlrun or type perldoc perlrun at a command prompt. For a more detailed
introduction to Perl, see perlintro or type perldoc perlintro at a command prompt. For a quirky interactive
tutorial, Try Perl.
# This is a comment
=begin comment
This is a multi-line comment: everything from =begin comment
down to =cut is treated as POD and ignored by the interpreter.
=end comment
=cut
my $integer = 3; # number
my $string = "Hello World"; # string
my $reference = \$string; # reference to $string
Perl converts between numbers and strings on the fly, based on what a particular operator expects.
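For example (a small sketch; the variable names are mine, but the last line is the one the next paragraph refers to):
my $text  = 10 . ' apples';    # "10 apples" - the number 10 is used as a string
my $count = '20 apples' + 0;   # 20 - the string is used as a number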
When converting a string into a number, Perl takes as many digits from the front of a string as it can – hence why 20
apples is converted into 20 in the last line.
Based on whether you want to treat the contents of a scalar as a string or a number, you need to use different
operators. Do not mix them.
Attempting to use string operations on numbers will not raise warnings; attempting to use number operations on
non-numeric strings will. Do be aware that some non-digit strings such as 'inf', 'nan', '0 but true' count as
numbers.
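For example (a short sketch with made-up values):
my $x = 10;
my $y = '10';
print "numerically equal\n" if $x == $y;    # numeric comparison
print "string equal\n"      if $x eq $y;    # string comparison
print $x . $y, "\n";                        # 1010 (string concatenation)
print $x + $y, "\n";                        # 20   (numeric addition)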
my $other_array_reference = ["Hello"];
use v5.24;
my @contents = $array_reference->@*; # New postfix notation
When accessing an arrayref's contents by index you can use the -> syntactical sugar.
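For example (assuming the $array_reference from above):
my $first      = $array_reference->[0];       # element at index 0
my $also_first = ${ $array_reference }[0];    # equivalent, but noisier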
my $value = "Hello";
my $reference = \$value;
print $value; # => Hello
print $reference; # => SCALAR(0x2683310)
use v5.24;
say $reference->$*; # New postfix notation
This "de-referenced value" can then be changed like it was the original variable.
${$reference} =~ s/Hello/World/;
print ${$reference}; # => World
print $value; # => World
You want to pass a string to a function, and have it modify that string for you without it being a return value.
You wish to explicitly avoid Perl implicitly copying the contents of a large string at some point in your function
passing ( especially relevant on older Perls without copy-on-write strings )
You wish to disambiguate string-like values with specific meaning, from strings that convey content, for
example:
You wish to implement a lightweight inside out object model, where objects handed to calling code don't
carry user visible metadata:
our %objects;
my $next_id = 0;
sub new {
    my $object_id = $next_id++;
    $objects{ $object_id } = { ... };    # Assign data for object
    my $ref = \$object_id;
    return bless( $ref, "MyClass" );
}
# Use negative indices to count from the end (with -1 being last)
my $last_char_of_hello = $chars_of_hello[-1];
# You can use $# to get the last index of an array, and confuse Stack Overflow
my $last_index_of_array = $#chars_of_hello; # 4
# You can also access multiple elements of an array at the same time
# This is called "array slice"
# Since this returns multiple values, the sigil to use here on the RHS is @
my @some_chars_of_hello = @chars_of_hello[1..3]; # ('e', 'l', 'l')
my @out_of_order_chars = @chars_of_hello[1,4,2]; # ('e', 'o', 'l')
# In Python you can say array[1:-1] to get all elements but first and last
# Not so in Perl: (1..-1) is an empty list. Use $# instead
my @empty_list = @chars_of_hello[1..-1]; # ()
my @inner_chars_of_hello = @chars_of_hello[1..$#chars_of_hello-1]; # ('e','l','l')
# Setting elements beyond the end of an array does not result in an error
# The array is extended with undef's as necessary. This is "autovivification."
my @array; # ()
$array[3] = 'x'; # (undef, undef, undef, 'x')
When used as booleans, arrays are true if they are not empty.
Typeglobs are more commonly handled when dealing with files. open, for example, produces a reference to a
typeglob when asked to create a non-global filehandle:
# You can dereference this globref, but it's not very useful.
say ref $log; # GLOB
say (*{$log}->{IO} // 'undef'); # undef
# use constant instead defines a parameterless function, therefore it's not global,
# can be used without sigils, can be imported, but does not interpolate easily.
use constant (FALSE => 0);
say FALSE; # 0
say &FALSE; # 0
say "${\FALSE}"; # 0 (ugh)
say *FALSE{CODE}; # CODE(0xMA1DBABE)
# Of course, neither is truly constant when you can manipulate the symbol table...
*TRUE = \('');
use constant (EVIL => 1);
*FALSE = *EVIL;
\@array; # \ returns the reference of what's on the right (so, a reference to @array)
$#array; # this is the index of the last element of @array
You can use braces after the sigil if you should be so inclined. Occasionally, this improves readability.
say ${value} = 5;
While you use different sigils to define variables of different types, the same variable can be accessed in different
ways based on what sigils you use.
This is especially true of references. In order to use a referenced value you can combine sigils together.
Here's a perhaps less confusing way to think about it. As we saw earlier, you can use braces to wrap what's on the
right of a sigil. So you can think of @{} as something that takes an array reference and gives you the referenced
array.
my $values = undef;
say pop @{ $values }; # ERROR: can't use undef as an array reference
say pop @{ $values // [5] }; # undef // [5] gives [5], so this prints 5
# This is not an example of good Perl. It is merely a demonstration of this language feature
my $hashref = undef;
for my $key ( %{ $hashref // {} } ) {
"This doesn't crash";
}
...but if the "argument" to a sigil is simple, you can leave the braces away.
say $$scalar_reference;
say pop @$array_reference;
for (keys %$hash_reference) { ... }
Things can get excessively extravagant. This works, but please Perl responsibly.
For most normal uses, you can just call subroutines by their name, without a sigil. (A name without a sigil is called a
bareword.) The & sigil is only useful in a limited number of cases.
Combined with goto, as a slightly unusual function call that has the current call frame replaced by the called
subroutine, as sketched below. Think of the Linux exec() system call, but for functions.
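A small sketch of this (the subroutine names here are hypothetical):
sub real_handler {
    print "handling: @_\n";
}
sub dispatch {
    # Replace the current call frame with a call to real_handler;
    # @_ is passed along unchanged, and the caller no longer sees dispatch in the stack.
    goto &real_handler;
}
dispatch('some', 'arguments');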
First, let's examine what happens when you pass a normal hash to a subroutine and modify it within there:
use strict;
use warnings;
use Data::Dumper;
sub modify
{
my %hash = @_;
$hash{new_value} = 2;
return;
}
my %example_hash = (
old_value => 1,
);
modify(%example_hash);
print Dumper \%example_hash;    # still contains only old_value
Notice that after we exit the subroutine, the hash remains unaltered; all changes to it were local to the modify
subroutine, because we passed a copy of the hash, not the hash itself.
In comparison, when you pass a hashref, you are passing the address to the original hash, so any changes made
within the subroutine will be made to the original hash:
use strict;
use warnings;
use Data::Dumper;
sub modify
{
    my $hashref = shift;
    $hashref->{new_value} = 2;
    return;
}

# Create a hashref
my $example_ref = {
    old_value => 1,
};

modify($example_ref);
print Dumper $example_ref;    # now contains both old_value and new_value
If you give the hash a known key, it will hand you back the corresponding value.
# You can save some typing and gain in clarity by using the "fat comma"
# syntactical sugar. It behaves like a comma and quotes what's on the left.
my %translations_of_hello = (spanish => 'Hola', german => 'Hallo', swedish => 'Hej');
In the following example, note the brackets and sigil: you access an element of %hash using $hash{key} because the
value you want is a scalar. Some consider it good practice to quote the key while others find this style visually noisy.
Quoting is only required for keys that could be mistaken for expressions like $hash{'some-key'}
my $greeting = $translations_of_hello{'spanish'};
While Perl by default treats a bareword key as a string, a leading + can be used to tell Perl that the key should not
be auto-quoted but evaluated, with the result of the evaluation used as the key:
print $employee{shift};     # uses the literal string 'shift' as the key
# but this one will execute shift, extracting the first element from @_,
# and use the result as the key
print $employee{+shift};
Like with arrays, you can access multiple hash elements at the same time. This is called a hash slice. The resulting
value is a list, so use the @ sigil:
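For example, using the %translations_of_hello hash from above:
my @greetings = @translations_of_hello{'spanish', 'german'};    # ('Hola', 'Hallo')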
Iterate over the keys of a hash with keys. keys returns the items in an apparently random order; combine it with sort if you need a predictable one.
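For example:
for my $lang (sort keys %translations_of_hello) {
    print "$translations_of_hello{$lang}\n";
}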
If you do not actually need the keys like in the previous example, values returns the hash's values directly:
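For example:
for my $translation (values %translations_of_hello) {
    print "$translation\n";
}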
You can also use a while loop with each to iterate over the hash. This way, you will get both the key and the value at
the same time, without a separate value lookup. Its use is, however, discouraged, as each can break in mystifying
ways.
# DISCOURAGED
while (my ($lang, $translation) = each %translations_of_hello) {
say $translation;
}
map and list flattening can be used to create hashes out of arrays. This is a popular way to create a 'set' of values,
e.g. to quickly check whether a value is in @elems. This operation usually takes O(n) time (i.e. proportional to the
number of elements) but can be done in constant time (O(1)) by turning the list into a hash:
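A minimal sketch of the idiom (the contents of @elems are made up):
my @elems   = qw(x y x z t);
my %is_elem = map { $_ => 1 } @elems;
print "x is in the list\n" if $is_elem{x};    # constant-time lookup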
This requires some explanation. The contents of @elems get read into a list, which is processed by map. map accepts a
code block that gets called for each value of its input list; the value of the element is available for use in $_. Our
code block returns two list elements for each input element: $_, the input element, and 1, just some value. Once you
account for list flattening, the outcome is that map { $_ => 1 } @elems turns qw(x y x z t) into (x => 1, y =>
1, x => 1, z => 1, t => 1).
As those elements get assigned into the hash, odd elements become hash keys and even elements become hash
values. When a key is specified multiple times in a list to be assigned to a hash, the last value wins. This effectively
discards duplicates.
The following application of hashes also exploits the fact that hashes and lists can often be used interchangeably to
implement named function args:
sub hash_args {
    my %args      = @_;
    my %defaults  = ( foo => 1, bar => 0 );
    my %overrides = ( __unsafe => 0 );
    my %settings  = ( %defaults, %args, %overrides );
}
When used as booleans, hashes are true if they are not empty.
my $name = 'Paul';
print "Hello, $name!\n"; # Hello, Paul!
use constant {
PI => '3.1415926'
};
print "I like PI\n"; # I like PI
print "I like " . PI . "\n"; # I like 3.1415926
\t horizontal tab
\n newline
\r return
\f form feed
\b backspace
\a alarm (bell)
\e escape
Interpolation of \n depends on the system where the program is running: it will produce the newline character(s)
according to the current system's conventions.
Perl does not interpolate \v, which means vertical tab in C and other languages.
or Unicode names:
Characters with codes from 0x00 to 0xFF in the native encoding may be written in a shorter form:
\x0a hexadecimal
\012 octal
\c@ chr(0)
\ca chr(1)
\cb chr(2)
...
\cz chr(26)
\c[ chr(27)
\c\ chr(28) # Cannot be used at the end of a string
# since backslash will interpolate the terminating quote
\c] chr(29)
\c^ chr(30)
\c_ chr(31)
\c? chr(127)
Interpretation of all escape sequences except for \N{...} may depend on the platform since they use platform-
and encoding-dependent codes.
my $name = 'Paul';
my $age = 64;
print "My name is $name.\nI am $age.\n"; # My name is Paul.
# I am 64.
But:
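Single quotes, by contrast, do not interpolate (a reconstruction of the missing counter-example):
print 'My name is $name.\n';    # My name is $name.\n - no interpolation, no newline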
You can use q{} (with any delimiter) instead of single quotes and qq{} instead of double quotes. For example,
q{I'm 64} allows you to use an apostrophe within a non-interpolated string (otherwise it would terminate the string).
The following two statements do the same thing, but in the first one you do not need to escape double quotes within the string:
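# (a reconstructed example; $age is from the snippet above)
print qq{"I am $age years old."};
print "\"I am $age years old.\"";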
If your variable name clashes with surrounding text, you can use the syntax ${var} to disambiguate:
my $decade = 80;
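For example:
print "The 19${decade}s were fun.\n";    # The 1980s were fun.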
use Time::Piece;
my $date = localtime->strftime('%m/%d/%Y');
print $date;
Output
07/26/2016
use DateTime;
my $dt = DateTime->now;    # the current date and time

my $year   = $dt->year;
my $month  = $dt->month;
my $day    = $dt->day;
my $hour   = $dt->hour;
my $minute = $dt->minute;
my $second = $dt->second;
Datetime subtraction:
my $dt2 = DateTime->new(
year => 2016,
month => 8,
day => 24,
);
my $duration = $dt2->subtract_datetime($dt1);    # $dt1: another DateTime object created earlier
print $duration->days;
my $start = time();
# ... the code you want to time ...
my $end = time();
printf "That took %d seconds\n", $end - $start;
If-Else Statements
if (EXPR) BLOCK
if (EXPR) BLOCK else BLOCK
if (EXPR) BLOCK elsif (EXPR) BLOCK ...
if (EXPR) BLOCK elsif (EXPR) BLOCK ... else BLOCK
For simple if-statements, the if can precede or succeed the code to be executed.
$number = 7;
if ($number > 4) { print "$number is greater than four!"; }
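The same check written with a postfix (statement modifier) if:
print "$number is greater than four!" if $number > 4;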
@numbers = 1..42;
for (my $i=0; $i <= $#numbers; $i++) {
print "$numbers[$i]\n";
}
The while loop evaluates the conditional before executing the associated block. So, sometimes the block is never
executed. For example, the following code would never be executed if the filehandle $fh was the filehandle for an
empty file, or if it was already exhausted before the conditional.
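A sketch of such a loop (assuming $fh is an already opened filehandle):
while (my $line = <$fh>) {
    print $line;
}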
The do/while and do/until loops, on the other hand, evaluate the conditional after each time the block is executed.
So, a do/while or a do/until loop is always executed at least once.
my $greeting_count = 0;
do {
    say "Hello";
    $greeting_count++;
} until ( $greeting_count > 1 );
You can access the arguments by using the special variable @_, which contains all arguments as an array.
sub function_name {
my ($arg1, $arg2, @more_args) = @_;
# ...
}
Since the function shift defaults to shifting @_ when used inside a subroutine, it's a common pattern to extract the
arguments sequentially into local variables at the beginning of a subroutine:
sub function_name {
my $arg1 = shift;
my $arg2 = shift;
my @more_args = @_;
# ...
}
sub {
my $arg1 = shift;
# ...
}->($arg);
Version ≥ 5.20.0
Alternatively, the experimental feature "signatures" can be used to unpack parameters, which are passed by value
(not by reference).
You can use any expression to give a default value to a parameter – including other parameters.
Note that you can't reference parameters which are defined after the current parameter – hence the following code
doesn't work quite as expected.
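A hedged sketch of the feature and of the caveat just mentioned (the subroutine and parameter names are mine):
use feature 'signatures';
no warnings 'experimental::signatures';
sub greet ($name, $greeting = 'Hello') {    # $greeting defaults to 'Hello'
    print "$greeting, $name!\n";
}
greet('Paul');             # Hello, Paul!
greet('Paul', 'Howdy');    # Howdy, Paul!
# A default may refer to earlier parameters, but not to later ones:
# sub pair ($x = $y, $y = 5) { ... }    # $y is not yet available when $x's default runs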
Some builtins such as print or say are keywords, not functions, so e.g. &say is undefined. This also means that
you can define subroutines with those names, but you will have to specify the package name to actually call them.
This should not be confused with prototypes, a facility Perl has to let you define functions that behave like built-ins.
Function prototypes must be visible at compile time and their effects can be ignored by specifying the & sigil.
Prototypes are generally considered to be an advanced feature that is best used with great care.
# This prototype makes it a compilation error to call this function with anything
# that isn't an array. Additionally, arrays are automatically turned into arrayrefs
sub receives_arrayrefs(\@\@) {
my $x = shift;
my $y = shift;
}
my @a = (1..3);
my @b = (1..4);
receives_arrayrefs(@a, @b); # okay, $x = \@a, $y = \@b, @_ = ();
receives_arrayrefs(\@a, \@b); # compilation error, "Type … must be array …"
BEGIN { receives_arrayrefs(\@a, \@b); }
sub edit {
    $_[0] =~ s/world/sub/;
}
To avoid clobbering your caller's variables, it is therefore important to copy @_ into locally scoped variables (my ...)
as early as possible in the subroutine.
use Data::Printer;
p $data_structure;
Data::Printer writes to STDERR, like warn. That makes it easier to find the output. By default, it sorts hash keys and
looks at objects.
use Data::Printer;
use LWP::UserAgent;
my $ua = LWP::UserAgent->new;
p $ua;
It will look at all the methods of the object, and also list the internals.
LWP::UserAgent {
Parents LWP::MemberMixin
public methods (45) : add_handler, agent, clone, conn_cache, cookie_jar, credentials,
default_header, default_headers, delete, env_proxy, from, get, get_basic_credentials,
get_my_handler, handlers, head, is_online, is_protocol_supported, local_address, max_redirect,
max_size, mirror, new, no_proxy, parse_head, post, prepare_request, progress, protocols_allowed,
protocols_forbidden, proxy, put, redirect_ok, remove_handler, request, requests_redirectable,
run_handlers, send_request, set_my_handler, show_progress, simple_request, ssl_opts, timeout,
use_alarm, use_eval
private methods (4) : _agent, _need_proxy, _new_response, _process_colonic_headers
internals: {
def_headers HTTP::Headers,
handlers {
response_header HTTP::Config
},
local_address undef,
max_redirect 7,
max_size undef,
no_proxy [],
protocols_allowed undef,
protocols_forbidden undef,
proxy {},
requests_redirectable [
[0] "GET",
[1] "HEAD"
],
show_progress undef,
ssl_opts {
verify_hostname 1
},
timeout 180,
use_eval 1
You can configure it further, so it serializes certain objects in a certain way, or to include objects up to an arbitrary
depth. The full configuration is available in the documentation.
Unfortunately Data::Printer does not ship with Perl, so you need to install it from CPAN or through your package
management system.
Using Data::Dumper is an easy way to look at data structures or variable content at run time. It ships with Perl and
you can load it easily. The Dumper function returns the data structure serialized in a way that looks like Perl code.
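A minimal usage sketch (the hash contents mirror the output shown below):
use Data::Dumper;
my %hash = ( foo => 'bar' );
print Dumper \%hash;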
$VAR1 = {
          'foo' => 'bar'
        };
That makes it very useful to quickly look at some values in your code. It's one of the most handy tools you have in
your arsenal. Read the full documentation on metacpan.
use strict;
use warnings;
use Data::Show;
my @array = ( 2, 4, 8, 16 );    # example data (not in the original snippet)
my %hash  = ( foo => 1, bar => { baz => 10, qux => 20 } );
my $href  = \%hash;
show @array;
show %hash;
show $href;
Using Data::Dumper gives an easy access to fetch list values. The Dumper returns the list values serialized in a way
that looks like Perl code.
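A reconstruction of the kind of call that produces the output below:
use Data::Dumper;
my @data = ( 123, 456, 789, 'poi', 'uyt', 'rew', 'qas' );
print Dumper @data;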
Output:
$VAR1 = 123;
$VAR2 = 456;
$VAR3 = 789;
$VAR4 = 'poi';
$VAR5 = 'uyt';
$VAR6 = 'rew';
$VAR7 = 'qas';
As suggested by user @dgw, when dumping arrays or hashes it is better to dump an array reference or a hash
reference; the output will then better reflect the structure of the input.
my $ref_data = [23,45,67,'mnb','vcx'];
print Dumper $ref_data;
Output:
$VAR1 = [
          23,
          45,
          67,
          'mnb',
          'vcx'
        ];
my @data_array = (23,45,67,'mnb','vcx');
print Dumper \@data_array;
Output:
$VAR1 = [
          23,
          45,
          67,
          'mnb',
          'vcx'
        ];
my @foo = ( 4, 5, 6 );
join '-', ( 4, 5, 6 );
join '-', @foo;
Some operators only work with arrays since they mutate the list an array contains:
shift @array;
unshift @array, ( 1, 2, 3 );
pop @array;
push @array, ( 7, 8, 9 );
The => is really only a special comma that automatically quotes the operand to its left. So, you could use normal
commas, but the relationship is not as clear:
You can also use quoted strings for the left hand operand of the fat comma =>, which is especially useful for keys
containing spaces.
Internally Perl makes aliases to those arguments and puts them into the array @_, which is available within the
subroutine:
sub test_subroutine {
    print $_[0];    # item1
    print $_[1];    # item2
}
test_subroutine( 'item1', 'item2' );
Aliasing gives you the ability to change the original value of argument passed to subroutine:
sub test_subroutine {
$_[0] += 2;
}
my $x = 7;
test_subroutine( $x );
print $x; # 9
To prevent inadvertent changes of original values passed into your subroutine, you should copy them:
sub test_subroutine {
my( $copy_arg1, $copy_arg2 ) = @_;
$copy_arg1 += 2;
}
my $x = 7;
test_subroutine $x; # in this case $copy_arg2 will have `undef` value
print $x; # 7
To test how many arguments were passed into the subroutine, check the size of @_
sub test_subroutine {
print scalar @_, ' argument(s) passed into subroutine';
}
If you pass array arguments into a subroutine they all will be flattened:
my @x = ( 1, 2, 3 );
my @y = qw/ a b c /; # ( 'a', 'b', 'c' )
test_some_subroutine @x, 'hi', @y; # 7 argument(s) passed into subroutine
# @_ = ( 1, 2, 3, 'hi', 'a', 'b', 'c' ) # Done internally for this call
If your test_some_subroutine contains the statement $_[4] = 'd', for the above call it will cause $y[0] to have
value d afterwards:
print "@y"; # d b c
sub foo {
    my @list1 = ( 1, 2, 3 );
    my @list2 = ( 4, 5 );
    return ( @list1, @list2 );
}
But it is not the recommended way to do that unless you know what you are doing.
While this is OK when the result is in LIST context, in SCALAR context things are unclear. Let's take a look at the
next line:
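Something like this (a reconstruction):
my $result = foo();    # $result is 2 - probably not what you expected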
1. Because foo() evaluated in SCALAR context, this list ( @list1, @list2 ) also evaluated in SCALAR context
2. In SCALAR context, LIST returns its last element. Here it is @list2
3. Again in SCALAR context, array @list2 returns the number of its elements. Here it is 2.
In most cases the right strategy will return references to data structures.
So in our case we should do the following instead:
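A sketch of that version:
sub foo {
    my @list1 = ( 1, 2, 3 );
    my @list2 = ( 4, 5 );
    return ( \@list1, \@list2 );
}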
Then the caller does something like this to receive the two returned arrayrefs:
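For instance:
my ( $list1_ref, $list2_ref ) = foo();
my @all = ( @$list1_ref, @$list2_ref );    # ( 1, 2, 3, 4, 5 )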
It is guaranteed that each key-value pair stays together: keys always end up at even indices and their values at the
following odd ones. It is not guaranteed, however, that the key-value pairs are always flattened in the same order:
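For example:
my %h    = ( a => 1, b => 2 );
my @flat = %h;    # ('a', 1, 'b', 2) or ('b', 2, 'a', 1) - pairs stay together, their order may not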
xyz(\@foo, 123);
...
sub xyz {
    my ($arr, $etc) = @_;
    print $arr->[0];    # using the first item in $arr; it is like $foo[0]
}
The three examples above do exactly the same thing. If you don't supply any comparator function or block, sort
assumes you want the list on its right sorted lexically. This is usually the form you want if you just need your data in
some predictable order and don't care about linguistic correctness.
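For reference, the three equivalent forms referred to look roughly like this (a sketch; the data is made up):
my @list         = qw(pear Apple banana);
my @sorted_plain = sort @list;
my @sorted_block = sort { $a cmp $b } @list;
sub lexically { $a cmp $b }
my @sorted_named = sort lexically @list;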
sort passes pairs of items in @list to the comparator function, which tells sort which item is larger. The cmp
operator does this for strings while <=> does the same thing for numbers. The comparator is called quite often, on
average n * log(n) times with n being the number of elements to be sorted, so it's important it be fast. This is the
reason sort uses predefined package global variables ($a and $b) to pass the elements to be compared to the block
or function, instead of proper function parameters.
If you use locale, cmp takes locale specific collation order into account, e.g. it will sort Å like A under a Danish locale
but after Z under an English or German one. However, it doesn't take the more complex Unicode sorting rules into
account nor does it offer any control over the order—for example phone books are often sorted differently from
dictionaries. For those cases, the Unicode::Collate and particularly Unicode::Collate::Locale modules are
recommended.
The trouble with the first example is that the comparator is called very often and keeps recalculating values using a
slow function over and over. A typical example would be sorting file names by their file size:
use File::stat;
@sorted = sort { stat($a)->size <=> stat($b)->size } glob "*";
This works, but at best it incurs the overhead of two system calls per comparison, at worst it has to go to the disk,
twice, for every single comparison, and that disk may be in an overloaded file server on the other side of the planet.
The Schwartzian Transform basically shoves @list through three functions, bottom-to-top. The first map turns each
entry into a two-element list of the original item and the result of the slow function as a sort key, so at the end of
this we have called slow() exactly once for each element. The following sort can then simply access the sort key by
looking in the list. As we don't care about the sort keys but only need the original elements in sorted order, the final
map throws away the two-element lists from the already-sorted list it receives from sort and returns a list of only
their first members.
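A sketch of the transform itself (slow() and @list stand in for the expensive key function and the input):
sub slow { length $_[0] }                      # stand-in for the real expensive function
my @list = qw(pear fig banana);
my @sorted = map  { $_->[0] }                  # 3. keep only the original element
             sort { $a->[1] <=> $b->[1] }      # 2. sort by the precomputed key
             map  { [ $_, slow($_) ] }         # 1. call slow() exactly once per element
             @list;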
A related trick is case-insensitive sorting, comparing uc($a) with uc($b) (or lc with lc). This works on all versions of
Perl 5 and is completely sufficient for English; it doesn't matter whether you use uc or lc. However, it presents a
problem for languages like Greek or Turkish where there is no 1:1 correspondence between upper- and lowercase
letters, so you get different results depending on whether you use uc or lc.
Therefore, Perl 5.16 and higher have a case folding function called fc that avoids this problem, so modern multi-
lingual sorting should use this:
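For example:
use feature 'fc';
my @words  = qw(Banana apple Cherry);
my @sorted = sort { fc($a) cmp fc($b) } @words;    # apple, Banana, Cherry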
Comparing $a and $b with the <=> operator ensures they are compared numerically and not textually as per
default.
Sorting items in descending order can simply be achieved by swapping $a and $b in the comparator block.
However, some people prefer the clarity of a separate reverse even though it is slightly slower.
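For example:
my @numbers     = ( 10, 2, 33 );
my @ascending   = sort { $a <=> $b } @numbers;          # ( 2, 10, 33 )
my @descending  = sort { $b <=> $a } @numbers;          # ( 33, 10, 2 )
my @descending2 = reverse sort { $a <=> $b } @numbers;  # same result, arguably clearer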
This is the basic idiom for "default" File IO and makes $filehandle a readable input stream of bytes, filtered by a
default system-specific decoder, which can be locally set with the open pragma
Perl itself does not handle errors in file opening, so you have to handle those yourself by checking the return value
of open. $! is populated with the error message that caused open to fail.
On Windows, the default decoder is a "CRLF" filter, which maps any "\r\n" sequences in the input to "\n"
This specifies that Perl should not perform a CRLF translation on Windows.
This specifies that Perl should both avoid CRLF translation, and then decode the resulting bytes into strings of
characters ( internally implemented as arrays of integers which can exceed 255 ), instead of strings of bytes
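A sketch of the three variants just described (the file names are placeholders, and UTF-8 is only an example encoding):
# Default: text mode with the system's usual CRLF handling
open( my $fh, '<', 'input.txt' ) or die "Can't open input.txt: $!";
# Raw bytes, no CRLF translation
open( my $raw_fh, '<:raw', 'input.txt' ) or die "Can't open input.txt: $!";
# Raw bytes decoded into character strings
open( my $char_fh, '<:raw:encoding(UTF-8)', 'input.txt' ) or die "Can't open input.txt: $!";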
# You can then either read the file one line at a time...
while ( my $line = <$fh> ) {
    chomp $line;
    print $line . "\n";
}
If you know that your input file is UTF-8, you can specify the encoding:
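For example ($filename is a placeholder):
open( my $fh, '<:encoding(UTF-8)', $filename ) or die "Can't open $filename: $!";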
After finished reading from the file, the filehandle should be closed:
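For example:
close $fh or warn "Problem closing the file: $!";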
Another, faster way to read a file is to use the File::Slurper module. This is useful if you work with many files.
use File::Slurper;
my $file = read_text("path/to/file"); # utf8 without CRLF transforms by default
print $file; #Contains the file body
#!/usr/bin/perl
use strict;
use warnings;
use open qw( :encoding(UTF-8) :std ); # Make UTF-8 default encoding
# Open "output.txt" for writing (">") and from now on, refer to it as the variable $fh.
open(my $fh, ">", "output.txt")
# In case the action failed, print error message and quit.
or die "Can't open > output.txt: $!";
Now we have an open file ready for writing which we access through $fh (this variable is called a filehandle). Next
we can direct output to that file using the print operator:
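For example:
print $fh "Hello, output file!\n";    # note: no comma after the filehandle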
The open operator has a scalar variable ($fh in this case) as its first parameter. Since it is defined in the open
operator it is treated as a filehandle. Second parameter ">" (greater than) defines that the file is opened for writing.
The last parameter is the path of the file to write the data to.
To write the data into the file, the print operator is used along with the filehandle. Notice that in the print operator
there is no comma between the filehandle and the statement itself, just whitespace.
Section 12.4: "use autodie" and you won't need to check file open/close failures
autodie allows you to work with files without having to explicitly check for open/close failures.
Since Perl 5.10.1, the autodie pragma has been available in core Perl. When used, Perl will automatically check for
errors when opening and closing files (and for most other filesystem-related builtins) and die with a descriptive
message on failure.
Here is an example in which all of the lines of one file are read and then written to the end of a log file.
use 5.010; # 5.010 and later enable "say", which prints arguments, then a newline
use strict; # require declaring variables (avoid silent errors due to typos)
use warnings; # enable helpful syntax-related warnings
use open qw( :encoding(UTF-8) :std ); # Make UTF-8 default encoding
use autodie; # Automatically handle errors in opening and closing files
while (my $line = readline $fh_in) # also works: while (my $line = <$fh_in>)
{
# remove newline
chomp $line;
By the way, you should technically always check print statements. Many people don't, but perl (the Perl
interpreter) doesn't do this automatically and neither does autodie.
# identify current position in file, in case the first line isn't a comment
my $current_pos = tell;
# Step back a line so that it can be processed later as the first data line
seek $fh, $current_pos, 0;
To write a gzipped file, use the module IO::Compress::Gzip and create a filehandle by creating a new instance of
IO::Compress::Gzip for the output file:
use strict;
use warnings;
use open qw( :encoding(UTF-8) :std ); # Make UTF-8 default encoding
use IO::Compress::Gzip;
my $fh_out = IO::Compress::Gzip->new("hello.txt.gz");
$fh_out->print("Hello World!\n");    # write (compressed) data to the file
close $fh_out;
To read from a gzipped file, use the module IO::Uncompress::Gunzip and then create a filehandle by creating a
new instance of IO::Uncompress::Gunzip for the input file:
#!/bin/env perl
use strict;
use warnings;
use open qw( :encoding(UTF-8) :std ); # Make UTF-8 default encoding
use IO::Uncompress::Gunzip;
my $fh_in = IO::Uncompress::Gunzip->new("hello.txt.gz");
my $line = $fh_in->getline();    # read the uncompressed data back
print $line;
This pragma changes the default mode of reading and writing text (files, standard input, standard output, and
standard error) to UTF-8, which is typically what you want when writing new applications.
ASCII is a subset of UTF-8, so this is not expected to cause any problems with legacy ASCII files, and it will help
protect you from the accidental file corruption that can happen when treating UTF-8 files as ASCII.
However, it is important to know what encoding the files you are dealing with actually use, and to handle them
accordingly. (Reasons that we should not ignore Unicode.) For a more in-depth treatment of Unicode, please see the
Perl Unicode topic.
use Path::Tiny;
my $contents = path($filename)->slurp;
You can pass a binmode option if you need control over file encodings, line endings etc. - see man perlio:
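For example (a sketch; the :raw layer is just one possibility):
my $raw_contents  = path($filename)->slurp( { binmode => ':raw' } );
my $utf8_contents = path($filename)->slurp_utf8;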
Path::Tiny also has a lot of other functions for dealing with files so it may be a good choice.
After opening the file (read man perlio if you want to read specific file encodings instead of raw bytes), the trick is
in the do block: <$fh>, the file handle in a diamond operator, returns a single record from the file. The "input record
separator" variable $/ specifies what a "record" is—by default it is set to a newline character so "a record" means "a
single line". As $/ is a global variable, local does two things: it creates a temporary local copy of $/ that will vanish
at the end of the block, and gives it the (non-)value undef (the "value" which Perl gives to uninitialized variables).
When the input record separator has that (non-)value, the diamond operator will return the entire file. (It considers
the entire file to be a single line.)
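The idiom being described looks like this (a sketch; $filename is a placeholder):
my $contents = do {
    open( my $fh, '<', $filename ) or die "Can't open $filename: $!";
    local $/;    # temporarily undefine the input record separator
    <$fh>;       # with $/ undefined, this returns the whole file
};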
Using do, you can even get around manually opening a file. For repeated reading of files, a small helper sub like the
one sketched below can be used. Here, another global variable (@ARGV) is localized to simulate the same process used
when starting a perl script with parameters. $/ is still undef, since the array in front of it "eats" all incoming
arguments. Next, the diamond operator <> again delivers one record defined by $/ (the whole file) and returns it from
the do block, which in turn returns it from the sub.
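A sketch of such a helper:
sub read_file {
    my ($filename) = @_;
    return do { local ( @ARGV, $/ ) = $filename; <> };
}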
The sub has no explicit error handling, which is bad practice! If an error occurs while reading the file, you will
receive undef as return value, as opposed to an empty string from an empty file.
Another disadvantage of the last code is the fact that you cannot use PerlIO for different file encodings—you always
get raw bytes.
read_text() takes two optional parameters to specify the file encoding and whether line endings should be
translated between the unixish LF or DOSish CRLF standards:
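For example:
my $contents = read_text( $filename, 'UTF-8', 1 );    # decode as UTF-8 and translate CRLF line endings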
When evaluated in list context, the diamond operator returns a list consisting of all the lines in the file (in this case,
assigning the result to an array supplies list context). The line terminator is retained, and can be removed by
chomping:
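For example:
my @lines = <$fh>;
chomp @lines;    # removes the trailing newline from every element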
Going further with minimalism, specifying -n switch causes Perl to automatically read each line (in our case — the
whole file) into variable $_.
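A sketch of such a one-liner (with -0777 so that the one "line" is the whole file):
perl -0777 -ne 'print length($_), "\n"' input.txt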
print 'This literal contains a \'postraphe '; # emits the ' but not its preceding \
print q/This is a literal \' <-- 2 characters /; # prints both \ and '
print q^This is a literal \' <-- 2 characters ^; # also
my $greeting = "Hello!\n";
print $greeting;
# => Hello! (followed by a linefeed)
The qq is useful here, to avoid having to escape the quotation marks. Without it, we would have to write...
Perl doesn't limit you to using a slash / with qq; you can use any (visible) character.
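A sketch covering both points:
print qq/She said "Hello" to me.\n/;    # qq with the usual slash delimiter
print qq{She said "Hello" to me.\n};    # any visible character works as a delimiter
print "She said \"Hello\" to me.\n";    # without qq, the inner quotes must be escaped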
By default the values are space-separated – because the special variable $" defaults to a single space. This can, of
course, be changed.
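For example:
my @fruit = ( 'apples', 'pears', 'plums' );
print "We have @fruit.\n";        # We have apples pears plums.
{
    local $" = ', ';              # change the list separator locally
    print "We have @fruit.\n";    # We have apples, pears, plums.
}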
If you prefer, you have the option to use English and change $LIST_SEPARATOR instead:
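For example (reusing @fruit from above):
use English;
local $LIST_SEPARATOR = ', ';
print "We have @fruit.\n";    # We have apples, pears, plums.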
For anything more complex than this, you should use a loop instead.
The so-called "cart operator" causes perl to dereference @{ ... } the array reference [ ... ] that contains the
expression that you want to interpolate, 2 + 2. When you use this trick, Perl builds an anonymous array, then
dereferences it and discards it.
The ${\( ... )} version is somewhat less wasteful, but it still requires allocating memory and it is even harder to
read.
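For example:
print "2 + 2 = @{[ 2 + 2 ]}\n";     # 2 + 2 = 4
print "2 + 2 = ${\ (2 + 2)}\n";     # 2 + 2 = 4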
New in Perl 5.26.0 is an "Indented Heredoc" syntax (<<~) which trims the left-padding off for you:
Version ≥ 5.26.0
my $variable = <<~"MuchNicer";
this block of text is interpreted.
quotes\nare interpreted, and $interpolations
get interpolated...
but still, left-aligned "I Want it to End" matters.
MuchNicer
But usually no one worries about how many newlines were removed, so chomp is typically seen in void context, most
often after having read lines from a file:
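For example (assuming $fh is an open filehandle):
while ( my $line = <$fh> ) {
    chomp $line;    # discard the trailing newline, if any
    # ... work with $line ...
}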
use 5.010;
use Text::ParseWords;
Output:
a quoted, comma
word1
word2
NOTES
By default, Text::CSV does not strip whitespace around separator character, the way Text::ParseWords does.
However, adding allow_whitespace=>1 to constructor attributes achieves that effect.
Output:
a quoted, comma
word1
word2
The library supports configurable separator character, quote character, and escape character
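A hedged sketch of such a configuration (the particular characters chosen are only an illustration):
use Text::CSV;
my $csv = Text::CSV->new({
    sep_char         => ';',     # separator character
    quote_char       => '"',     # quote character
    escape_char      => '\\',    # escape character
    allow_whitespace => 1,       # strip whitespace around separators
});
if ( $csv->parse(q{"a quoted; semicolon"; word1; word2}) ) {
    my @fields = $csv->fields;
}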
Documentation: http://search.cpan.org/perldoc/Text::CSV
Moose
package Foo;
use Moose;

has bar => ( is => 'rw' );    # assumed attributes; the original declarations are not shown
has baz => ( is => 'rw' );
sub qux {
my $self = shift;
my $barIsBaz = $self->bar eq 'baz'; # property getter
$self->baz($barIsBaz); # property setter
}
package Foo;
use Class::Accessor 'antlers';

has bar => ( is => 'rw' );    # Moose-like declarations provided by the 'antlers' flavour
has baz => ( is => 'rw' );
package Foo;
use base qw(Class::Accessor);

__PACKAGE__->mk_accessors(qw(bar baz));    # classic interface
Class::Tiny
package Foo;
use Class::Tiny qw(bar baz); # just props
package Point;

sub new {
    my ($class, $x, $y) = @_;
    my $self = { x => $x, y => $y };    # store object data in a hash
    bless $self, $class;                # bind the hash to the class
    return $self;
}
Whenever the arrow operator -> is used with methods, its left operand is prepended to the given argument list. So,
@_ in new will contain values ('Point', 1, 2.5).
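For example:
my $point = Point->new( 1, 2.5 );    # inside new, @_ is ( 'Point', 1, 2.5 )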
There is nothing special about the name new; you can name your factory methods however you prefer. There is also
nothing special about hashes: you could do the same in the following way:
package Point;
use strict;
sub new {
    my ($class, @coord) = @_;
    my $self = \@coord;
    bless $self, $class;
    return $self;
}
In general, any reference may be an object, even a scalar reference. But most often, hashes are the most
convenient way to represent object data.
package Point;
use strict;
sub new {
...
}
sub polar_coordinates {
...
}
1;
It is important to note that the variables declared in a package are class variables, not object (instance) variables.
Changing a package-level variable affects all objects of the class. For how to store object-specific data, see
"Creating Objects".
Point->new(...);
my @polar = $point->polar_coordinates;
What is to the left of the arrow is prepended to the given argument list of the method. For example, after the call
Point->new(1, 2); the new method receives ('Point', 1, 2) in @_.
Packages representing classes should take this convention into account and expect that all their methods will have
one extra argument.
package Point;
use strict;
...
1;
package Point2D;
use strict;
use parent qw(Point);
...
1;
package Point3D;
use strict;
use parent qw(Point);
...
1;
package Point2D;
use strict;
use parent qw(Point PlanarObject);
...
1;
Inheritance is all about resolving which method is to be called in a particular situation. Since pure Perl does not
prescribe any rules about the data structure used to store object data, inheritance has nothing to do with that.
package GeometryObject;
use strict;
1;
1;
package PlanarObject;
use strict;
use parent qw(GeometryObject);
1;
package Point2D;
use strict;
use parent qw(Point PlanarObject);
1;
1. The starting point is defined by the left operand of the arrow operator.
If it is a bare word:
Point2D->new(...);
my $class = 'Point2D';
$class->new(...);
...then the starting point is the package with the corresponding name (Point2D in both examples).
my $point = {...};
bless $point, 'Point2D'; # typically, it is encapsulated into class methods
my @coord = $point->polar_coordinates;
then the starting point is the class of the reference (again, Point2D). The arrow operator cannot be
used to call methods for unblessed references.
Point2D->new(...);
Point2D
Point (first parent of Point2D)
GeometryObject (parent of Point)
PlanarObject (second parent of Point2D)
my $point = Point2D->new(...);
$point->transpose(...);
the method that will be called is GeometryObject::transpose, even though a transpose method also exists in
PlanarObject (it appears later in the search order).
In the previous example, you can explicitly call PlanarObject::transpose like so:
my $point = Point2D->new(...);
$point->PlanarObject::transpose(...);
5. In a similar manner, SUPER:: performs method search in parent classes of the current class.
For example,
package Point2D;
use strict;
use parent qw(Point PlanarObject);
sub new {
my ($class, $x, $y) = @_;
my $self = $class->SUPER::new;
...
}
1;
The left operand of the arrow operator -> becomes the first argument of the method to be called. It may be either a
string:
my $class = 'Point';
my $point = $class->new(...);    # same as Point->new(...)
or an object reference:
$point->polar_coordinates;
Class methods are just the ones that expect their first argument to be a string, and object methods are the ones
that expect their first argument to be an object reference.
Class methods typically do not do anything with their first argument, which is just a name of the class. Generally, it
is only used by Perl itself for method resolution. Therefore, a typical class method can be called for an object as
well:
my $width = Point->canvas_width;
my $point = Point->new(...);
my $width = $point->canvas_width;
Object methods receive an object reference as the first argument, so they can address the object data (unlike class
methods):
package Point;
use strict;
sub polar_coordinates {
    my ($point) = @_;
    my $x = $point->{x};
    my $y = $point->{y};
    return ( sqrt( $x * $x + $y * $y ), atan2( $y, $x ) );
}
1;
The same method can handle both cases and detect whether it was called as a class method or as an object method:
sub universal_method {
    my $self = shift;
    if ( ref $self ) {
        # object logic
        ...
    }
    else {
        # class logic
        ...
    }
}
A role may also require consuming classes to implement some methods instead of implementing the methods itself
(just like interfaces in Java or C#).
Perl does not have built-in support for roles but there are CPAN classes which provide such support.
Moose::Role
package Chatty;
use Moose::Role;

requires 'introduce';    # consuming classes must implement this method
                         # (assumed; the original role body is not shown)
package Parrot;
use Moose;
with 'Chatty';
sub introduce {
print "I'm Buddy.\n";
}
Role::Tiny
Use if your OO system does not provide support for roles (e.g. Class::Accessor or Class::Tiny). Does not support
attributes.
package Chatty;
use Role::Tiny;

requires 'introduce';    # consuming classes must implement this method
                         # (assumed; the original role body is not shown)
package Parrot;
use Class::Tiny;
use Role::Tiny::With;
with 'Chatty';
sub introduce {
print "I'm Buddy.\n";
}
my $ret;
eval {
    $ret = some_function_that_might_die();
    1;
} or do {
    my $eval_error = $@ || "Zombie error!";
    handle_error($eval_error);
};
# use $ret
We "abuse" the fact that die has a false return value, and the return value of the overall code block is the value of
the last expression in the code block:
if $ret is assigned to successfully, then the 1; expression is the last thing that happens in the eval code
block. The eval code block thus has a true value, so the or do block does not run.
if some_function_that_might_die() does die, then the last thing that happens in the eval code block is the
die. The eval code block thus has a false value and the or do block does run.
The first thing you must do in the or do block is read $@. This global variable will hold whatever argument was
passed to die. The || "Zombie Error" guard is popular, but unnecessary in the general case.
This is important to understand because not all code fails by calling die, but the same structure can be used
regardless. Consider, for example, a database function that simply returns a false value on failure. In that case you
can still use the same idiom, but you have to skip the final 1;, and this function has to be the last thing in the
eval. Something like this:
eval {
    my $value = My::Database::retrieve($my_thing);    # dies on fail
    $value->set_status("Completed");
    $value->set_completed_timestamp(time());
    $value->update();    # returns false value on fail
} or do {    # handles both the die and the 0 return value
    my $eval_error = $@ || "Zombie error!";
    handle_error($eval_error);
};
/^hello/ is the actual regular expression. The ^ is a special character that tells the regular expression to start with
the beginning of the string and not match in the middle somewhere. Then the regex tries to find the following
letters in order h, e, l, l, and o.
$_ = "hello world";
You can also use different delimiters if you precede the regular expression with the m operator:
m~^hello~;
m{^hello};
m|^hello|;
echo "1,2,[3,4,5],5,6,[7,8],[1,2,34],5" | \
perl -ne \
'while( /\[[^,\]]+\,.*\]/ ){
    if( /\[([^\]\|]+)\]/ ){
        $text = $1;
        $text_to_replace = $text;
        $text =~ s/\,/\|/g;
        s/$text_to_replace/$text/;
    }
}
print;'
1,2,[3|4|5],5,6,[7|8],[1|2|34],5
my $str = "hello.it's.me";
my @test = (
"hello.it's.me",
"hello/it's!me",
);
my @match = (
[ general_match=> sub { ismatched /$str/ } ],
[ qe_match => sub { ismatched /\Q$str\E/ } ],
);
for (@test) {
print "String = '$_':\n";
Output
String = 'hello.it's.me':
- general_match: MATCHED!
- qe_match: MATCHED!
String = 'hello/it's!me':
- general_match: MATCHED!
- qe_match: DID NOT MATCH!