perl5137delta - what is new for perl v5.13.7
This document describes differences between the 5.13.6 release and the 5.13.7 release.
If you are upgrading from an earlier release such as 5.13.5, first read perl5136delta, which describes differences between 5.13.5 and 5.13.6.
The +
prototype is a special alternative to $
that will act like
\[@%]
when given a literal array or hash variable, but will otherwise
force scalar context on the argument. This is useful for functions which
should accept either a literal array or an array reference as the argument:
When using the +
prototype, your function must check that the argument
is of an acceptable type.
use re '/flags';
The re
pragma now has the ability to turn on regular expression flags
till the end of the lexical scope:
- use re '/x';
- "foo" =~ / (.+) /; # /x implied
See '/flags' mode in re for details.
Statement labels can now occur before any type of statement or declaration,
such as package
.
use feature "unicode_strings"
now applies to more regex matchingAnother chunk of the The Unicode Bug in perlunicode is fixed in this
release. Now, regular expressions compiled within the scope of the
"unicode_strings" feature (or under the "u" regex modifier (specifiable
currently only with infix notation (?u:...)
or via use re '/u'
)
will match the same whether or not the target string is encoded in utf8,
with regard to [[:posix:]]
character classes
Work is underway to add the case sensitive matching to the control of this feature, but was not complete in time for this dot release.
All built-in functions that operate directly on array or hash containers now also accept hard references to arrays or hashes:
- |----------------------------+---------------------------|
- | Traditional syntax | Terse syntax |
- |----------------------------+---------------------------|
- | push @$arrayref, @stuff | push $arrayref, @stuff |
- | unshift @$arrayref, @stuff | unshift $arrayref, @stuff |
- | pop @$arrayref | pop $arrayref |
- | shift @$arrayref | shift $arrayref |
- | splice @$arrayref, 0, 2 | splice $arrayref, 0, 2 |
- | keys %$hashref | keys $hashref |
- | keys @$arrayref | keys $arrayref |
- | values %$hashref | values $hashref |
- | values @$arrayref | values $arrayref |
- | ($k,$v) = each %$hashref | ($k,$v) = each $hashref |
- | ($k,$v) = each @$arrayref | ($k,$v) = each $arrayref |
- |----------------------------+---------------------------|
This allows these built-in functions to act on long dereferencing chains
or on the return value of subroutines without needing to wrap them in
@{}
or %{}
:
For push
, unshift
and splice
, the reference will auto-vivify
if it is not defined, just as if it were wrapped with @{}
.
Calling keys
or values
directly on a reference gives a substantial
performance improvement over explicit dereferencing.
For keys
, values
, each
, when overloaded dereferencing is
present, the overloaded dereference is used instead of dereferencing the
underlying reftype. Warnings are issued about assumptions made in the
following three ambiguous cases:
The /r
flag, which was added to s///
in 5.13.2, has been extended to
the y///
operator.
It causes it to perform the substitution on a copy of its operand, returning that copy instead of a character count.
${^GLOBAL_PHASE}
A new global variable, ${^GLOBAL_PHASE}
, has been added to allow
introspection of the current phase of the perl interpreter. It's explained in
detail in ${^GLOBAL_PHASE} in perlvar and
BEGIN, UNITCHECK, CHECK, INIT and END in perlmod.
Perl comes with the Unicode 6.0 data base updated with Corrigendum #8, with one exception noted below. See http://unicode.org/versions/Unicode6.0.0 for details on the new release. Perl does not support any Unicode provisional properties, including the new ones for this release, but their database files are packaged with Perl.
Unicode 6.0 has chosen to use the name BELL
for the character at U+1F514,
which is a symbol that looks like a bell, and used in Japanese cell
phones. This conflicts with the long-standing Perl usage of having
BELL
mean the ASCII BEL
character, U+0007. In Perl 5.14,
\N{BELL}
will continue to mean U+0007, but its use will generate a
deprecated warning message, unless such warnings are turned off. The
new name for U+0007 in Perl will be ALERT
, which corresponds nicely
with the existing shorthand sequence for it, "\a"
. \N{BEL}
will
mean U+0007, with no warning given. The character at U+1F514 will not
have a name in 5.14, but can be referred to by \N{U+1F514}
. The plan
is that in Perl 5.16, \N{BELL}
will refer to U+1F514, and so all code
that uses \N{BELL}
should convert by then to using \N{ALERT}
,
\N{BEL}
, or "\a"
instead.
Custom ops can now be registered with the new custom_op_register
C
function and the XOP
structure. This will make it easier to add new
properties of custom ops in the future. Two new properties have been added
already, xop_class
and xop_peep
.
xop_class
is one of the OA_*OP constants, and allows B and other
introspection mechanisms to work with custom ops that aren't BASEOPs.
xop_peep
is a pointer to a function that will be called for ops of this
type from Perl_rpeep
.
See Custom Operators in perlguts and Custom Operators in perlapi for more detail.
The old PL_custom_op_names
/PL_custom_op_descs
interface is still
supported but discouraged.
If you assign a typeglob to a scalar variable:
- $glob = *foo;
the glob that is copied to $glob
is marked with a special flag
indicating that the glob is just a copy. This allows subsequent assignments
to $glob
to overwrite the glob. The original glob, however, is
immutable.
Many Perl operators did not distinguish between these two types of globs.
This would result in strange behaviour in edge cases: untie $scalar
would do nothing if the last thing assigned to the scalar was a glob
(because it treated it as untie *$scalar
, which unties a handle).
Assignment to a glob slot (e.g., (*$glob) = \@some_array
) would simply
assign \@some_array
to $glob
.
To fix this, the *{}
operator (including the *foo
and *$foo
forms)
has been modified to make a new immutable glob if its operand is a glob
copy. Various operators that make a distinction between globs and scalars
have been modified to treat only immutable globs as globs.
This causes an incompatible change in code that assigns a glob to the
return value of *{}
when that operator was passed a glob copy. Take the
following code, for instance:
- $glob = *foo;
- *$glob = *bar;
The *$glob
on the second line returns a new immutable glob. That new
glob is made an alias to *bar
. Then it is discarded. So the second
assignment has no effect.
It also means that tie $handle
will now tie $handle
as a scalar, even
if it has had a glob assigned to it.
The upside to this incompatible change is that bugs [perl #77496], [perl #77502], [perl #77508], [perl #77688], and [perl #77812], and maybe others, too, have been fixed.
See http://rt.perl.org/rt3/Public/Bug/Display.html?id=77810 for even more detail.
Stash list assignment %foo:: = ()
used to make the stash anonymous
temporarily while it was being emptied. Consequently, any of its
subroutines referenced elsewhere would become anonymous (showing up as
"(unknown)" in caller
). Now they retain their package names, such that
caller
will return the original sub name if there is still a reference
to its typeglob, or "foo::__ANON__" otherwise
[perl #79208].
\N{BELL}
is deprecatedThis is because Unicode is using that name for a different character. See Unicode Version 6.0 is now supported (mostly) for more explanation.
When an object has many weak references to it, freeing that object can under some some circumstances take O(N^2) time to free (where N is the number of references). The number of circumstances has been reduced. [perl #75254].
The following modules were added by the Unicode::Collate
upgrade from 0.63 to 0.67. See below for details.
Unicode::Collate::CJK::Big5
Unicode::Collate::CJK::GB2312
Unicode::Collate::CJK::JISX0208
Unicode::Collate::CJK::Korean
Unicode::Collate::CJK::Pinyin
Unicode::Collate::CJK::Stroke
Archive::Extract
has been upgraded from 0.44 to 0.46
Resolves an issue with NetBSD-current and its new unzip executable.
Archive::Tar
has been upgraded from 1.68 to 1.72
This adds the ptargrep utility for using regular expressions against the contents of files in a tar archive.
B
has been upgraded from 1.24 to 1.26.
It no longer crashes when taking apart a y///
containing characters
outside the octet range or compiled in a use utf8
scope.
The size of the shared object has been reduced by about 40%, with no reduction in functionality.
B::Deparse
has been upgraded from 0.99 to 1.01.
It fixes deparsing of our
followed by a variable with funny characters
(as permitted under the utf8
pragma)
[perl #33752].
CGI
has been upgraded from 3.49 to 3.50
This provides the following security fixes: the MIME boundary in multipart_init is now random and improvements to the handling of newlines embedded in header values.
The documentation for param_fetch() has been corrected and clarified.
CPAN
has been upgraded from 1.94_61 to 1.94_62
CPANPLUS
has been upgraded from 0.9007 to 0.9010
Fixes for the SQLite source engine and resolving of issues with the testsuite when run under local::lib and/or cpanminus
CPANPLUS::Dist::Build
has been upgraded from 0.48 to 0.50
Data::Dumper
has been upgraded from 2.129 to 2.130_01.
DynaLoader
has been upgraded from 1.10 to 1.11.
It fixes a buffer overflow when passed a very long file name.
ExtUtils::Constant
has been upgraded from 0.22 to 0.23.
The AUTOLOAD
helper code generated by ExtUtils::Constant::ProxySubs
can now croak
for missing constants, or generate a complete AUTOLOAD
subroutine in XS, allowing simplification of many modules that use it.
(Fcntl
, File::Glob
, GDBM_File
, I18N::Langinfo
, POSIX
, Socket
)
ExtUtils::Constant::ProxySubs
can now optionally push the names of all
constants onto the package's C{@EXPORT_OK}. This has been used to replace
less space-efficient code in B
, helping considerably shrink the size of its
shared object.
Fcntl
has been upgraded from 1.09 to 1.10.
File::Fetch
has been upgraded from 0.24 to 0.28
HTTP::Lite
is now supported for 'http' scheme.
The fetch
utility is supported on FreeBSD, NetBSD and
Dragonfly BSD for the http
and ftp
schemes.
File::Glob
has been upgraded from 1.09 to 1.10.
File::stat
has been upgraded from 1.03 to 1.04.
The -x
and -X
file test operators now work correctly under the root
user.
GDBM_File
has been upgraded from 1.11 to 1.12.
This fixes a memory leak when DBM filters are used.
Hash::Util
has been upgraded from 0.09 to 0.10.
Hash::Util::FieldHash
has been upgraded from 1.05 to 1.06.
I18N::Langinfo
has been upgraded from 0.06 to 0.07.
Locale::Maketext
has been upgraded from 1.16 to 1.17.
Math::BigInt
has been upgraded from 1.97 to 1.99_01.
Math::BigRat
has been upgraded from 0.26 to 0.26_01
Math::BigInt::FastCalc
has been upgraded from 0.22 to 0.24_01.
MIME::Base64
has been upgraded from 3.09 to 3.10
Includes new functions to calculate the length of encoded and decoded base64 strings.
mro
has been upgraded from 1.04 to 1.05.
NDBM_File
has been upgraded from 1.09 to 1.10.
This fixes a memory leak when DBM filters are used.
ODBM_File
has been upgraded from 1.08 to 1.09.
This fixes a memory leak when DBM filters are used.
Opcode
has been upgraded from 1.16 to 1.17.
parent
has been upgraded from 0.223 to 0.224
Pod::Simple
has been upgraded from 3.14 to 3.15
Includes various fixes to HTML
and XHTML
handling.
POSIX
has been upgraded from 1.21 to 1.22.
re
has been upgraded from 0.13 to 0.14, for the sake of the new
use re "/flags"
pragma.
Safe
has been upgraded from 2.28 to 2.29.
It adds &version::vxs::VCMP
to the default share.
SDBM_File
has been upgraded from 1.07 to 1.08.
SelfLoader
has been upgraded from 1.17 to 1.18.
It now works in taint mode [perl #72062].
Socket
has been upgraded from 1.90 to 1.91.
Storable
has been upgraded from 2.22 to 2.24
Includes performance improvement for overloaded classes.
Sys::Hostname
has been upgraded from 1.13 to 1.14.
Unicode::Collate
has been upgraded from 0.63 to 0.67
This release newly adds locales ja
ko
and zh
and its variants
( zh__big5han
, zh__gb2312han
, zh__pinyin
, zh__stroke
).
Supported UCA_Version 22 for Unicode 6.0.0.
The following modules have been added:
Unicode::Collate::CJK::Big5
for zh__big5han
which makes
tailoring of CJK Unified Ideographs in the order of CLDR's big5han ordering.
Unicode::Collate::CJK::GB2312
for zh__gb2312han
which makes
tailoring of CJK Unified Ideographs in the order of CLDR's gb2312han ordering.
Unicode::Collate::CJK::JISX0208
which makes tailoring of 6355 kanji
(CJK Unified Ideographs) in the JIS X 0208 order.
Unicode::Collate::CJK::Korean
which makes tailoring of CJK Unified Ideographs
in the order of CLDR's Korean ordering.
Unicode::Collate::CJK::Pinyin
for zh__pinyin
which makes
tailoring of CJK Unified Ideographs in the order of CLDR's pinyin ordering.
Unicode::Collate::CJK::Stroke
for zh__stroke
which makes
tailoring of CJK Unified Ideographs in the order of CLDR's stroke ordering.
perlvar reorders the variables and groups them by topic. Each variable introduced after Perl 5.000 notes the first version in which it is available. perlvar also has a new section for deprecated variables to note when they were removed.
New style guide for POD documentation, split mostly from the NOTES section of the pod2man man page.
( This was added to v5.13.6
but was not documented with that release ).
Array and hash slices in scalar context are now documented in perldata.
perlform and perllocale have been corrected to state that
use locale
affects formats.
"Using !~ with %s doesn't make sense": This message was actually added in
5.13.2, but was omitted from perldelta. It now applies also to the y///
operator, and has been documented.
ptargrep is a utility to apply pattern matching to the contents of files
in a tar archive. It comes with Archive::Tar
.
The new t/mro/isa_aliases.t has been added, which tests that
*Foo::ISA = *Bar::ISA
works properly.
t/mro/isarev.t has been added, which tests that PL_isarev
(accessible
at the Perl level via mro::get_isarev
) is updated properly.
t/run/switchd-78586.t has been added, which tests that [perl #78586] has been fixed (related to line numbers in the debugger).
Directory handles are now properly cloned when threads are created. In perl 5.13.6, child threads simply stopped inheriting directory handles. In previous versions, threads would share handles, resulting in crashes.
Support for building with Visual C++ 2010 is now underway, but is not yet complete. See README.win32 for more details.
Record-oriented files (record format variable or variable with fixed control) opened for write by the perlio layer will now be line buffered to prevent the introduction of spurious line breaks whenever the perlio buffer fills up.
lex_start
has been added to the API, but is considered experimental.
A new parse_block
function has been added to the API
[perl #78222].
A new, experimental API has been added for accessing the internal
structure that Perl uses for %^H
. See the functions beginning with
cophh_
in perlapi.
A stash can now have a list of effective names in addition to its usual
name. The first effective name can be accessed via the HvENAME
macro,
which is now the recommended name to use in MRO linearisations (HvNAME
being a fallback if there is no HvENAME
).
These names are added and deleted via hv_ename_add
and
hv_ename_delete
. These two functions are not part of the API.
The way the parser handles labels has been cleaned up and refactored. As a
result, the newFOROP()
constructor function no longer takes a parameter
stating what label is to go in the state op.
The newWHILEOP()
and newFOROP()
functions no longer accept a line
number as a parameter.
A new parse_barestmt()
function has been added, for parsing a statement
without a label.
A new parse_label()
function has been added, that parses a statement
label, separate from statements.
The CvSTASH()
macro can now only be used as an rvalue. CvSTASH_set()
has been added to replace assignment to CvSTASH()
. This is to ensure
that backreferences are handled properly. These macros are not part of the
API.
The op_scope()
and op_lvalue()
functions have been added to the API,
but are considered experimental.
The parse_stmt
C function added in earlier in the 5.13.x series has been
fixed to work with statements ending with }
[perl #78222].
The parse_fullstmt
C function added in 5.13.5 has been fixed to work
when called while an expression is being parsed.
Characters in the Latin-1 non-ASCII range (0x80 to 0xFF) used not to match
themselves if the string happened to be UTF8-encoded internally, the
regular expression was not, and the character in the regular expression was
inside a repeated group (e.g.,
Encode::decode_utf8("\303\200") =~ /(\xc0)+/
)
[perl #78464].
The (?d)
regular expression construct now overrides a previous (?u)
or use feature "unicode_string"
[perl #78508].
A memory leak in do "file"
, introduced in perl 5.13.6, has been fixed
[perl #78488].
Various bugs related to typeglob dereferencing have been fixed. See Dereferencing typeglobs, above.
The SvPVbyte
function available to XS modules now calls magic before
downgrading the SV, to avoid warnings about wide characters
[perl #72398].
The =
operator used to ignore magic (e.g., tie methods) on its
right-hand side if the scalar happened to hold a typeglob. This could
happen if a typeglob was the last thing returned from or assigned to a tied
scalar
[perl #77498].
sprintf
was ignoring locales when called with constant arguments
[perl #78632].
A non-ASCII character in the Latin-1 range could match both a Posix
class, such as [[:alnum:]]
, and its inverse [[:^alnum:]]
. This is
now fixed for regular expressions compiled under the "u"
modifier.
See use feature unicode_strings now applies to more regex matching.
[perl #18281].
Concatenating long strings under use encoding
no longer causes perl to
crash
[perl #78674].
Typeglob assignments would crash if the glob's stash no longer existed, so long as the glob assigned to was named 'ISA' or the glob on either side of the assignment contained a subroutine.
Calling ->import
on a class lacking an import method could corrupt
the stack, resulting in strange behaviour. For instance,
- push @a, "foo", $b = bar->import;
would assign 'foo' to $b
[perl #63790].
Creating an alias to a package when that package had been detached from the symbol table would result in corrupted isa caches [perl #77358].
.=
followed by <>
or readline
would leak memory if $/
contained characters beyond the octet range and the scalar assigned to
happened to be encoded as UTF8 internally
[perl #72246].
The recv
function could crash when called with the MSG_TRUNC flag
[perl #75082].
Evaluating a simple glob (like *a
) was calling get-magic on the glob,
even when its contents were not being used
[perl #78580].
This bug was introduced in 5.13.2 and did not affect earlier perl versions.
Matching a Unicode character against an alternation containing characters
that happened to match continuation bytes in the former's UTF8
representation (qq{\x{30ab}} =~ /\xab|\xa9/
) would cause erroneous
warnings
[perl #70998].
s///r
(added in 5.13.2) no longer leaks.
The trie optimisation was not taking empty groups into account, preventing
'foo' from matching /\A(?:(?:)foo|bar|zot)\z/
[perl #78356].
A pattern containing a +
inside a lookahead would sometimes cause an
incorrect match failure in a global match (e.g., /(?=(\S+))/g
)
[perl #68564].
Iterating with foreach
over an array returned by an lvalue sub now works
[perl #23790].
$@
is now localised during calls to binmode
to prevent action at a
distance
[perl #78844].
PL_isarev
, which is accessible to Perl via mro::get_isarev
is now
updated properly when packages are deleted or removed from the @ISA
of
other classes. This allows many packages to be created and deleted without
causing a memory leak
[perl #75176].
undef *Foo::
and undef *Foo::ISA
and delete $package::{ISA}
used not to update the internal isa caches if the
stash or @ISA
array had a reference elsewhere. In
fact, undef *Foo::ISA
would stop a new @Foo::ISA
array from updating
caches.
@ISA
arrays can now be shared between classes via
*Foo::ISA = \@Bar::ISA
or *Foo::ISA = *Bar::ISA
[perl #77238].
The parser no longer hangs when encountering certain Unicode characters, such as U+387 [perl #74022].
formline
no longer crashes when passed a tainted format picture. It also
taints $^A
now if its arguments are tainted
[perl #79138].
A signal handler called within a signal handler could cause leaks or double-frees. Now fixed. [perl #76248].
When trying to report Use of uninitialized value $Foo::BAR
, crashes could
occur if the GLOB of the global variable causing the warning has been detached
from its original stash by, for example delete $::{'Foo::'}
. This has been
fixed by disabling the reporting of variable names in the warning in those
cases.
Randy Kobes, creator of the kobesearch alternative to search.cpan.org and contributor/maintainer to several core Perl toolchain modules, passed away on September 18, 2010 after a battle with lung cancer. His contributions to the Perl community will be missed.
Perl 5.13.7 represents approximately one month of development since Perl 5.13.6 and contains 73100 lines of changes across 518 files from 39 authors and committers:
Abhijit Menon-Sen, Abigail, Ben Morrow, Chas. J. Owens IV, Chris 'BinGOs' Williams, Craig A. Berry, David Golden, David Mitchell, Father Chrysostomos, Fingle Nark, Florian Ragwitz, George Greer, Grant McLean, H.Merijn Brand, Ian Goodacre, Jan Dubois, Jerry D. Hedden, Jesse Vincent, Karl Williamson, Lubomir Rintel, Marty Pauley, Moritz Lenz, Nicholas Clark, Nicolas Kaiser, Niko Tyni, Peter John Acklam, Rafael Garcia-Suarez, Shlomi Fish, Steffen Mueller, Steve Hay, Tatsuhiko Miyagawa, Tim Bunce, Todd Rinaldo, Tom Christiansen, Tom Hukins, Tony Cook, Yves Orton, Zefram and brian d foy
Many of the changes included in this version originated in the CPAN modules included in Perl's core. We're grateful to the entire CPAN community for helping Perl to flourish.
If you find what you think is a bug, you might check the articles recently posted to the comp.lang.perl.misc newsgroup and the perl bug database at http://rt.perl.org/perlbug/ . There may also be information at http://www.perl.org/ , the Perl Home Page.
If you believe you have an unreported bug, please run the perlbug
program included with your release. Be sure to trim your bug down
to a tiny but sufficient test case. Your bug report, along with the
output of perl -V
, will be sent off to perlbug@perl.org to be
analysed by the Perl porting team.
If the bug you are reporting has security implications, which make it inappropriate to send to a publicly archived mailing list, then please send it to perl5-security-report@perl.org. This points to a closed subscription unarchived mailing list, which includes all the core committers, who be able to help assess the impact of issues, figure out a resolution, and help co-ordinate the release of patches to mitigate or fix the problem across all platforms on which Perl is supported. Please only use this address for security issues in the Perl core, not for modules independently distributed on CPAN.
The Changes file for an explanation of how to view exhaustive details on what changed.
The INSTALL file for how to build Perl.
The README file for general stuff.
The Artistic and Copying files for copyright information.