Perl6: Unary Sort

Perl6: Unary Sort(perl6advent.wordpress.com)

66 points by patrickas 12 years ago | 44 comments

ciderpunx 12 years ago |

I really need to take a proper looks at Perl 6 -- it has some nice characteristics. Maybe that's my new years resolution.

stesch 12 years ago | |

And a dozen ways to write a very simple function.

Renaud 12 years ago | | |

Old debate, it's a core philosophy or the language. It's a bit like complaining that there are a 100 ways to cook chicken.

Personally, I like the freedom that Perl offers. Perl 5.x is still widely used in some industries, finance and banking for instance, and it's still the glue that holds 'nix systems together (have a look at the scripts in the bin folders of any distro).

Following Modern Perl[1] best practices, you can write powerful, meaningful and expressive Perl 5 without shooting yourself in the foot.

Perl 6 is another beast, it removes many of the ambiguities present in Perl 5 and introduces more functional paradigms.

At its core, Perl remains a multipurpose tool. The fact that there are multiple ways to do a thing is not bad, whether it suits you or not is a matter of personal preference really.

So, if someone is curious about Perl, they should be encouraged to try it and find out for themselves if they like it or not.

[1] http://modernperlbooks.com/books/modern_perl/

kamaal 12 years ago | | |

That's a wrong way of looking at things. The right way is to see how a language adapts per your thinking for writing complex functions. When you look at things from such an angle, its highly restrictive if a language expects you to always bend towards its one true ideology.

A nice language will offer you enough creative ammunition and will do as you wish while you are busy at more important things.

rmc 12 years ago | | |

It's a feature, not a bug!

yxhuvud 12 years ago |

Somehow I prefer the Ruby solution to add another method to do the unary variant. Compare

  ["A","b","C"].sort {|a, b| a.downcase <=> b.downcase }

  ["A","b","C"].sort_by {|k| k.downcase }

(or equivalently)

  ["A","b","C"].sort_by &:downcase

raiph 12 years ago | |

The unary sort is useful for two reasons. It simplifies the code AND it does key caching.

But sometimes you need BOTH a key extraction closure (to get key caching for performance) AND a comparison closure (to specify a custom sort). In P6 you just specify both closures (P6 figures out which is which because one has one arg and the other has two).

Here's the P6 equivalent of your last couple lines:

    ["A","b","C"].sort: { .lc } # A, b, C
    ["A","b","C"].sort:  *.lc   # same thing

Here's a custom sort:

    ["A","b","C"].sort: { $^b.lc leg $^a.lc } # C, b, A

Now combining them:

    ["A","b","C"].sort: *.lc, { $^b leg $^a } # C, b, A

The latter will run faster.

colomon 12 years ago | | |

The latter doesn't appear to be implemented in any version of p6.

adrianmsmith 12 years ago |

Great stuff, nevertheless the following is a surely a regression:

> Note that in Perl 6, cmp is smart enough to compare strings with string semantics and numbers with number semantics, so producing numbers in the transformation code generally does what you want.

Perl 5 had a clear distinction between "cmp" (string comparison) and "<=>" (numeric comparison). Trying to work out the data type, and thus the comparison approach, from the actual data itself, is surely going to create subtle bugs, that don't appear in testing, but do appear with live data.

colomon 12 years ago | |

Perl 6 has three comparisons: "<=>" for numeric, "leg" for string (stands for less equal greater), and "cmp" as described above.

colomon 12 years ago | | |

I should have also said that a great deal of thought went into making the default behaviors for "cmp" robust. But if (as in the article's example) you're producing numbers in the transformation code, "cmp" is guaranteed to do the right thing for you. Likewise if you produce strings in the transformation.

orblivion 12 years ago | |

This caught my eye as well. As a Python programmer who never learned Perl, this validates some stereotypes. Or maybe it's the budding Haskell programmer in me. "smart enough"? Sounds pretty dangerous to me.

laumars 12 years ago | | |

I don't know what stereotypes you are referring to, but Perl is actually one of the better loosely typed languages for not causing the aforementioned bugs. And also one of the best languages for not overloading operators too (which is one of the reasons it looks like executable line noise)

ugexe 12 years ago | | |

So don't use the comparison operator that attempts to detects the correct type and use the operator made for the specific type you know you are comparing?

rmc 12 years ago |

Python has this with the "key" argument to "sort".

    list.sort(key=lambda x: x.lower())

maxerickson 12 years ago | |

You can also pass in the method from the str type:

    key=str.lower

It will be faster, but it will break on elements that aren't strings (which could be good or bad).

koenigdavidmj 12 years ago | | |

Then you can use

  key=operator.methodcaller("lower")

and handle any type.

_ZeD_ 12 years ago | |

witch is a little different thing.

python can sort according to a function (using the "key" parameter) or using a custom comparator function (using the "cmp" parameter)

as a side note, Python offers also a sorted[0] function, applyable to any sequence

[0] http://docs.python.org/2/library/functions.html#sorted

orblivion 12 years ago | | |

I don't get the difference.

comex 12 years ago | | |

And incomprehensibly, Python 3 removed the cmp argument, leaving only unwary sort.

orblivion 12 years ago |

One thing this article does not cover is that, even if a comparison function doesn't do anything slow, there is still the fact that it still has to do a Perl function call O(n log n) times. At least in the equivalent with Python, I think it is advised to use the key function rather than comparison funtion for this reason, for speed. Though I guess a key function needs to take more space if it is to memoize.

raiph 12 years ago | |

A one arg closure to the P6 sort builtin (eg { .lc }) is a key function, not a comparison function. Imo the P6 sort builtin is an elegant rethink of P5's Schwartzian Transform, which was invented in 1994 to address precisely the point you make.

orblivion 12 years ago | | |

I agree, this wasn't a point about Python vs Perl.

All I'm saying is that the author bills it as not wanting to run { .lc } twice per comparison. What I'm adding is that the comparison function itself, even if trivial, arguably has a bit of an overhead just from calling it. Thus, having O(n) calls to a key function { .lc } may be better than O(n log n) calls to a comparison function {.lc <=> .lc}.

At least that's how people convinced me to use key= instead of cmp= in Python.

Grue3 12 years ago |

So, it's a feature that has been available in every high-order language since forever, except with more line noise. Typical Perl!

http://www.ai.mit.edu/projects/iiip/doc/CommonLISP/HyperSpec...

tarpden 12 years ago |

While it appears that Perl 6 has many impressive features, I'm much more interested in practical matters such as: module versioning, installation, & removal; the state of MoarVM; and the state of the tutorial book.

raiph 12 years ago | |

After years of the spec being nice but the implementations being very basic, the module versioning and installation implementation in Rakudo is finally shaping up. I anticipate it getting pretty robust over the next 6 months. Imo FROGGS' day 11 advent article is poorly written but might be useful: http://perl6advent.wordpress.com/2013/12/11/day-11-installin...

Rakudo/MoarVM began running this month. As Larry Wall said a few days ago "failed 179 test files ... which ain't too shabby at this point" http://irclog.perlgeek.de/moarvm/2013-12-21#i_8029074

Afaik the Perl 6 book is not really being updated. Maybe this info is of interest to you: http://www.perlmonks.org/?node_id=1033899