·  Why does it play clusters of artists or albums?
 (Entry last updated on June 16th, 2003)

It doesn't. At least not on purpose, and not because of any bugs in its randomization routines.

Assuming the following...

  • You have selected "Random" as its shuffle mode instead of one of the weighted shuffles, and you're really shuffling the whole contents of the player.

  • Your collection is reasonably varied; you're not talking about an artist who takes up a majority of your collection, like all the Rush albums in my collection.

  • Your collection is large enough so that you can reasonably expect a pretty good random shuffle each time. Say, a couple thousand songs at least.

  • You're not doing something special to the playlists to cause certain artists to be weighted differently than others, such as deactivating the "de-dupe" feature or using the special features of the playlists to deliberately skew the weighting.

... then the shuffle is truly random and it does not cluster artists or albums any more than it should.

What we're talking about here is perception. Any time the player plays a few songs by the same artist within, say, a stretch of ten songs in one sitting, you're automatically going to complain and say, "This thing's playing nothing but Rush today, it must be a bug."

But that's just your perception based on that particular session. Maybe your "session" was ten songs while driving to and from work that day. Or maybe even a few hours' worth of songs on a long drive. For any given subsection of a large shuffle, any perceived patterns which happen to fall into that session will stand out as unusual. It's like four aces coming up in a row at the blackjack table: your immediate reaction is to assume the dealer is cheating.

But what you're not seeing is how the songs are actually distributed across the whole shuffle. At the blackjack table, you don't see the entire shoe of all six decks at once; you only see one hand's worth of cards at a time. Likewise, you never hear the player's entire shuffle in one sitting. So patterns that appear in that short sample aren't representative of the true distribution across the entire shuffle set.

When you look at the kind of collection that's typically loaded on the car player (usually a few thousand total songs, with a few albums by each artist, and about a dozen songs in each album), then it turns out that the kinds of patterns that you'd perceive as unusual are not only possible, but they're actually statistically frequent. Even unusual patterns, like "combs" of artists (Rush song, 2 random songs, Rush song, 2 random songs, etc.) are common with this kind of sample set.
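
If you'd like to check this for yourself, here is a minimal Python sketch of the idea. It is not the player's code, and everything in it is an assumption made for illustration: a made-up library of 50 artists with 60 songs each (3000 songs), a plain uniform shuffle, and a "cluster" defined as three or more songs by one artist within any stretch of ten consecutive songs. The function names are hypothetical too.

```python
import random
from collections import Counter


def make_library(num_artists=50, songs_per_artist=60):
    """Hypothetical library: one artist label per song (50 x 60 = 3000 songs)."""
    return [artist for artist in range(num_artists)
            for _ in range(songs_per_artist)]


def has_cluster(window, min_repeats=3):
    """True if any one artist appears at least min_repeats times in window."""
    return max(Counter(window).values()) >= min_repeats


def count_cluster_windows(playlist, window_len=10, min_repeats=3):
    """Count the sliding windows of window_len songs that contain a cluster."""
    return sum(
        has_cluster(playlist[start:start + window_len], min_repeats)
        for start in range(len(playlist) - window_len + 1)
    )


if __name__ == "__main__":
    rng = random.Random(2003)      # fixed seed so runs are repeatable
    playlist = make_library()
    rng.shuffle(playlist)          # uniform shuffle, no weighting
    windows = len(playlist) - 10 + 1
    clusters = count_cluster_windows(playlist)
    print(f"{clusters} of {windows} ten-song stretches "
          "have 3+ songs by one artist")
```

Under these assumptions, any single ten-song stretch has only a few percent chance of containing such a cluster, but a full shuffle contains thousands of overlapping stretches, so you should expect to run into quite a few of them over the life of one shuffle. Change the seed, the library shape, or the window size to see how the count moves.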

Still not convinced? Jeff Sylvester, in a discussion on the Unofficial Empeg BBS, wrote a program to graph this very phenomenon. With this program, you can clearly see how a truly random distribution will produce exactly these kinds of perceived "patterns".

