please empty your brain below

A couple of years ago I noticed a cluster of stations that started with the same first letter, but can't remember where. Help!

dg writes: Blogged here.
This all seems very familiar 😄
There are 23 consecutive stations on the 1 in New York in numerical order from 14th Street to 191st Street.
I am unsure that your comment of '... next station only has a 50/50 chance of following on ...' under 'Stations in alphabetical order' is true. The coin tossing analogy is correct but each station has its own probability of being followed by a station further along the alphabet. In the case of Wood Green this probability would be very low. The 50/50 chance may be true averaged out across the population of stations but just as the geometric mean of a set of numbers is always less than the arithmetical mean, my instinct is that in this case the chance of finding say 6 successive stations is far less than the chance of 6 successive heads or tails each being 1 in 2 to the power 6 ie 1 in 64. Help me mathematicians!
Richmond St Margarets Twickenham

dg writes: Added, thanks.
You may have come across the quiz question: which underground station has a name containing none of the letters in the word mackerel?
I’ve often wondered who created it and how.
Dg was it you??

dg writes: No, it dates back to the 1980s.
Seven in alphabetical order. Barnes, Mortlake, North Sheen, Richmond, St Margaret's, Twickenham, Whitton

dg writes: Added, thanks.
Anorak porn.

As to the 50/50 thing, there is also how frequently different letters in the alphabet are used, although a map of railways means that these results may well be different from normal, because of all the station names that contain 'Central', 'East', 'North', 'South', 'Town', 'West' etc.
Interestingly, there are 6 underground stations in a row on the Picadilly Line beginning with the letter H. Drum roll....
Hounslow East, Hounslow Central, Hounslow West, Hatton Cross, Heathrow Terminal 4, and lastly Heathrow Terminals 1,2,3.
Ken is this the run of stations you mention?

dg writes: Famously, there are 10.
Re: reverse alphabetical order: just travel in the other direction on one of the alphabetical order journeys!
@anon 10:01

Not necessarily: after leaving Moorfields on the way back to Bidston you go round the loop, passing through Liverpool Lime Street and Liverpool Central, before reaching James Street. Thus there are nine (not seven) in reverse alphabetical order
Well that's one way to pass the days of gloomy weather we've been having!! :D
Despite the large numbers of journeys, only once have you ended at Mornington Crescent.

Is this due to enforcement of the Bow Omnibus Halt Sign M (alternate Wednesdays) Rule?
@DG, @Colin

Yes, and yes. Thanks
If you're not worried about the letters being in order, Vauxhall to Weybridge contains every letter from A to Y.
I think the commenter who referred to geometric means above is partly correct. But the probabilities are not independent, so multiplying them could be dodgy.
However, DG said "This is similar to flipping a coin until you get a different result, so a long chain isn't very likely" The only doubtful word here is "so". A long chain isn't likely, and his argument might help to convince us why. The arithmetic/geometric business makes it even less likely, so we have an a fortiori situation.
@Gregg

If you take the direct line to Weybridge, you'll not call at any station with a Q in it.

But if you take the other route, you don't need to go any further than Brentford to find all the letters except Z.
timbo makes it sound like Gregg is wrong.
But the first station after Vauxhall is Queenstown Road (Battersea), which has a Q in it, and Gregg is correct.
The odds are not 50/50 each time. If you take two random stations, the odds are 3/1 that at least one of them (the later one) is in the second half of the alphabet. The odds of the next one down the line being even further towards the end of the alphabet are less than half. As your chain progresses, the number of available stations gets shorter and shorter, so the odds against get longer and longer.

Letter frequency makes no difference, as it merely the stations' positions in the sequence that matters.
Holy crap!! And here I thought I was bored here at work today. I, apparently, need to step up my game.
Splendid stuff. I can only express my admiration to all.
I cannot add anything meaningful to this post, so will probably incur a type character colour change.
But this is what living on the edge is all about.
Timbo said - "If you take two random stations, the odds are 3/1 that at least one of them (the later one) is in the second half of the alphabet."

That's not true, because stations aren't divided equally between the two halves of the alphabet.

For example, 36 more tube stations start with a letter from the first half of the alphabet than the second half. In this particular case the odds Timbo mentions are near enough 2/1, not 3/1.
A similar exercise on foot is much easier of course as there are quite a few alphabetical runs of road names.
Correction. The second half of the alphabetical list of stations, which is what actually matters.
If we assume the alphabetical position of station names is independent (i.e. for any station X and Y, P(X comes before Y) = 50%) then we can calculate probabilities. To simplify matters assume also that no station name is repeated. That means for any line of n stations we can replace the name with a number between 1 to n.

We can now view this as a permutation problem. Given a shuffle of the numbers 1 to n (with all permutations equally likely), what is the probability the shuffle is in order. If n=1, then the probability is 1. Otherwise there are 2 ordered shuffles (1 2 3 ... and n n-1 .. 2 1) with n! possible shuffles. So for n>=2, the probability is 2/n!.

For a wordy approach, consider a race with each runner equally likely to finish first. If one runner isn't recorded on the timing screens due to a technical issue, even if we know the relative position of all the other runners on the timing screen that one runner is still as likely to have won as before. So in a race of n runners, there is a 1/n chance the runner not on the screen finished first and (n-1)/n chance the leader on the timing screens finished first. The other runners on the timing screen have 0 probability of being first - we know someone who ran it faster.

To then consider a chain of stations, given a chain of n ordered stations, the chance the next one beats (either later if chain is A-Z or earlier if Z-A) is 1/(n+1) - this is both necessary and sufficient for it to be a chain of n+1 stations. So with 2 ordered stations, it's 1/3 chance we then have a chain of 3, 1/4 given a chain of 3 we have a chain of 4, i.e. 2/n! as above.

Note 1: With duplicate station names, it's easiest to give them a unique position in the numerical ordering and then add the probability that only the duplicate station numbers were out of place. This would double the probability if one station was duplicated in the chain, but having two stations with the same name adjacent and treating them as different entities for this seems ridiculous.

Note 2: The less mathematical approach is more often considered as how likely you are to get a personal best if your ability remains constant. (Assumes your results could be strictly ordered best to worst) the probability of your nth try being your personal best is 1/n. At any point, you'd expect at least one more personal best if you tried enough further attempts since 1 + 1/2 + 1/3 + ... never converges.
OK, so if I've got this right...

2 stations are always in alphabetical order.

3 stations are in alphabetical order one third of the time (because there are 6 possible combinations, two of which are in order)
ABC✓ ACB✗ BAC✗ BCA✗ CAB✗ CBA✓

4 stations are in alphabetical order one twelfth of the time (because there are 24 possible combinations, two of which are in order).

The general case is that only two of the permutations can be in order, (one forwards, one backwards), hence the probability is 2/n!.

2 stations: 2/2 = 1
3 stations: 2/6 = 1/3
4 stations: 2/24 = 1/12
5 stations: 2/120 = 1/60
6 stations: 2/720 = 1/360
7 stations: 2/2520 = 1/1260

7 stations should only appear in order 0.08% of the time... but there are a heck of a lot of chains of 7 stations on the London rail map, which is why it manages to occur once.
...and no coins should be flipped.
I'm from Bridgend! This never occurred to me though.
Whoosh! The sound of maffs flying past my head!!
I'll stick to drawing straight lines on a map. I can do that!
@Gregg

Stratford-on-Avon (Parkway) to Jewellery Quarter covers all 26 letters of the alphabet in 20 stops
@Timbo - your route omits X
On the same line as Lee-Mottingham-New Eltham after Sidcup there is Albany Park-Bexley-Crayford-Dartford










TridentScan | Privacy Policy