Nav links

Monday, 20 June 2005

Followup to Prevalence of Americanisms

Following the interesting results I found in Prevalence of Americanisms I wrote a short Perl script to query Google using their web API.

For the previous results I used Google with the site: operator to restrict the results to particular countries based on the server's domain. Using their web API I was able to target these results more specifically, using an option which determines site locations based on the "top level domain name of the server and the geographic location of the server IP address". This would now correctly identify those numerous sites in the .com space but which reside in other countries, presumably giving more accurate results for my survey. In addition, I could now include US sites, which are impossible to distinguish based on domain name alone.











































































































































































 UKAustraliaCanadaUS
aeroplane / airplane1:1.61:1.41:181:20
aluminium / aluminum2:15.4:11:21:11
burgled / burglarized14:15:11:1.51:2
car boot / car trunk100:15:11:21:4
car park / parking lot14:114:11:211:8
colour / color5:12.5:11:1.51:16
estate agent / realtor13:140:11:1.21:2
father christmas / santa claus1:1.11:6.51:131:12
forthcoming / upcoming3:11:1.31:71:7
frying pan / skillet2:14:11:21:1.7
fortnight / two weeks*1:1.21:41:111:25
flatmate / roommate9:13:11:341:44
gaol / jail1:5.71:5.31:491:78
infant school / kindergarten1.7:11:511:1731:300
liquorice / licorice2.8:11:1.61:91:9
maths / math2.5:11:11:301:71
mobile phone / cell phone2:17:11:2.21:2.8
nappy / diaper3:13:11:8.41:7.8
pavement / sidewalk9:14:11:1.31:1
spelt / spelled1.2:11.3:11:61:7
theatre / theater7:16.6:14:11:1.7
tv aerial / tv antenna7:11:41:61:16
write to us at / write us at54:112:11.3:11.6:1


* fortnight / two weeks isn't a strict pairing, because two weeks can often be used in UK english where fortnight is not applicable. However, it's still useful as a comparison between countries.

The Australian and UK results (the languages with which I am more familiar) look to be more accurate than the previous table.

I wanted to put up an online tool to enable anyone to determine the ratios for any words, but I don't have the web server facilities at the moment.

I didn't know that cell phone was so popular in the UK, though I have been noticing its increasing use on the BBC's international services.

Note that I had to use "write us as" rather than "write us" so as to exclude some other common usages such as "write us a letter". For example, "write me a letter soon" is normal UK english, but "write me soon" is not.

Updated 20 June: Fixed tv aerial / tv antenna results. Added write to us at / write us at, mobile phone / cell phone pairs.

Updated 30 June: Added aeroplane / airplane, father christmas / santa claus, gaol / jail, infant school / kindergarten, liquorice / licorice, maths / math pairs.

Updated 10 July: Added many more pairs, and tried (unsuccessfully) to make the table layout more readable. I should really colour-code each entry, with shades indicating how great the ratio is.