Editing Talk:1571: Car Model Names

Jump to: navigation, search
Ambox notice.png Please sign your posts with ~~~~

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.
Latest revision Your text
Line 1: Line 1:
 
Suzuki Sexism kinda has a ring to it... [[User:Bbruzzo|Bbruzzo]] ([[User talk:Bbruzzo|talk]]) 14:39, 31 August 2015 (UTC)
 
Suzuki Sexism kinda has a ring to it... [[User:Bbruzzo|Bbruzzo]] ([[User talk:Bbruzzo|talk]]) 14:39, 31 August 2015 (UTC)
  
I like the sound of the Hyundai Climax. I'd drive that. [[User:DollarStoreBa'al]], 11 March 2024 [[User:DollarStoreBa'al|DollarStoreBa'al]] ([[User talk:DollarStoreBa'al|talk]]) 16:35, 11 March 2024 (UTC)
+
== Averages ==
 
 
Worth noting that there actually was an engine manufacturer named "Coventry Climax", who produced a range of racing engines and specialty machinery like forklift trucks.  Coventry Climax's engine works were eventually bought out by Jaguar Cars in the 1960s. {{unsigned ip|141.101.98.154}}
 
 
 
Considering the existence of the Civic RX and the CR-V EX, Cervixxx should have been a Honda model. - [[User:Frankie|Frankie]] ([[User talk:Frankie|talk]]) 16:44, 2 September 2015 (UTC)
 
 
 
A simple Lua script I wrote to calculate these ratings: http://pastebin.ubuntu.com/12259822/
 
Run it with your favorite Lua interpreter, and it should ask for a name. [[Special:Contributions/108.162.216.160|108.162.216.160]] 03:01, 3 September 2015 (UTC)
 
 
 
For people who don't have a favorite Lua interpreter, here's a simple python3 script I wrote : https://gist.github.com/Syncrossus/dd69d185d9af39d84f8a600871b27691
 
You can run it with a model name as command line argument, or run it with no argument and it will prompt you for model names. [[Special:Contributions/141.101.69.105|141.101.69.105]] 14:22, 3 June 2019 (UTC)
 
 
 
Interestingly, "xkcd" has a high score of 4.1. {{unsigned ip|199.27.129.59}}
 
 
 
:Could be a Jaguar, as homage to the XKS. --[[User:SaturNine|SaturNine]] ([[User talk:SaturNine|talk]]) 12:25, 7 September 2015 (UTC)
 
 
 
;Scores
 
  
 
Anyone know how the averages are calculated? I tried a couple but I don't arrive at the same numbers:
 
Anyone know how the averages are calculated? I tried a couple but I don't arrive at the same numbers:
Line 62: Line 46:
 
:Forgot to add what I meant to put here...  Apostrophes.  Very rare in car names (just the {{w|Kia_Cee%27d|Kia Cee'd}}), fairly often (over)used in standard English text.  I wonder what its value is?  (Not as easily 'assume it's a letter' as the x/times symbol.) [[Special:Contributions/141.101.99.108|141.101.99.108]] 01:44, 1 September 2015 (UTC)
 
:Forgot to add what I meant to put here...  Apostrophes.  Very rare in car names (just the {{w|Kia_Cee%27d|Kia Cee'd}}), fairly often (over)used in standard English text.  I wonder what its value is?  (Not as easily 'assume it's a letter' as the x/times symbol.) [[Special:Contributions/141.101.99.108|141.101.99.108]] 01:44, 1 September 2015 (UTC)
  
;Order of the scores
+
== Typo or Deliberate? ==
There are two possible explanations
 
;Score(x) = Frequency_in_cars(x) - Frequency_in_English(x)
 
I'm pretty sure it's a comparative scale between cars and English, not just a car-like/not-car-like scale.
 
:Randall uses positive numbers if a letter is more common in car models than in typical English (as X) which he then calls carlike. He used negative numbers if a letter's relative frequency in car models is lower than in typical English (as O) and he calls it English-like (more suitable for readable text). The letters F and B, with scores of 5 and -5, respectively, are about as common in English as in car models. With this nomenclature, the most English-like letter is Y because, while not the most common English letter, it is apparently extremely rare in car models.
 
 
 
;Score(x) = Frequency_in_cars(x)
 
English has no relationship with the score
 
:It seems that Randall arbitrarily used positive and negative numbers: if a letter is very common in car models (as X) he calls it carlike. If a letter is very uncommon in car models (as O) he calls it English-like. With this nomenclature the most English-like letter is Y, but actually Y is the least carlike letter. The most common letter in ordinary English is E. Y on the other hand is just in the middle (place 13), which can't be called English-like.
 
[[User:Xhfz|Xhfz]] ([[User talk:Xhfz|talk]]) 12:56, 1 September 2015 (UTC)
 
::"Y (...) can't be called English-like".  Well, it can be, as it's not uncommon.  And on the relative scale, it's much more indicative of being English than it is of being a car.  And I'm going to give the explanation a further tweak, I think, hopefully small and agreeable.  Also don't think the reversion helped (without checking the edit-changes), it was almost right. [[Special:Contributions/141.101.99.108|141.101.99.108]] 13:24, 1 September 2015 (UTC)
 
Now I understood your idea. I think I tweaked it to be more understandable. X is a letter that supports your claim. [[User:Xhfz|Xhfz]] ([[User talk:Xhfz|talk]]) 13:41, 1 September 2015 (UTC)
 
 
 
I'd like to suggest a third possibility, I figured it was a ratio: Score(x) = 100*(Frecuency_in_cars(x) / Frequency_in_English(x) - 1).  This allows numbers to be negative or positive and would explain the questions raised above. [[User:Djbrasier|Djbrasier]] ([[User talk:Djbrasier|talk]]) 13:53, 1 September 2015 (UTC)
 
 
 
Well, my "little tweak" became a big overhaul, then edit-conflicted.  For the record, it became the following monstrosity:
 
Scores for letters and numbers are presumably taken from their frequency in car models. [[Randall]] doubtless analysed a car-name database, in a manner similar to that used to derive the {{w|https://en.wikipedia.org/wiki/Letter_frequency#Relative_frequencies_of_letters_in_the_English_language|letter frequency statistics for written English}} against which the former seems to have been compared.  From these, letters that appeared equally commonly in both lists (either rare or frequent, but consistently between the two) would have been given a hypothetical value of zero, whilst ones that were almost exclusively in one medium would have a high-magnitude score; positive for more car-like and negative for more English-like.
 
Without the raw car-letter frequency data it's hard to derive the exact formula used, but taking the mathematical {{w|Logarithm|log value}} of a ratio would give us zero for 1:1 (equally car-like and English-like) and high positive/negative values for comparisons more skewed more towards the former/latter.
 
The closest letters to zero in the comic are F at +5 and B at -5 and may hover somewhere around the same ratios in car-names as in English (around 2.2% and 1.4% of total usage in the above link), with just a slight car/English dominance.  The most 'car-like' letter is X, that seems to be quite common in cars whilst very rare (<1% of usage) in English.
 
The most 'English-like' letter in the comic is Y with a score of -90.  Y is not common in English (~2%), but presumably even more disproportionately uncommon in car names.  The next most 'English-like' letter, O, with a given score of -80.  It is significantly more frequent in English (~7.5%, and perhaps the fourth most encountered individual letter), and so is likely also more frequent in the raw car-name data, alone, albeit similarly much less than 'expected' from its English occurances.
 
It makes some sense that rarer English letters are over-chosen (for the novelty and stand-out effect) for car names, at the general expense of several commoner English letters without particular bias, thus the highest positive peak is greater in magnitude than the lowest negative trough.  Although you could also point out that 'x' (used for 'times') is also a more useful car-name 'letter', whilst the letter O might be surpressed in alphanumeric sequences so as not to be confused with a zero.
 
When looking at the numbers in the table, Randall's analysis may have dealt with the decimal digits entirely seperately, based upon something like {{w|Benford%27s_law|Benford's Law}} for the natural occurance of numbers in common data, rather than from their disproportionately rare occurance within largely alphabetic English.  It is thus not unexpected that the 1 that is most common in data is underepresented within numbers in car-names, whilst sub-avearge 5 becomes a 'power number' in the world of cars, and the third most car-like character in the comic.
 
There are 19 positive scores and 17 negative scores.  They each add up to a score of 735 and -722, respectively, with the grand total being +13, suggesting that without rounding errors the whole system could have a neutral score.  The numbers alone  give a total offset of -0, the letters alone thus account for a not particularly unreasonable +0.5 'error' per character, and may also support the idea of separate analyses of these two sets.
 
...there was no easy way to resolve the differences, so the above is FYI.  ('''TLDR: perhaps it's a Log function?''')  In editing it down, I'd also had another bit:
 
The letters I and T may appear in non-word model-name strings to represent "Injection" and "Turbo", respectively, but with their overwhelming commonality already in English text they still appear ''more'' more in English than in cars.
 
...which was looked less useful and too wordy even for me, but might also be a useful fragment to consider. [[Special:Contributions/141.101.99.108|141.101.99.108]] 15:09, 1 September 2015 (UTC)
 
 
 
;Typo or Deliberate?
 
  
 
Randall gave REV-4 as an example car name. Did he accidentally misspell the (Toyota) RAV4, or was this a deliberate reference to chapter 4 of Revelations?--[[Special:Contributions/173.245.54.26|173.245.54.26]] 02:31, 1 September 2015 (UTC)
 
Randall gave REV-4 as an example car name. Did he accidentally misspell the (Toyota) RAV4, or was this a deliberate reference to chapter 4 of Revelations?--[[Special:Contributions/173.245.54.26|173.245.54.26]] 02:31, 1 September 2015 (UTC)
  
;Old Goths
+
== Old Goths ==
  
 
49 is a reasonable age for those who grew up Goth in the 80s, just sayin'.
 
49 is a reasonable age for those who grew up Goth in the 80s, just sayin'.
Line 100: Line 57:
 
I thought this too. It could be a joke on a youth sub-culture growing up (old).
 
I thought this too. It could be a joke on a youth sub-culture growing up (old).
 
-- [[Special:Contributions/108.162.229.157|108.162.229.157]] 11:28, 1 September 2015 (UTC)
 
-- [[Special:Contributions/108.162.229.157|108.162.229.157]] 11:28, 1 September 2015 (UTC)
 
;'Quick' and Dirty Car Data
 
Examining {{w|List_of_automobile_sales_by_model|this page}}, which has notable exceptions (I specifically looked for the Toyota '''Y'''aris and the Kia Cee'd, neither of which were there), using a quick script to isolate the car names, a lengthy ''manual'' process of sanitising all the exceptions the quick script couldn't handle and then another script to analyse letter frequencies of the model names (''not'' the make/marque part), I came up with the following undefinitive data, that is almost certainly flawed but may yet be useful:
 
<spaces> = 85 (but this count of whitespace may not be accurate and is superfluous...
 
& = 1  (...as are these first four items of punctuation, given their absence from Randall's chart)
 
- = 23
 
. = 3
 
/ = 10
 
0 = 104
 
1 = 73
 
2 = 54
 
3 = 43
 
4 = 35
 
5 = 54
 
6 = 35
 
7 = 18
 
8 = 26
 
9 = 17
 
A = 231 (includes à)
 
B = 30
 
C = 95
 
D = 54
 
E = 210 (includes é and ë)
 
F = 46
 
G = 52
 
H = 18
 
I = 122
 
J = 12
 
K = 13
 
L = 113
 
M = 83
 
N = 99
 
O = 145 (includes ó)
 
P = 80
 
Q = 4
 
R = 202
 
S = 127 (includes Š)
 
T = 166
 
U = 45
 
V = 38
 
W = 19
 
X = 25
 
Y = 33
 
Z = 14
 
Comparing just B and F (natural frequency 1.4% and 2.2%, above 30 to 46, both instances being approximately 1:1.5 when comparing the two letters within the same source), this matches the similarly close-to-zero scores given to them by Randall.  O vs. Y is 4.4:1, above, real life is 3.8:1 and adjusting for O being 1/9th 'more carlike' we get a similar value.  But Z vs J is 7:6, real life it's 1:2 and I can't reconcile that with the 1.3:1 on Randall's chart.  Probably indicates something non-linear (e.g. a log function) along the way, if O:Y wasn't so easy to distinguish.  Might, of course, be a differently biased dataset and thus GIGO. [[Special:Contributions/141.101.99.108|141.101.99.108]] 00:35, 2 September 2015 (UTC)
 
 
I thought that R would be used quite frequently.. (i.e Audi RS5). {{unsigned|Thomas 633}}
 
 
Surprised nobody mentioned before now the irony of Lincoln, the late 20th C. status symbol luxury vehicle, being paired with Marxism.--[[User:SaturNine|SaturNine]] ([[User talk:SaturNine|talk]]) 12:36, 7 September 2015 (UTC)
 
 
Marx wouldn't be happy having his name associated with corporate scum. Then again, the Lincoln part of it makes sense since Marx and Lincoln wrote to each other a lot. I hate corporatism, but there isn't really an "open" car like there are open softwares or hardwares for laptops and desktops. Stuck buying corporate boojie trash for now. If this so-called "Lincoln Marxism" was an electric or hybrid, I'd consider it because of its name. IFL Marx haha :P I'd think "Bolshevik" would also score high... Lenin, maybe? CHOICeS x'D [[User:International Space Station|International Space Station]] ([[User talk:International Space Station|talk]]) 04:35, 22 April 2016 (UTC
 
 
"Doody" means faeces where you come from??? Interesting. I only came across it as meaning a baby's soother -- not sure what that's called over your way, maybe a dummy? Anyway 'doody' is what you get if someone who is too young to pronounce S properly tries to.say 'soother' (especially if they are sacking on one at the same time)
 
[[Special:Contributions/162.158.38.166|162.158.38.166]] 07:27, 6 August 2019 (UTC)
 
 
It's ridiculous how much Toyota Cervix sounds like a real car model. [[Special:Contributions/172.71.10.213|172.71.10.213]] 22:37, 19 April 2023 (UTC)
 

Please note that all contributions to explain xkcd may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see explain xkcd:Copyrights for details). Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel | Editing help (opens in new window)

Templates used on this page: