Difference between revisions of "Talk:1683: Digital Data"

Explain xkcd: It's 'cause you're dumb.
Jump to: navigation, search
Line 8: Line 8:
 
Here is the decoded title text:
 
Here is the decoded title text:
 
     “If you can read this, congratulations–the archive youʼre you're using still knows about the mouseover text”!
 
     “If you can read this, congratulations–the archive youʼre you're using still knows about the mouseover text”!
 +
 +
:Grungy details:
 +
:*“ -> ''convert to hex'' -> E2-80-9C -> ''UTF8 decode'' -> 0010-000000-011100 -> [http://www.fileformat.info/info/unicode/char/201C/index.htm U-201C "LEFT DOUBLE QUOTATION MARK"]
 +
:*— -> ''convert to hex'' -> E2-80-94 -> ''UTF8 decode'' -> 0010-000000-010100 -> [http://www.fileformat.info/info/unicode/char/2014/index.htm U-2014 "EM DASH"]
 +
:*’ -> ''convert to hex'' -> E2-80-99 -> ''UTF8 decode'' -> 0010-000000-011001 -> [http://www.fileformat.info/info/unicode/char/2019/index.htm U-2019 "RIGHT SINGLE QUOTATION MARK"]
 +
:*”! -> ''convert to hex'' -> E2-80-9D -> ''UTF8 decode'' -> 0010-000000-011101 -> [http://www.fileformat.info/info/unicode/char/201d/index.htm U-201D "RIGHT DOUBLE QUOTATION MARK"]
 +
:[[User:Odysseus654|Odysseus654]] ([[User talk:Odysseus654|talk]]) 17:31, 20 May 2016 (UTC)
  
 
He's written you're twice, but one is with a curly apostrophe, often favoured by americans (and maybe brits?), possible because of their keyboard. The simple apostrophe is “just” html-formatted, whereas the curly one has been molested by a UTF-8 / ISO-8859-1 misreading.
 
He's written you're twice, but one is with a curly apostrophe, often favoured by americans (and maybe brits?), possible because of their keyboard. The simple apostrophe is “just” html-formatted, whereas the curly one has been molested by a UTF-8 / ISO-8859-1 misreading.

Revision as of 17:31, 20 May 2016

Ewww, Verizon? **** them International Space Station (talk) 04:58, 20 May 2016 (UTC)

Ironically, the title text on explainxkcd is different from the one on xkcd.com, demonstrating the reinterpretation of text encoded in UTF-8 as if it were encoded in ISO 8859-1. 162.158.85.231 05:45, 20 May 2016 (UTC)

-Exactly; this nicely proves Randall's point. On my computer, different characters appear in different browsers, but of course in one browser the characters are reproducible.--Jkrstrt (talk) 07:26, 20 May 2016 (UTC)

Here is the decoded title text:

   “If you can read this, congratulations–the archive youʼre you're using still knows about the mouseover text”!
Grungy details:
Odysseus654 (talk) 17:31, 20 May 2016 (UTC)

He's written you're twice, but one is with a curly apostrophe, often favoured by americans (and maybe brits?), possible because of their keyboard. The simple apostrophe is “just” html-formatted, whereas the curly one has been molested by a UTF-8 / ISO-8859-1 misreading. --108.162.229.16 07:51, 20 May 2016 (UTC)

I'm British, and I don't have the curly apostrophe anywhere on my keyboard. Enchantedsleeper (talk) 11:01, 20 May 2016 (UTC)
I'm American, and I also don't have the curly apostrophe anywhere on my keyboard, but word processing programs (like MS-Word) are configured by default to automatically replace an ASCII apostrophe in a conjunction with the fancy right-single-quote mark. Also when using quotation marks around text those programs automatically replace the repeated single ASCII quotation marks with the fancy left and right quotation marks (single if using single quotes, double if using double quotes). Most people don't care enough to disable that "feature"... 162.158.252.143 15:13, 20 May 2016 (UTC)

This is a phenomenon that has always both fascinated me and frustrated me. I find it fascinating how, even today, data degrades as more and more people copy it (remember the old days when people used to copy VHS tapes, and the further you were from the original tape the more copying artefacts your copy had in it?). It also frustrates me, though, when I'm trying to find an original, undegraded image or video and it seems impossible to find. It's also annoying because it's actually pretty easy to copy something without causing any quality loss, yet practically every copied image on the internet has been degraded in some way or another. 141.101.98.130 07:08, 20 May 2016 (UTC)

If you haven't yet, you should check out this guy who ripped and reuploaded his own Youtube video 1000 times: https://www.youtube.com/watch?v=jEIzS_27Vt0 162.158.222.150 08:28, 20 May 2016 (UTC)
...and after 100 iterations https://www.youtube.com/watch?v=k6GMvihskBQ ...and the summary of all of them https://www.youtube.com/watch?v=icruGcSsPp0 Odysseus654 (talk) 16:50, 20 May 2016 (UTC)
It can be frustrating to try to convince new people drawing schematics on the computer to not use 4-way junctions because they don't expect digital images to degrade over multiple generations of copying. This xkcd demonstrates the way multiple generations can degrade even digital images, potentially making it difficult to differentiate two crossing (but electrically separate) signal lines from a 4-way junction on a schematic. Sorry, I'll get off my soap box now. ;-) 162.158.252.143 15:13, 20 May 2016 (UTC)

It's also funny because just a few moments ago I was trying to compress some video to send to someone. 141.101.98.130 07:12, 20 May 2016 (UTC)

http://fotoforensics.com/analysis.php?id=274fcf46426f2da31b057f1652ae5269cfdbd70a.190103 this page highlights the encoding blocks so that the degration of quality can be seen better. 141.101.91.205 09:42, 20 May 2016 (UTC)

The phenomenon that Randall is making fun of in this comic is actually called a "shitpic" http://www.theawl.com/2014/12/the-triumphant-rise-of-the-shitpic The explanation should probably make reference to that. Enchantedsleeper (talk) 10:57, 20 May 2016 (UTC)

I think the watermarks on the last frame are from an unregistered screenshot tool, not "9gag" or similar. The references to shit pics are interesting, but aren't you over interpreting the whole thing? 162.158.83.174 (talk) (please sign your comments with ~~~~)

There's a 9gag thing in the image, clean your glasses and look again. --173.245.54.46 12:15, 20 May 2016 (UTC)

Both screenshots from iOS definitely. Safari browser and… anybody knows? Some kind of other web browser? Maybe Chrome or Opera? <Need to finally create account> 162.158.202.152 15:32, 20 May 2016 (UTC)

Apparently Russians have been getting this a lot, as they (up to the point of the existence of UNICODE) have had to deal a lot with people using bad codepages. Example of their post office dealing with a physical package addressed with a bad codepage: http://worldlanguages.wikia.com/wiki/Mojibake?file=Letter_to_Russia_with_krokozyabry.jpg Odysseus654 (talk) 16:54, 20 May 2016 (UTC)