Editing 1683: Digital Data

Jump to: navigation, search

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.
Latest revision Your text
Line 10: Line 10:
 
{{w|Digital information}} has the potential to be copied such that the copy is 100% identical to the original. While physical media themselves (such as books, or hard drives) and information stored by analog means may degrade as the universe continues, digital information as expressed by specific values, such as combinations of binary zeros and ones, does not decay over time and can be copied indefinitely with no changes.
 
{{w|Digital information}} has the potential to be copied such that the copy is 100% identical to the original. While physical media themselves (such as books, or hard drives) and information stored by analog means may degrade as the universe continues, digital information as expressed by specific values, such as combinations of binary zeros and ones, does not decay over time and can be copied indefinitely with no changes.
  
However, in this comic, [[Randall]] points out that while digital information itself doesn't need to degrade, things that are on the Internet are often degraded through copying when the copy is not a 1:1 copy or changes are deliberately introduced. In addition, as technology advances, the method to save or call the information changes and the medium to view it changes, occasionally causing misinterpreted information. (This is also demonstrated with the mouseover text.) As the frames continue, they gain the appearance of images which have been screenshotted repeatedly, with a resulting loss of quality due to compression of the original resolution and {{w|JPEG}} {{w|compression artifact|artifacting}}. (The JPEG format is intended for representing photorealistic grayscale or color images; when misused for line drawings, such as comic strips, any compression artifacts become particularly noticeable, as the background is normally of completely uniform color.) In the last frame, this is taken to an extreme, as the frame appears to have been very sloppily screenshotted off of at least two different smartphones (not the same device that uses the bottom frame in the third panel as the top border in panel four), and the final image is covered both with a watermark from an unregistered screenshot program, as well as references to at least two different web sites: {{w|9GAG}} (bottom right image) and {{w|Tumblr}} in the web address bottom left. 9GAG is an online platform and social media website where users upload and share content of their own, or of other networks. It is often accused of rehosting other sites' funny content without attribution and adding their own watermark to the image or video.
+
However, in this comic, [[Randall]] points out that while digital information itself doesn't need to degrade, things that are on the Internet are often degraded through copying when the copy is not a 1:1 copy or changes are deliberately introduced. In addition, as technology advances, the method to save or call the information changes and the medium to view it changes, occasionally causing misinterpreted information. (This is also demonstrated with the title text.) As the frames continue, they gain the appearance of images which have been screenshotted repeatedly, with a resulting loss of quality due to compression of the original resolution and {{w|JPEG}} {{w|compression artifact|artifacting}}. (The JPEG format is intended for representing photorealistic grayscale or color images; when misused for line drawings, such as comic strips, any compression artifacts become particularly noticeable, as the background is normally of completely uniform color.) In the last frame, this is taken to an extreme, as the frame appears to have been very sloppily screenshotted off of at least two different smartphones (not the same device that uses the bottom frame in the third panel as the top border in panel four), and the final image is covered both with a watermark from an unregistered screenshot program, as well as references to at least two different web sites: {{w|9GAG}} (bottom right image) and {{w|Tumblr}} in the web address bottom left. 9GAG is an online platform and social media website where users upload and share content of their own, or of other networks. It is often accused of rehosting other sites' funny content without attribution and adding their own watermark to the image or video.
 
<!--
 
<!--
 
[9Gag is well known; maybe also provide the example of iFunny. Talk about things like "unregistered HyperCam" and the phenomenon in more detail.]
 
[9Gag is well known; maybe also provide the example of iFunny. Talk about things like "unregistered HyperCam" and the phenomenon in more detail.]
Line 18: Line 18:
 
As an easter egg, the [http://imgs.xkcd.com/comics/digital_data_2x.png high-resolution] (pixel-doubled) version of the comic is merely the comic resized to 50% and then to 400%, making it an image of poorer quality rather than a higher resolution image as for other comics, demonstrating how repeated {{w|image scaling}} can also introduce artifacts into images.
 
As an easter egg, the [http://imgs.xkcd.com/comics/digital_data_2x.png high-resolution] (pixel-doubled) version of the comic is merely the comic resized to 50% and then to 400%, making it an image of poorer quality rather than a higher resolution image as for other comics, demonstrating how repeated {{w|image scaling}} can also introduce artifacts into images.
  
The mouseover text is seemingly addressed to a reader in the future who will only be able to access xkcd through a {{w|digital archive}}. Digital information might not degrade with time, but it can't be properly displayed without knowledge of the encoding. As new encodings and file formats get developed and old ones abandoned, the webpage format of the comic might not be available in the future, when users would need special archives to view content from today's world. The mouseover text contains seemingly {{w|mojibake|garbage characters}}, which typically result from data being interpreted according to a {{w|character encoding}} different from the one used to encode it. In this case, the characters are the result of encoding the string [https://ftfy.vercel.app/?s=%C3%A2%E2%82%AC%C5%93If+you+can+read+this%2C+congratulations%C3%A2%E2%82%AC%E2%80%9Dthe+archive+you%C3%A2%E2%82%AC%E2%84%A2re+using+still+knows+about+the+mouseover+text%C3%A2%E2%82%AC%C2%9D%21 <tt>“If you can read this, congratulations—the archive you’re using still knows about the mouseover text”!</tt>] using {{w|UTF-8}} (which represents non-{{w|ASCII}} {{w|Unicode}} characters as multibyte sequences) and then interpreting the resulting bytes as the still commonly used {{w|Windows-1252}}  encoding (which uses only one byte per character, but utilizes the non-ASCII codepoints for a limited selection of extra letters and symbols such as "â" or "€"). This shows that degradation of digital data through conversions isn't restricted to images. Furthermore, as screen navigation moves away from the mouse toward touch, voice recognition, and modes still to be implemented, mouseover text will itself become archaic.
+
The mouseover text is seemingly addressed to a reader in the future who will only be able to access xkcd through a {{w|digital archive}}. Digital information might not degrade with time, but it can't be properly displayed without knowledge of the encoding. As new encodings and file formats get developed and old ones abandoned, the webpage format of the comic might not be available in the future, when users would need special archives to view content from today's world. The title text contains seemingly {{w|mojibake|garbage characters}}, which typically result from data being interpreted according to a {{w|character encoding}} different from the one used to encode it. In this case, the characters are the result of encoding the string [https://ftfy.vercel.app/?s=%C3%A2%E2%82%AC%C5%93If+you+can+read+this%2C+congratulations%C3%A2%E2%82%AC%E2%80%9Dthe+archive+you%C3%A2%E2%82%AC%E2%84%A2re+using+still+knows+about+the+mouseover+text%C3%A2%E2%82%AC%C2%9D%21 <tt>“If you can read this, congratulations—the archive you’re using still knows about the mouseover text”!</tt>] using {{w|UTF-8}} (which represents non-{{w|ASCII}} {{w|Unicode}} characters as multibyte sequences) and then interpreting the resulting bytes as the still commonly used {{w|Windows-1252}}  encoding (which uses only one byte per character, but utilizes the non-ASCII codepoints for a limited selection of extra letters and symbols such as "â" or "€"). This shows that degradation of digital data through conversions isn't restricted to images. Furthermore, as screen navigation moves away from the mouse toward touch, voice recognition, and modes still to be implemented, mouseover text will itself become archaic.
  
 
==Transcript==
 
==Transcript==

Please note that all contributions to explain xkcd may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see explain xkcd:Copyrights for details). Do not submit copyrighted work without permission!

To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:

Cancel | Editing help (opens in new window)