Explain xkcd: It's 'cause you're dumb.
U+202e is a unicode control character that changes all proceeding text to right-to-left (RTL, as the title references). In the comic, Black Hat tires of Cueball's complaining and inserts a U+202e character in the middle of Cueball's speech, turning his complaints into gibberish - sentences that must be read from right-to-left. U+202c returns text back to its normal direction.
When multiple writers work on the same text, arguments can often arise with some writers resorting to vandalizing the works of other writers. The title text takes this up a level, suggesting the use of U+202e and other direction control characters in editor wars to disrupt other people's work.
- The title of the comic on the xkcd website actually has a U+202e character preceding it; when copied and pasted, the title of the comic actually reads "LTR". The page title is "xkcd: [U+202e]LTR", which causes Firefox to use "xkcd: xoferiF allizoM - RTL" as the window title. This also occurs in Chromium and Opera.
- In the version originally published there was a typo in the reverse text ("ETH" instead of "EHT" for "THE"). This mistake was corrected within a couple of hours.
- In some browsers, this page's title damages the appearence of all older entries in the archive page.
- [Cueball and Black Hat. Cueball is standing. Black Hat is sitting down and using a laptop.]
- Cueball: And that's not even the worst part! The worst part is that—
- Black Hat: U+202e
- Cueball: ...neve t'ndid yehT— (Flipped translation = "— They didn't even...")
- Cueball: ?lleh eht tahW... (Flipped translation = "...What the hell?")
- Cueball: ...uoy did woH (Flipped translation = "How did you...")
- Cueball: .elohssA... (Flipped translation = "...Asshole.")
Unicode Control Characters
| Unicode number
|| LEFT-TO-RIGHT EMBEDDING
|| The following text will be left-to-right. This will not change directionality of characters, so for example Arabic letters will stay right-to-left. This character alone does nothing in an English text, since the text direction is left-to-right by default.
|| RIGHT-TO-LEFT EMBEDDING
|| The following text will be right-to-left. This will not change directionality of characters, so Latin letters will stay left-to-right. Full stops, which don't have a directionality on their own, will be left of the sentence. Use this character for some little misplacings that cause big confusion.
|| POP DIRECTIONAL FORMATTING
|| The following text is formated like the text before the last U+202a, U+202b, U+202d or U+202e character.
|| LEFT-TO-RIGHT OVERRIDE
|| The following text will be left-to-right. Additionally, the directionality of characters is changed to left-to-right. Used alone in an English text, this will only affect characters that are right-to-left by default, like Arabic letters.
|| RIGHT-TO-LEFT OVERRIDE
|| The following text will be right-to-left. Additionally, the directionality of characters is changed to right-to-left. Use this character to completely screw up an English text.
add a comment! ⋅ refresh comments!
There's a typo in the comic - hte should be eht for "the" spelled backwards -jars99
- Unless you consider "th" a single character, which by the way makes a lot of sense as it is derived from old-english "eth". -- 184.108.40.206 (talk) (please sign your comments with ~~~~)
- Unless you further consider that "they" doesn't share that in the comic, making it internally inconsistent. 220.127.116.11 11:40, 21 November 2012 (UTC)
- It should be noted that at some point, "the" was corrected. Rawmustard (talk) 16:42, 22 November 2012 (UTC)
Acutally, unicode 202e doesn't "flip proceeding text back-to-front", it overrides the direction, setting it to "right-to-left" for the following text. It's back-to-front for most of us like "left-to-right" is to other writing systems. I know it's nitpicking, but xkcd readers should appreciate the symmetry. BKA (talk) 07:23, 21 November 2012 (UTC)
I don't see the reversed title. My window manager is not UTF-8 compatible, so when a window title is set to string containing UTF-8 characters, it doesn't change. This brings the question if it really is a browser problem or if the browsers behave as expected and the window manager is at fault. -- 18.104.22.168 09:17, 21 November 2012 (UTC)
- It's not a problem per sec. Browsers that get the reversed title are processing the UTF symbol correctly, there's no bug there. And the window manager has no bearing on the title text except for maybe font. Davidy22(talk) 09:32, 21 November 2012 (UTC)
- Well its the window manager that renders the window title, but it is composed by the Browser. I think that the browser should insert an appropriate number of U+202c characters, in this case it should be "xkcd: [U+202e]LTR[U+202c] - Mozilla Firefox". That would render as xkcd: RTL - Mozilla Firefox". By the way, the tab caption in Firefox is "xkcd: LTR". In Chromium and Opera it is shown correctly as "xkcd: RTL". Joha.ma (talk) 09:47, 21 November 2012 (UTC)
Another way to see this in effect is to try to type in this test page: http://www.fileformat.info/info/unicode/char/202e/browsertest.htm - and this also works in etherpad, as suggested in the caption.--Anarcat (talk) 00:09, 22 November 2012 (UTC)
Black Hat seems to have used U+202b, not e. The individual characters are left-to-right. Check the D, E, L, N, S, and ? 22.214.171.124 02:46, 22 November 2012 (UTC)
- No, U+202e does not actually mirror the characters themselves, just the displayed order. U+202b only changes the order for characters that don't have embedded direction, such as the period, which can be used with multiple languages. Bugefun (talk) 05:00, 22 November 2012 (UTC)
- Actually, it does mirror some characters, namely those that have the mirrored property. For example, the parentheses or mathematical relations like the less-than sign. Here is a list of them. --Ulm (talk) 12:34, 22 November 2012 (UTC)
- The text is not always mirrored - some browsers and font-sets do not 'understand' U+202E, so the text here might be displayed with a box before it. 126.96.36.199 15:24, 27 December 2012 (UTC)
The title of this comic has an embedded RTL in the title, as has been noted somewhere. Because of this, Randall's Archive page is now screwed up, with all titles before this comic (listed after this comic on the Archive page in descending order) now being reversed. Ha, ha! He hacked himself!
01:59, 14 January 2013 (UTC)larK