Clan x86

General Forums => General Discussion => Topic started by: AntiVirus on January 03, 2007, 11:56:45 PM

Title: What the hell?
Post by: AntiVirus on January 03, 2007, 11:56:45 PM
This is weird.  I saw this on another forum.  What you do is:

1. Open up Microsoft's Notepad

2. Type: Bush hid the facts

3. Save it

4. Reopen it

It comes out in little [][][][][][][][][][] things.  Why's that?  Tad strange?!

[Edit]
Nvm, I know why. 
http://www.hoax-slayer.com/bush-hid-the-facts-notepad.html

Still neat though!
Title: Re: What the hell?
Post by: Armin on January 03, 2007, 11:58:55 PM
It's a bug in notepad. I think it does it for just about any 4 word phrase, where the first word is 4 letters, second is 3, etc.
Title: Re: What the hell?
Post by: AntiVirus on January 04, 2007, 12:01:41 AM
Yeah, most sentences that follow: 4 3 3 5
Title: Re: What the hell?
Post by: Newby on January 04, 2007, 12:09:19 AM
iago use the linux
Title: Re: What the hell?
Post by: AntiVirus on January 04, 2007, 12:12:36 AM
Quote from: Newby on January 04, 2007, 12:09:19 AM
iago use the linux
Looks like he is missing out on all the Windows fun!
Title: Re: What the hell?
Post by: Sidoh on January 04, 2007, 12:23:44 AM
I'm pretty sure this was posted on here before.  iago or someone articulated the reasoning behind it quite well, if I remember correctly.
Title: Re: What the hell?
Post by: AntiVirus on January 04, 2007, 12:37:49 AM
Quote from: Sidoh on January 04, 2007, 12:23:44 AM
I'm pretty sure this was posted on here before.  iago or someone articulated the reasoning behind it quite well, if I remember correctly.
Well la de da.  Didn't look for it.  I didn't recall hearing about it, so I assumed it wasn't already posted. :P
Title: Re: What the hell?
Post by: Sidoh on January 04, 2007, 12:43:49 AM
Quote from: AntiVirus on January 04, 2007, 12:37:49 AM
Well la de da.  Didn't look for it.  I didn't recall hearing about it, so I assumed it wasn't already posted. :P

I wasn't accusing you of anything; I was just making a somewhat tangential statement.  I didn't mean to be rude or condescending.

It very well could have been posted before you even registered here.
Title: Re: What the hell?
Post by: Rule on January 04, 2007, 02:59:44 AM
No, I remember it being posted here less than a year ago.  "Old" :P

Here's the link. (http://www.x86labs.org:81/forum/index.php/topic,6243.0.html)
Title: Re: What the hell?
Post by: AntiVirus on January 04, 2007, 03:09:44 AM
Well damn, I was even here on these forums when that was posted!!  Looks like I am behind the times!  Well for those of you new people that didn't know, now you do! :D
Title: Re: What the hell?
Post by: Sidoh on January 04, 2007, 04:21:06 AM
Quote from: Rule on January 04, 2007, 02:59:44 AM
No, I remember it being posted here less than a year ago.  "Old" :P

Here's the link. (http://www.x86labs.org:81/forum/index.php/topic,6243.0.html)

Ah.  My sense of time on these sorts of things isn't that well established. :(
Title: Re: What the hell?
Post by: disco on January 04, 2007, 09:32:30 AM
THIS APP CAN BREAK
Title: Re: What the hell?
Post by: iago on January 07, 2007, 08:26:12 PM
Long story short: it's because of a bug when deciding whether or not a file is unicode.

And incidentally, it's not all about 4 3 3 5.  For example, "Bush hid the truth" doesn't work.
Title: Re: What the hell?
Post by: Joe on January 07, 2007, 08:51:25 PM
Yeah, HABP.

I thought it was about detecting from the line ending, though.
Title: Re: What the hell?
Post by: Skywing on January 07, 2007, 10:33:49 PM
Quote from: iago on January 07, 2007, 08:26:12 PM
Long story short: it's because of a bug when deciding whether or not a file is unicode.

And incidentally, it's not all about 4 3 3 5.  For example, "Bush hid the truth" doesn't work.
It's not really a bug.  A heuristic is being used to guess whether a text file with no indication as to whether it is Unicode or not is probably Unicode.  As such, it's really just making an educated guess based on whether something is likely Unicode or not, and like most guesses, sometimes it gets it wrong.
Title: Re: What the hell?
Post by: AntiVirus on January 08, 2007, 11:40:12 PM
Quote from: Skywing on January 07, 2007, 10:33:49 PM
Quote from: iago on January 07, 2007, 08:26:12 PM
Long story short: it's because of a bug when deciding whether or not a file is unicode.

And incidentally, it's not all about 4 3 3 5.  For example, "Bush hid the truth" doesn't work.
It's not really a bug.  A heuristic is being used to guess whether a text file with no indication as to whether it is Unicode or not is probably Unicode.  As such, it's really just making an educated guess based on whether something is likely Unicode or not, and like most guesses, sometimes it gets it wrong.
And it doesn't learn from its mistakes either!  Sheesh. :P
Title: Re: What the hell?
Post by: Sidoh on January 09, 2007, 02:30:43 AM
Quote from: AntiVirus on January 08, 2007, 11:40:12 PM
And it doesn't learn from its mistakes either!  Sheesh. :P

If it could identify a mistake, there wouldn't be any...
Title: Re: What the hell?
Post by: AntiVirus on January 09, 2007, 11:35:24 AM
Quote from: Sidoh on January 09, 2007, 02:30:43 AM
Quote from: AntiVirus on January 08, 2007, 11:40:12 PM
And it doesn't learn from its mistakes either!  Sheesh. :P

If it could identify a mistake, there wouldn't be any...
....  It was a joke. :P

And there would be a mistake.  It would have to make atleast one mistake first before it learned from it.   That's how you learn from mistakes, you make them.
Title: Re: What the hell?
Post by: Sidoh on January 09, 2007, 01:05:52 PM
Quote from: AntiVirus on January 09, 2007, 11:35:24 AM
....  It was a joke. :P

And there would be a mistake.  It would have to make atleast one mistake first before it learned from it.   That's how you learn from mistakes, you make them.

How's it going to learn from a mistake if it's unable to identify it?  That was my point. :P
Title: Re: What the hell?
Post by: AntiVirus on January 09, 2007, 03:57:24 PM
Quote from: Sidoh on January 09, 2007, 01:05:52 PM
Quote from: AntiVirus on January 09, 2007, 11:35:24 AM
....  It was a joke. :P

And there would be a mistake.  It would have to make atleast one mistake first before it learned from it.   That's how you learn from mistakes, you make them.

How's it going to learn from a mistake if it's unable to identify it?  That was my point. :P
... My first post was about it identifying the mistake and then learning about it.. :P
Title: Re: What the hell?
Post by: Sidoh on January 09, 2007, 04:10:54 PM
I don't know about you, but I'd hate to have to click "Yep, it's encoded correctly" every time I opened something with notepad. :P
Title: Re: What the hell?
Post by: iago on January 09, 2007, 07:45:35 PM
Hmm, I wonder if there's some way to fool it for longer text?

That might be useful for somebody who wants to raise the bar for viewing source on their website, since IE opens source in notepad. 
Title: Re: What the hell?
Post by: Sidoh on January 09, 2007, 07:49:05 PM
Quote from: iago on January 09, 2007, 07:45:35 PM
Hmm, I wonder if there's some way to fool it for longer text?

That might be useful for somebody who wants to raise the bar for viewing source on their website, since IE opens source in notepad. 

An interesting question... I'd guess it gets increasingly hard as the text it has to sample gets longer, though.
Title: Re: What the hell?
Post by: iago on January 09, 2007, 09:02:06 PM
That's assuming they take the entire file as a sample.  Perhaps they take the first 1024 bytes, in which case it'd be much more possible.
Title: Re: What the hell?
Post by: Skywing on January 09, 2007, 09:23:27 PM
Quote from: iago on January 09, 2007, 09:02:06 PM
That's assuming they take the entire file as a sample.  Perhaps they take the first 1024 bytes, in which case it'd be much more possible.
Notepad will use up to the first 1024 bytes in conjunction with IsTextUnicode (http://msdn.microsoft.com/library/default.asp?url=/library/en-us/intl/unicode_81np.asp) (using all available tests).  Determining just what probabilities the tests use is left as an exercise for the reader.
Title: Re: What the hell?
Post by: AntiVirus on January 09, 2007, 10:23:17 PM
Quote from: Sidoh on January 09, 2007, 04:10:54 PM
I don't know about you, but I'd hate to have to click "Yep, it's encoded correctly" every time I opened something with notepad. :P
-.-;;

I mean the program had almost like AI or something. :P Might not be possible, but it'd be nice.