I've been trying to use grep for at least 30 minutes now trying to do something simple and it just doesn't seem to be working . It's most likely something I'm doing wrong, but I can't figure it out.
I have old msn chat logs on my other hard drive. I want to simply use grep to search for a particular string/word. First of all, I am in the correct directory with the logs which MSN stores in html format.
It seems simple enough, grep -i 'string' *.html
But I've tried with futility to get this to work.I've tried using a string 'hello' and other common words, and it NEVER find anything. I thought something weird was going on after a while, so I opened up one of the logs in gedit, and saved an exact duplicate (named Fakelog) in the same folder.
Strangely, enough grep could find matches in this new copy, but not in the original?
I thought this was weird, so I did "file *" and had this output
Code:
Code:
April 2008.html: XML document text
August 2007.html: XML document text
August 2008.html: XML document text
August 2009.html: XML document text
December 2007.html: XML document text
December 2008.html: XML document text
February 2008.html: XML document text
Fakelog 2005.html: ASCII text
As you can see, it seems to be calling my Fakelog an Ascii test, and every other file an XML text. I wonder if this is why grep won't work on the original log files?
Can someone help me understand the logic behind this seemingly illogical problem? I can't understand it. I can cat the .html logs and they output , but grep just won't work on them.
Edit: I'm still trying to make sense of it, and I tried this:
Code:
$ tail -2 October\ 2007.html <----- printing the last two lines of a log
</html> <--- it worked
$ tail -2 October\ 2007.html | grep html <--- this didn't work
$ echo '</html>' | grep html <-- this did
</html>
$
It doesn't work when I 'tail', but it does when I echo the same thing? I don't get it. I noticed that the last line of the file is seemingly blank and I can't remove the blank line. I wonder if that's causing problems, really I have no idea.