[SOLVED] Where can I get Wall Street Journal Penn Treebank for free LEGALLY?
ProgrammingThis forum is for all programming questions.
The question does not have to be directly related to Linux and any language is fair game.
Notices
Welcome to LinuxQuestions.org, a friendly and active Linux Community.
You are currently viewing LQ as a guest. By joining our community you will have the ability to post topics, receive our newsletter, use the advanced search, subscribe to threads and access many other special features. Registration is quick, simple and absolutely free. Join our community today!
Note that registered members see fewer ads, and ContentLink is completely disabled once you log in.
If you have any problems with the registration process or your account login, please contact us. If you need to reset your password, click here.
Having a problem logging in? Please visit this page to clear all LQ-related cookies.
Get a virtual cloud desktop with the Linux distro that you want in less than five minutes with Shells! With over 10 pre-installed distros to choose from, the worry-free installation life is here! Whether you are a digital nomad or just looking for flexibility, Shells can put your Linux machine on the device that you want to use.
Exclusive for LQ members, get up to 45% off per month. Click here for more info.
You Google for it, and follow the links....I found much with a quick search, have you tried that?
Also, if you're talking about a piece of commercial software, you should PAY FOR IT...no one here is going to help you steal.
1. Its not a software.
2. Yes I tried googling it (obviously) and didn't find it for free hence the thread here. Please do some research before you state something as a fact.
Quote:
Originally Posted by stress_junkie
Your other four threads suggest that you already have it in source form.
Thanks for the reply.
Last edited by ghantauke; 03-11-2011 at 01:35 PM.
Reason: Just so people won't get confused with what I actually did with the .crp file.
2. Yes I tried googling it (obviously) and didn't find it for free hence the thread here. Please do some research before you state something as a fact.
Didn't try too hard, I guess. I get 110,000 hits just by putting in "penn treebank", with the first four links having alot of what you're looking for. How about YOU doing some research before you state something as fact??
Quote:
Thanks for the reply but heres the problem.
The file I mentioned in that thread is in .crp format which can only be used with tgrep (which is an older version of tgrep2). I tried converting the file with the tgrep2 -p command but it gives me the error "ERROR: Tree 1 doesn't start with (.". Therefore I want the source in .t2c format.
Didn't try too hard, I guess. I get 110,000 hits just by putting in "penn treebank", with the first four links having alot of what you're looking for. How about YOU doing some research before you state something as fact??
According to the tgrep2 man pages, you have to use a combination of tgrep and tgrep2 to convert the files. Did you read/search the pages?
Please do point out any one of those 110,000 links that actually lets you download the wall street journal in penn treebank form for free. Having 110,000 usless links and 1 useful link are two completely different things. I'll correct myself. "Please do some 'proper' research" before you state something.
About the documentation, thats a good advice which I appreciate. I have already had a look at it and the manual says you need to have tgrep command installed to change the format of a tgrep (.crp) file to tgrep2 file (.t2c). Unfortunately, I cannot install tgrep in my machine as its outdated and has a lot of bugs in the installing process which I have spent days to try and debug to no avail. I have started a thread concerning that but I gave up debugging it because its too much trouble. This thread is for the alternative approach.
I find, if you can't get something from 11,000 links for free, its not meant to be there for free.
I did manage to get it in .crp format for free so there's a good chance that its out there in a different format too. The only question is where exactly. Appreciate your view though.
As for everyone out there who's going to post a reply please try and give a better answer than just "google it" as no one in their right mind would be wasting their time here if they didn't do that already.
I find, if you can't get something from 11,000 links for free, its not meant to be there for free.
Exactly. The "alternative approach" is to steal it. To do that, you don't come here; you put on your hip boots and wallow around in the muck of warez sites and such. This is not such a site.
Oh. And. Be aware that when I searched for Wall Street Journal Penn Treebank, I found that the third (with duckduckgo) and the fifth (with google) entry was this question: "Where can I get Wall Street Journal Penn Treebank for free?" And yes, they point right to this thread. If you're going to steal something, you need to learn to be more discreet. Lawsuits, or worse, can cost a little.
I did manage to get the .crp file "legally" for free.
That's quite possible, if you got it from an acquaintance. You might not have committed a crime, but it's almost certain that if you got it this way, your acquaintance has violated the terms of the license under which he got it.
Let the record show that ghantauke has edited the title of the thread. The old title:
Quote:
Where can I get Wall Street Journal Penn Treebank for free?
The new title:
Quote:
Where can I get Wall Street Journal Penn Treebank for free LEGALLY?
It's clear he's becoming a little nervous. His justification for editing his original post (edit time: 8:32AM PST) is "too many people misunderstanding". Changing the title won't help the participants in the thread understand better; it will just change the search results so he's less likely to be caught.
No matter. He's made such a vigorous defense of the legality of what he's doing that now he has me curious. I've sent electronic mail to Daniel Bernard, Digital Product Chief, The Wall Street Journal Digital Network, with a link to this thread. If I hear back from him, I'll convey the results.
Let the record show that ghantauke has edited the title of the thread. The old title:
The new title:
It's clear he's becoming a little nervous. His justification for editing his original post (edit time: 8:32AM PST) is "too many people misunderstanding". Changing the title won't help the participants in the thread understand better; it will just change the search results so he's less likely to be caught.
No matter. He's made such a vigorous defense of the legality of what he's doing that now he has me curious. I've sent electronic mail to Daniel Bernard, Digital Product Chief, The Wall Street Journal Digital Network, with a link to this thread. If I hear back from him, I'll convey the results.
Now that I've read this, I'm hesitant to review anything negatively or complain about the FCC or anything like that because it seems that anything I/we say will be reported back the the party in question by corporate narcs.
Now that I've read this, I'm hesitant to review anything negatively or complain about the FCC or anything like that because it seems that anything I/we say will be reported back the the party in question by corporate narcs.
I love Microsoft. All hail FCC! Go Riaa.
Oh, piffle. He wasn't reviewing anything negatively or complaining. He made a request for information about action which would be of dubious legality at best. If he's right, he has nothing to worry about. If he's wrong, then he shouldn't be bull****ting us in the first place. I don't go around playing sheriff, but I hate to be bull****ted. If WSJ even bothers to respond (which they probably won't), we'll find out whether
LinuxQuestions.org is looking for people interested in writing
Editorials, Articles, Reviews, and more. If you'd like to contribute
content, let us know.