LinuxQuestions.org
Latest LQ Deal: Latest LQ Deals
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 06-19-2007, 06:03 PM   #1
eoinrua
Member
 
Registered: May 2007
Location: Lurgan, Northern Ireland
Distribution: Ubuntu, Xubuntu and Puppy
Posts: 121

Rep: Reputation: 15
Any ideas for statistics software?


Hi,

I've recently converted to Linux and am checking out the options for statistical analysis.

(I know that most of you will have switched off by now!)

I'm well used to the package SPSS on Windows but is there any open source program that comes reasonably close in Linux? I'm not looking for perfection, but something that does crosstabs, Chi-square, Spearman's, ANOVA and simple regression would be great.

The only stipulation is that it would need to be GUI-based, rather than command-line run.

I've googled the subject, but I can only find programs that are a bit archaic and seem to be command-driven.

Does anyone know of any program (I know there's simple statistics in Open Office, but that's too basic) which would let me enter and analyse data in a straightforward way?

I've worked hard on getting ubuntu just the way I like it and a decent statistics package that means I'm not forever syntax-checking would complete my transition from Windows.

I look forward to hearing from the few of you who are still reading this!
 
Old 06-19-2007, 07:33 PM   #2
weibullguy
ReliaFree Maintainer
 
Registered: Aug 2004
Location: Kalamazoo, Michigan
Distribution: Slackware 14.2
Posts: 2,815
Blog Entries: 1

Rep: Reputation: 261Reputation: 261Reputation: 261
Try R --> www.r-project.org. It is really a CLI application, but you can install the Rcmdr package for a GUI. This is a Tcl/Tk interface, but provides a "spreadsheet" type interface for entering datasets and menu-driven analysis. R will also import SPSS (among other) datasets. The Rcmdr package will also "log" everything you do which can be saved as a script for reuse. Yes, it is more than capable of the analyses you are interested in performing.

You can also use Gedit or gVim to write your scripts; both have R syntax highlighting. You can 'load' the script into an R session (it's as simple as source /path/to/your/script/script from the R prompt), do your editing in Gedit or gVim, and 'run' the script in R. If that doesn't make sense, let me know...after I read it, I'm not sure I understand what I was saying.

R is developed on a Debian machine and most of the developers are the guys who developed S.

Last edited by weibullguy; 06-20-2007 at 07:59 AM.
 
Old 06-20-2007, 03:41 PM   #3
eoinrua
Member
 
Registered: May 2007
Location: Lurgan, Northern Ireland
Distribution: Ubuntu, Xubuntu and Puppy
Posts: 121

Original Poster
Rep: Reputation: 15
Thanks for that. I'll check it out - and I will get back to you if I'm struggling to understand it!

Cheers.
 
Old 06-20-2007, 05:50 PM   #4
ehedman
Member
 
Registered: Jan 2003
Location: Wisconsin, USA
Distribution: Rehat 7.3
Posts: 33

Rep: Reputation: 15
I have not tried it, but I have heard of people using the windows version of Minitab under wine.
 
Old 06-22-2007, 06:40 PM   #5
eoinrua
Member
 
Registered: May 2007
Location: Lurgan, Northern Ireland
Distribution: Ubuntu, Xubuntu and Puppy
Posts: 121

Original Poster
Rep: Reputation: 15
Sadly, I've only got a license for SPSS. I checked out wine compatibility and it's pretty dodgy. It starts the install procedure but hangs. I'd be happy to use Minitab (or SAS) but statistics software isn't cheap.

weibullguy - I've been looking at the r.project and it seems promising, especially as it has a ubuntu directory that's bang up to date. I'd like to find out more, but (forgive my newbieness) what is the Rmcdr package?

Is it specific to R, or is it something more general for building a GUI? I looked through the Feisty directory on R's CRAN but I couldn't find it anywhere.

Your advice would be appreciated - again!
 
Old 06-22-2007, 09:26 PM   #6
weibullguy
ReliaFree Maintainer
 
Registered: Aug 2004
Location: Kalamazoo, Michigan
Distribution: Slackware 14.2
Posts: 2,815
Blog Entries: 1

Rep: Reputation: 261Reputation: 261Reputation: 261
Yes, Rcmdr is specific to R. After you install the base R package, start it from the command line. Then install Rcmdr (or any other package)
Code:
install.packages("Rcmdr", dependencies=TRUE)
That will install Rcmdr and all the other packages it needs. Exit R
Code:
q()
Restart R and load Rcmdr
Code:
library(Rcmdr)
Here is a quick (not high quality) screenshot of Rcmdr in action ---> http://webpages.charter.net/weibullguy/gfx/R.png
The available R packages that can be installed using the install.package command can be found at http://lib.stat.cmu.edu/R/CRAN/. Just follow the link called packages and scroll down the page to Rcmdr. Also save a copy of "The R Reference Index" found under the Manuals link. I have a short reference card somewhere. If I find it, I'll upload it and post a link.

Eventually you'll be using the second method I mentioned anyway (gedit or gVim to write the scripts).

Last edited by weibullguy; 06-22-2007 at 09:44 PM.
 
Old 06-24-2007, 08:49 AM   #7
Kintail
LQ Newbie
 
Registered: Nov 2006
Location: Scotland
Distribution: SUSE 10.1
Posts: 16

Rep: Reputation: 0
Statistical software

I did a search to satisfy my own curiosity as I am also a fan of ANOVA. It was surprising to see that even Mac is better catered for than us. Anyway, I have come up with two possibilities which both include ANOVA and regression, and are GUI I think.

First one is http://www.statpages.org/miller/openstat/LinosOS.html

The second is http://www.gnu.org/software/dap/dap.html

By the way, if you want to be intimidated have a look at 'miller's' Background (oh dear, oh dear).

There is a third possibility in a book I bought recently

'Developing Statistical Software in Fortran 95'
by David R Lemmon & Joseph L Schafer (ISBN 0-387-23817-4)
I have been in Fortran since Fortran 1.5, and was on the Standards Committee which upgraded Fortran 77, but I find F95 heavy going. Consequently, I am going to stick to For77 or possibly For90.
The above book gives a website on page 12 where you could see source code from the book, and there is a lot of it.
http://methodology.psu.edu/fortranbook
 
Old 06-24-2007, 06:02 PM   #8
eoinrua
Member
 
Registered: May 2007
Location: Lurgan, Northern Ireland
Distribution: Ubuntu, Xubuntu and Puppy
Posts: 121

Original Poster
Rep: Reputation: 15
Well, I've got R up and running from the terminal - but it's scaring the **** out of me.

I suppose I've been well spoilt in that I've only ever used SPSS in GUI mode. And it's a doddle. My professor at uni has told me about the days when he had to do everything through syntax only. I think he reckons today's SPSS users have it too easy.

Kintail, I'll definitely check out those links. (As for Miller's background, by the way, I look forward to reading it. Just for the record, when I was starting to study statistics, the worst book I ever found had the most inappropriate title - it was called Statistics Without Tears. I was weeping buckets by the end of the first chapter, for the author made the simplest procedures seem complicated. He almost scared me off even crosstabs.)

Weibullguy, As for R, I think it's got potential, but I'd really need to go GUI at the start before I can even begin to get my head around the syntax. I'm having problems there at the minute.

I found Rcmdr, as per your instructions, on the r-project page.

I did what you suggested and that got me into R, but the "install.packages..." line is only turning up syntax errors. I'm following the code exactly and running it from R.

I cut and pasted the code directly into R, so is it possible that I'm doing it wrong (remember, I'm still pretty much a newbie) and need to do some sort of edit on the line?

If not, is there any other way to get Rcmdr installed outside the R shell?

Really appreciate the help on this one guys. I didn't think a statistics thread would produce as much assistance as this one has.
 
Old 06-24-2007, 06:13 PM   #9
eoinrua
Member
 
Registered: May 2007
Location: Lurgan, Northern Ireland
Distribution: Ubuntu, Xubuntu and Puppy
Posts: 121

Original Poster
Rep: Reputation: 15
Whao, I've just read Millar's background. Phew.
 
Old 06-24-2007, 08:37 PM   #10
maroonbaboon
Senior Member
 
Registered: Aug 2003
Location: Sydney
Distribution: debian
Posts: 1,495

Rep: Reputation: 48
In the scientific area R is one of the killer apps of open source software. Compatibility with S-Plus there are lots of supporting examples and documentation. But it's certainly aimed at a math/stat audience. Even for people with programming experience I suspect it is not very intuitive.

I had a look on Freshmeat for alternatives. Searching on 'statistics' or 'statistical' is a bit too general but 'spss' gets a few hits.

Salstat looks interesting, although the last release was 2003, with disclaimers about lack of testing. Might be worth a look if you are stuck and it does what you want. It's written in python.

The Salstat author has a blog. It seems he is still working on the program, and writing a book to go with it.

Last edited by maroonbaboon; 06-24-2007 at 09:06 PM.
 
Old 06-24-2007, 09:18 PM   #11
weibullguy
ReliaFree Maintainer
 
Registered: Aug 2004
Location: Kalamazoo, Michigan
Distribution: Slackware 14.2
Posts: 2,815
Blog Entries: 1

Rep: Reputation: 261Reputation: 261Reputation: 261
Quote:
Originally Posted by eoinrua
Well, I've got R up and running from the terminal - but it's scaring the **** out of me.

I did what you suggested and that got me into R, but the "install.packages..." line is only turning up syntax errors. I'm following the code exactly and running it from R.
Hehe. It takes a little while to get used to. You are installing software and, thus, you will need to run R as root when you install R packages. Otherwise, post your errors and maybe we can figure out what is wrong.

If you are using R for a university source, I'd guess you won't need any scripts that that in-depth. I might have some examples of regression, AOV, and DOE laying around. I do mostly survival analysis myself. I also found the short reference card I was looking for earlier.
 
Old 06-24-2007, 10:43 PM   #12
maroonbaboon
Senior Member
 
Registered: Aug 2003
Location: Sydney
Distribution: debian
Posts: 1,495

Rep: Reputation: 48
The R short reference card is great.

But it's the first time I've seen a double-sided A3 sheet called a 'card'
 
Old 06-25-2007, 05:47 PM   #13
eoinrua
Member
 
Registered: May 2007
Location: Lurgan, Northern Ireland
Distribution: Ubuntu, Xubuntu and Puppy
Posts: 121

Original Poster
Rep: Reputation: 15
I'm afraid I'm working in a Windows-only environment at the moment but when I get back to my Linux box at home I'll run R as root before trying to install.

If there are any problems, I'll post the output. Thanks for digging out the reference card. My university, unfortunately, is a totally Linux-free zone. There aren't even any Linux manuals on sale in the uni bookshop. Not one.

I'll check out the salstat pages too.

Cheers for now.
 
Old 06-25-2007, 08:22 PM   #14
weibullguy
ReliaFree Maintainer
 
Registered: Aug 2004
Location: Kalamazoo, Michigan
Distribution: Slackware 14.2
Posts: 2,815
Blog Entries: 1

Rep: Reputation: 261Reputation: 261Reputation: 261
You can get R for Windows too. Then you can keep working on a consistent platform where ever you are. I'm surprised a University stats program isn't using primarily SAS though. Way tooooooooo pricey for the average student, but probably the most widely used in industry (in the US anyway).
 
Old 06-27-2007, 05:08 PM   #15
eoinrua
Member
 
Registered: May 2007
Location: Lurgan, Northern Ireland
Distribution: Ubuntu, Xubuntu and Puppy
Posts: 121

Original Poster
Rep: Reputation: 15
Sorry for the delay in getting back to the thread, but I’ve been trying out the various suggestions so many of you have helpfully made.

So it’s time for an update. The best I’ve managed with R is to get it running from the terminal, but the Windows .exe actually works on Vista. A bit creaky, but it works, albeit with no facility for GUI data-entry.

Salstat also works in Windows, but it’s more than creaky. I tried various versions of the tarballs on Linux (some are incomplete, with a lot of dependencies required), while there is one bigger download that is self-contained. I got it unpacked but the executable won’t run, either from the folder or from the terminal. I think there is a .deb package (handy, as I use ubuntu), but the link’s not active.

My biggest success to date has been with OpenStat (the one that Mr Miller is responsible for) and, to be honest, it’s a crackingly good piece of software for GUI-users. It’s not too far away from earlier versions of SPSS in terms of what it can do. The Miller link above (see Kintail's post) is down but you can google to find the software elswhere.

The only thing is that I have to run it in Windows (it runs surprisingly well in Vista – the only unavailability is the Help menu, but it’s fairly intuitive for someone used to SPSS). Mr Miller is to be congratulated – the help file thing is just a Vista problem.

There is a Linux version of OpenStat but the download links are not working. So if anyone can find me an active link to the package for Linux I’d be very grateful. It’s a great program and I’d love to run it in Linux. When I find the time I’ll see if I can get it running under Wine. But if you find that active link, please let me know!!!

The drawback with most of the above is that you can’t import SPSS data directly and unfortunately, because my own version of SPSS just won’t work on Vista, I can’t access my files to re-save them in an appropriate format for import. I'm graduating next week and don't think I'll have the chance to convert all my datasets in uni.

Just a final word or two on R for weibullguy. The GUI in R looks promising, especially if I could get entering data. This, again, might be a Vista problem. But I’d like to have another go at getting it up and running in Linux, so here’s the output when I try to access Rcmdr as per your instructions.

After the code, running as root:

install.packages("Rcmdr", dependencies=TRUE)

R returns:

Error: syntax error, unexpected AND, expecting ',' in "install.packages(&"


As a footnote, it turns out that SAS was there all the time, but I was only ever trained in SPSS and just looked for it automatically.

By the way, I hope the thread has been useful for those who work with statistics but don't have the big bucks for the big name software.

If I can either get the R GUI running in Linux, or get a working Linux version of OpenStat, it'll all have been worthwhile. The Windows option is a bit of a compromise.

Thanks again folks.

Last edited by eoinrua; 06-27-2007 at 05:14 PM.
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Software ideas mhelliwell Linux - General 2 02-27-2005 08:38 PM
Software for resource statistics lupus9537 Linux - Software 5 07-10-2004 05:19 AM
free web site statistics software on Linux? htic22 Linux - Software 7 04-29-2004 09:08 PM
Does anyone know of add-on software for sendmail to gather statistics? ForumKid Linux - Software 3 07-14-2003 01:42 PM
Statistics and Analsys software aqoliveira Linux - General 2 11-29-2002 01:35 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 02:26 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration