PDLStats statistics modules in Perl Data Language 

at CPAN Download Changes PODs Get support 
This is a collection of statistics modules in Perl Data Language, with a quickstart guide for nonPDL people. They make perldlthe simple shell for PDLwork like a teenie weenie R, but with PDL threading"the fast (and automagic) vectorised iteration of 'elementary operations' over arbitrary slices of multidimensional data"of procedures including ttest, ordinary least squares regression, and kmeans clustering. Of course, they also work in perl scripts, which makes the package an ideal tool for statistical natural language processingyou get all the text processing power of perl as well as the fast number crunching capabilities of a data language. Documentations (PODs)Loads modules named below. Pod includes quickstart for nonPDL people. Basic statistics and related utilities (standard deviation, variance, correlation, ttest, etc.). Parameter estimations and probability density functions for distributions. General linear modeling methods (multiple linear regression, factorial, repeated measures, and mixed model anova, etc.) and logistic regression. Classic kmeans cluster analysis. Basic time series analysis functions. PDL interface to GSL Cumulative Distribution Functions. DependenciesPerl Data Language. Preferably installed with a Fortran compiler. A few methods (logistic regression and all plotting methods) will only work with a Fortran compiler and some methods (ordinary least squares regression and pca) work much faster with a Fortran compiler. The recommended PDL version is 2.4.8. PDL2.4.7 introduced a bug in lu_decomp() which caused a few functions in PDL::Stats::GLM to fail. Otherwise the minimum compatible PDL version is 2.4.4. GSL (Optional) GNU Scientific Library. This is required by PDL::Stats::Distr and PDL::GSL::CDF, the latter of which provides pvalues for PDL::Stats::GLM. GSL is NOT required for core PDL::Stats modules to work, ie. Basic, GLM, and Kmeans. PGPLOT (Optional) PDLStats currently uses PGPLOT for plotting. There are three pgplot/PGPLOT modules. This has led to much confusion upon installation. First there is the pgplot Fortran library. Then there is the perl PGPLOT module, the perl interface to pgplot. Finally there is PDL::Graphics::PGPLOT, which depends on pgplot and PGPLOT, that PDLStats uses for plotting. InstallationIf you are using debian Wheezy or Ubuntu 12.10 and above, you can easily install the package by sudo aptget install libpdlstatsperl Or, you can use cpan sudo cpan PDL::Stats You can also follow the standard perl module installation method in *nix environment and build it from the source, tar xvf PDLStatsxxx.tar.gz cd PDLStatsxxx perl Makefile.PL make make test sudo make install If you have got PDL (mostly) installed, this should be trivial. If you have trouble installing PDL, you can look for help at the PDL wiki or the PDL mailing list. Thanks to Sisyphus, Windows users can download and install the ppm version of PDLStats and all dependencies using the PPM utility included in ActiveState perl or Strawberry perl. You can also get the PPM utility from CPAN. ppm install http://www.sisyphusion.tk/ppm/PGPLOT.ppd ppm install http://www.sisyphusion.tk/ppm/PDL.ppd ppm install http://www.sisyphusion.tk/ppm/PDLStats.ppd
