Index of /BDGraphs

Name                            Last modified     Size

Parent Directory                                  -
AdultBval0pt2.jpg               2013-09-17 20:12  17K
AdultRatioMF.jpg                2013-09-17 20:12  36K
BARUG.pdf                       2013-11-16 10:03  945K
BDGraphs_0.5.0.tar.gz           2014-01-24 21:10  41K
BankMarried.jpg                 2013-09-17 13:55  21K
Baseball.jpg                    2013-09-17 13:55  30K
BaseballGGParcoord.jpg          2013-09-17 16:30  33K
BaseballGGParcoordAlph0pt2.jpg  2013-09-17 16:30  45K
BigNGraphs_0.5.0.tar.gz         2014-01-24 21:10  41K
DataMining.pdf                  2013-11-16 10:04  1.0M
Examples.html                   2013-09-21 10:04  8.9K
IADIAHSFO.eps                   2013-09-21 09:22  1.3M
IADIAHSFO.jpg                   2013-09-17 21:07  18K
JSM.pdf                         2013-08-05 06:52  1.2M
Letters.jpg                     2013-09-17 13:55  37K
LettersExtr2.jpg                2013-09-17 14:11  37K
LettersRand50.jpg               2013-09-17 14:11  53K
NoteToNM                        2013-11-04 21:05  63
Proceeds.pdf                    2013-09-27 23:12  706K
README                          2013-04-11 13:24  54
README.html                     2013-09-27 23:13  2.7K
RFiles/                         2014-03-16 20:27  -
freqparcoord_1.0.1.tar.gz       2014-03-28 16:44  809K
parcoordf_1.0.0.tar.gz          2014-03-16 15:22  808K
pc.tar                          2014-03-16 14:31  2.3M
u                               2013-11-19 00:02  183

Norm Matloff's Big Data Visualization Tools

(Here I define Big Data rather generally. Any data set that is large enough to fill major portions of the screen when the points are plotted counts as Big from my point of view.)

I've developed a new graphical package for R, BDGraphs, which you can download here, or as individual files here. (To avoid confusion with an unrelated package, BDGraph, on CRAN, I will soon rename my package to BigNGraphs.) It consists of some novel tools for visualizing large data sets. The tools are computationally intensive, but use parallel processing to greatly reduce run time.

Here are the main tools (best understood by clicking on the Examples link):

All of the tools use nonparametric curve estimation methods, which can be computationally intensive, so the package offers parallel computation on either multicore machines or clusters.
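To give a rough idea of why parallel computation helps with nonparametric estimation, here is a minimal sketch that uses only R's standard parallel package. It is not code from BDGraphs; the helper names kde_at and par_kde, the simulated data, and the choice of a Gaussian kernel density estimate are all assumptions made for illustration. The evaluation points are split into one chunk per worker, and each worker estimates the density for its own chunk.

```r
library(parallel)  # ships with base R

# Gaussian kernel density estimate of the data `dat`, evaluated at each
# point in `at`, with bandwidth `bw`. (Hypothetical helper, not from BDGraphs.)
kde_at <- function(at, dat, bw) {
  sapply(at, function(p) mean(dnorm((p - dat) / bw)) / bw)
}

# Hypothetical helper: split the evaluation points into one consecutive
# chunk per worker, estimate each chunk on a worker, and reassemble.
par_kde <- function(cl, dat, at, bw) {
  chunks <- clusterSplit(cl, at)
  pieces <- parLapply(cl, chunks, kde_at, dat = dat, bw = bw)
  unlist(pieces)
}

set.seed(1)
x  <- rnorm(20000)                   # simulated data, for illustration only
at <- seq(-3, 3, length.out = 200)   # points at which to estimate the density
h  <- bw.nrd0(x)                     # a standard rule-of-thumb bandwidth

cl <- makeCluster(2)   # two local workers; host names here would use a cluster
dens <- par_kde(cl, x, at, h)
stopCluster(cl)
```

The result should match the serial call kde_at(at, x, h); the parallel version only divides the work. The same function runs unchanged on a multicore machine or on a cluster of machines, since the difference lies entirely in how the cl object is created.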

My JSM talk is here, and the full paper is here.

Note: No warranties of any kind are made regarding the software or methodology.