Parallel R e-book
123,90 DKK
(incl. VAT 154,88 DKK)
Publisher
O'Reilly Media
Published
21 October 2011
Length
126 pages
Genres
Computer programming / software engineering
Language
English
Format
epub
Protection
LCP
ISBN
9781449320331
It's tough to argue with R as a high-quality, cross-platform, open source statistical software product, unless you're in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets, including three chapters on using R and Hadoop together. You'll learn the basics of Snow, Multicore, Parallel, Segue, RHIPE, and Hadoop Streaming, including how to find them, how to use them, when they work well, and when they don't.
With these packages, you can overcome R's single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R's memory barrier.
Snow: works well in a traditional cluster environment
Multicore: popular for multiprocessor and multicore computers
Parallel: part of the upcoming R 2.14.0 release
R+Hadoop: provides low-level access to a popular form of cluster computing
RHIPE: uses Hadoop's power with R's language and interactive shell
Segue: lets you use Elastic MapReduce as a backend for lapply-style operations