Field Guide to Hadoop e-bog
223,05 DKK
(inkl. moms 278,81 DKK)
If your organization is about to enter the world of big data, you not only need to decide whether Apache Hadoop is the right platform to use, but also which of its many components are best suited to your task. This field guide makes the exercise manageable by breaking down the Hadoop ecosystem into short, digestible sections. Youll quickly understand how Hadoops projects, subprojects, and relat...
E-bog
223,05 DKK
Forlag
O'Reilly Media
Udgivet
2 marts 2015
Længde
132 sider
Genrer
UNC
Sprog
English
Format
epub
Beskyttelse
LCP
ISBN
9781491947883
If your organization is about to enter the world of big data, you not only need to decide whether Apache Hadoop is the right platform to use, but also which of its many components are best suited to your task. This field guide makes the exercise manageable by breaking down the Hadoop ecosystem into short, digestible sections. Youll quickly understand how Hadoops projects, subprojects, and related technologies work together.Each chapter introduces a different topicsuch as core technologies or data transferand explains why certain components may or may not be useful for particular needs. When it comes to data, Hadoop is a whole new ballgame, but with this handy reference, youll have a good grasp of the playing field.Topics include:Core technologiesHadoop Distributed File System (HDFS), MapReduce, YARN, and SparkDatabase and data managementCassandra, HBase, MongoDB, and HiveSerializationAvro, JSON, and ParquetManagement and monitoringPuppet, Chef, Zookeeper, and OozieAnalytic helpersPig, Mahout, and MLLibData transferScoop, Flume, distcp, and StormSecurity, access control, auditingSentry, Kerberos, and KnoxCloud computing and virtualizationSerengeti, Docker, and Whirr