Big Data Glossary e-bog
108,68 DKK
(inkl. moms 135,85 DKK)
To help you navigate the large number of new data tools available, this guide describes 60 of the most recent innovations, from NoSQL databases and MapReduce approaches to machine learning and visualization tools. Descriptions are based on first-hand experience with these tools in a production environment.This handy glossary also includes a chapter of key terms that help define many of these to...
E-bog
108,68 DKK
Forlag
O'Reilly Media
Udgivet
13 september 2011
Længde
62 sider
Genrer
UNA
Sprog
English
Format
epub
Beskyttelse
LCP
ISBN
9781449317133
To help you navigate the large number of new data tools available, this guide describes 60 of the most recent innovations, from NoSQL databases and MapReduce approaches to machine learning and visualization tools. Descriptions are based on first-hand experience with these tools in a production environment.This handy glossary also includes a chapter of key terms that help define many of these tool categories:NoSQL DatabasesDocument-oriented databases using a key/value interface rather than SQLMapReduceTools that support distributed computing on large datasetsStorageTechnologies for storing data in a distributed wayServersWays to rent computing power on remote machinesProcessingTools for extracting valuable information from large datasetsNatural Language ProcessingMethods for extracting information from human-created textMachine LearningTools that automatically perform data analyses, based on results of a one-off analysisVisualizationApplications that present meaningful data graphicallyAcquisitionTechniques for cleaning up messy public data sourcesSerializationMethods to convert data structure or object state into a storable format