- Author: Michael Manoochehri
- Language: English
- Published: December 29, 2013
- Page: 245
- Size: 3 MB
- Format: pdf
Coverage includes
- Mastering the four guiding principles of Big Data success—and avoiding common pitfalls
- Emphasizing collaboration and avoiding problems with siloed data
- Hosting and sharing multi-terabyte datasets efficiently and economically
- “Building for infinity” to support rapid growth
- Developing a NoSQL Web app with Redis to collect crowd-sourced data
- Running distributed queries over massive datasets with Hadoop, Hive, and Shark
- Building a data dashboard with Google BigQuery
- Exploring large datasets with advanced visualization
- Implementing efficient pipelines for transforming immense amounts of data
- Automating complex processing with Apache Pig and the Cascading Java library
- Applying machine learning to classify, recommend, and predict incoming information
- Using R to perform statistical analysis on massive datasets
- Building highly efficient analytics workflows with Python and Pandas
- Establishing sensible purchasing strategies: when to build, buy, or outsource
- Previewing emerging trends and convergences in scalable data technologies and the evolving role of the Data Scientist
No comments:
Post a Comment