(By: Boris Lublinsky, Kevin T. Smith, Alexey Yakubovich )
With the increasing use of Hadoop all over the world, it becomes necessary for enterprise architects to understand the relationship between Hadoop API and frameworks. They also need to understand that how both of them can be integrated for resolving to real-world issues.
With continuous upgradations, this task is not easy. However, a group of Hadoop programmers has come together to offer a practical and detailed guide that enables the user to build and implement these solutions. The members of this team include Boris Lublinsky, a principal architect at Nokia who has written more than 70 publications.
The second member is Kevin T. Smith who is working as Director of Technology Solutions at Novetta Solutions where he leads various teams for building data-oriented, highly secure solutions and Alexey Yakubovich, who is a vastly experienced system architect working at Hortonworks. In addition, he is a respected member of Object Management Group SIG on model-driven architecture and SOA governance.
Who Should Read This Book?
This books is one of the best hadoop books. It is a result of community effort, the main focus of this book is offering a detailed guide that builds and implements various solutions including code-level instructions. The book covers storage of database with HBase and HDFS, data process with MapReduce and automation of data processes with Oozie.
This book also covers in-depth information regarding Hadoop security, automation of Hadoop processes in real-time and running on Hadoop with Amazon Web Services. The book is rich with examples in XML and Java. Further, it covers the usage of APIs and their inner workings.
As this book offers comprehensive information about the Hadoop ecosystem, it is easy to assume that this book is primarily targeted towards developers, architects, and designers, who are the main players in building and developing Hadoop software system and require information to utilize the fabulous power of this software.
What Is Good About This Book?
With increasing use of Hadoop, it has become necessary for people who work primarily with massively scalable analytics to understand the power of Hadoop technologies and take its advantage for building and implementing real-time solutions.
As this book is primarily targeted towards developers, architects and designers who are involved in building Hadoop software systems, this book is the primary operations guide that can be used as a reference to tackling any situation. The users will learn how to build advanced enterprise applications and code-level information for delivering real-world solutions.
It provides information regarding data designing and its impact upon implementation. Further, readers will know about the functioning of MapReduce and reformulate various business issues in MapReduce. To make it easy for the users, the authors have included Java code examples that can be used in real-life situations. These Java codes have been derived from various applications that have been built by these authors and deployed in organizations.
Additionally, it covers various approaches that can be used for successful running of Hadoop in AWS or Amazon Web Services. The book showcases how to extend the capabilities of Hadoop’s security including auditing, SSO or Single-Sign-On, authorization, authentication, and encryption. Further, it offers information on building Hadoop-based applications in real-time.
To ensure this, a team of highly experienced Hadoop specialists has collaborated on this book. Although this book is directed towards the experienced programmers, the beginner can also use this book as a reference. It is because the authors have used very easy language and extensive code examples in XML and Java for users.
All in all, this is a must have a book in the reference library of a Hadoop developers, designers, and architects.
What Is Not So Good About This Book?
This book has certain downsides. For instance, although this book offers exhaustive information about a vast range of Hadoop relates topics, it has some editing issues. This is because some of the code samples are broken; the typos are another issue throughout the book.
Some of them can also be found in the source code. Some of the source codes that are given in the book do not work in real-life. However, these are minor issues and can be easily tackled by experienced architects.
A reference book that allows readers to tap the vast power of Hadoop in order to utilize it for various projects, it’s a must-have for architects, designers, and developers. Broadly, other than them, the book is targeted towards anyone who would like to understand the intricacies of the software.