I’ve created a new E-Book providing an Overview on the Hadoop technology. The usual price is 4.99 USD but is available until the end of the week for only 0.99 cent, which is a massive discount for early buyers. The E-Book gives an overview of Hadoop projects and is intended to those that need to get started fast with Hadoop. It focuses on explaining the technology stack rather than explaining details about each technology itself.
From the cover:
Kick Start: Hadoop is an e-book on the Hadoop Technology. The focus of the kick start series is to provide a very fast entry into a new technology. This e-book is useful if you need to build up knowledge on Hadoop within hours and don’t want to spend weeks learning the content. The e-book is useful for consultants, managers, trainers, students and sales staff, that need an overview of all Hadoop technologies but don’t need to understand the technical details. This book is all about get you started fast without the need to spend days or even weeks on trying to understand the technology.
From the Index:
1 Introduction
1.1 Overview on Big Data
1.2 What is Hadoop and why is it important for Big Data?
1.3 The Hadoop Stack
2 Cluster Management with Hadoop
2.1 Apache Ambari
2.2 ZooKeeper
2.3 Oozie
3 Infrastructure and Support
3.1 The Hadoop File System (HDFS)
3.2 Hadoop Commons
3.3 Apache Yarn
4 Storing Data with Hadoop
4.1 HBase
4.2 Accumulo
4.3 Other Databases
5 Accessing Data with Hadoop
5.1 MapReduce for Native Data Access
5.2 SQL Tools in Hadoop with Apache Hive and Apache HCatalog
5.3 Scripting Data with Apache Pig
5.4 Accessing Streaming Data with Apache Storm
5.5 Accessing Real-Time Data with Apache S4
5.6 Graph Data in Hadoop with Apache Giraph and Tez
6 Data Science in Hadoop with Apache Mahout
7 Data Governance and Data Integration In Hadoop
7.1 Apache Falcon
7.2 Apache Flume
7.3 Apache Sqoop
7.4 Apache Avro
8 User Interface in Hadoop with Apache Hue
You can obtain the E-Book on Amazon for Kindle here:
Shortcuts are small Books covering one Topic at a time
Kick Start: Big Data is an E-Book about Big Data. A kick start is an ebook that readers can read within short amount and get started really fast without the need to invest days in reading a book. The target of Kick starts is to learn all the important things about a specific topic in a short and easy to read ebook. The first of this series is on Big Data. Readers will learn what Big Data is, what core technologies are involved and where you can go from there. Some technologies featured in this ebook are: Hadoop, NoSQL Databases, Data Storage techniques, Data analytic techniques and many more.
Availabe in Amazon Stores:
Index:
- 1.1 Defining Big Data……………………………………………………………………. 7
- 1.2 Characteristics for Big Data……………………………………………………. 14
Challenges for Big Data ………………………………………………………………… 23
- 2.1 Storage Performance ……………………………………………………………. 23
- 2.2 Different Storage Systems …………………………………………………….. 25
- 2.3 Data partitioning and concurrency …………………………………………. 26
- 2.4 Moving Data for Analysis ………………………………………………………. 27
Creating Big Data Applications………………………………………………………. 29
3.1 Big Data Analysis iteration …………………………………………………….. 29
Big Data Management …………………………………………………………………. 32
4.1 Hardware Foundations …………………………………………………………. 32
- 4.1.1 Storage devices …………………………………………………………….. 32
- 4.1.2 Raid Systems ………………………………………………………………… 33
- 4.1.3 Requirements for private and public Cloud Solutions ………… 34
4.2 Data Storage and Software attributes …………………………………….. 39
- 4.2.1 Data Quality Attributes ………………………………………………….. 40
- 4.2.2 CAP Theorem ……………………………………………………………….. 42
- 4.2.3 Relational Database Management Systems ……………………… 45
- 4.2.4 NoSQL………………………………………………………………………….. 48
- 4.2.5 Hybrid RDBMS/NoSQL Systems ………………………………………. 52
Big Data Platforms ………………………………………………………………………. 55
5.1 Apache Hadoop……………………………………………………………………. 55
5.1.1 Hadoop Projects……………………………………………………………. 55
Big Data Analytics………………………………………………………………………… 58
- 6.1 Machine Learning…………………………………………………………………. 58
- 6.2 Data Mining…………………………………………………………………………. 58
- 6.3 Apache Mahout……………………………………………………………………. 60
Big Data Utilization………………………………………………………………………. 61
Appendix ……………………………………………………………………………………. 63
- 8.1 Table of Figures ……………………………………………………………………. 63
- 8.2 Table of Listings……………………………………………………………………. 64
References …………………………………………………………………………………. 65
I am happy to announce that my new E-Book is in stores now! The book is 85 pages in lenght and the target for the book is to provide an overview of Amazon Web Services for .NET Developers. The E-Book by “developer.press” is called “Shortcut” and aims at delivering this topic in one or two evenings to read right after work.
The book starts with a description of the service categories offered by Amazon with a brief description of available services. Due to the fact that Services are released often and the book authoring timeframe is about 3-6 month, services such as Amazon Glacier are not yet included. The other 6 chapters focus entirely on building an Application with AWS. Amazon Elastic Beanstalk is used with Asp.NET MVC. Next, a focus on S3, SQS, DynamoDB and Amazon EC2 is given.
Subscribe to this channel to get updates about the book.
You can download the Source Code here.
The book is currently available only in German, an English Version is planned.
The E-Book is available in the iTunes Store and in the Kindle Store.