Kick Start: Big Data is an E-Book about Big Data. A kick start is an ebook that readers can read within short amount and get started really fast without the need to invest days in reading a book. The target of Kick starts is to learn all the important things about a specific topic in a short and easy to read ebook. The first of this series is on Big Data. Readers will learn what Big Data is, what core technologies are involved and where you can go from there. Some technologies featured in this ebook are: Hadoop, NoSQL Databases, Data Storage techniques, Data analytic techniques and many more.
Availabe in Amazon Stores:
Index:
- 1.1 Defining Big Data……………………………………………………………………. 7
- 1.2 Characteristics for Big Data……………………………………………………. 14
Challenges for Big Data ………………………………………………………………… 23
- 2.1 Storage Performance ……………………………………………………………. 23
- 2.2 Different Storage Systems …………………………………………………….. 25
- 2.3 Data partitioning and concurrency …………………………………………. 26
- 2.4 Moving Data for Analysis ………………………………………………………. 27
Creating Big Data Applications………………………………………………………. 29
3.1 Big Data Analysis iteration …………………………………………………….. 29
Big Data Management …………………………………………………………………. 32
4.1 Hardware Foundations …………………………………………………………. 32
- 4.1.1 Storage devices …………………………………………………………….. 32
- 4.1.2 Raid Systems ………………………………………………………………… 33
- 4.1.3 Requirements for private and public Cloud Solutions ………… 34
4.2 Data Storage and Software attributes …………………………………….. 39
- 4.2.1 Data Quality Attributes ………………………………………………….. 40
- 4.2.2 CAP Theorem ……………………………………………………………….. 42
- 4.2.3 Relational Database Management Systems ……………………… 45
- 4.2.4 NoSQL………………………………………………………………………….. 48
- 4.2.5 Hybrid RDBMS/NoSQL Systems ………………………………………. 52
Big Data Platforms ………………………………………………………………………. 55
5.1 Apache Hadoop……………………………………………………………………. 55
5.1.1 Hadoop Projects……………………………………………………………. 55
Big Data Analytics………………………………………………………………………… 58
- 6.1 Machine Learning…………………………………………………………………. 58
- 6.2 Data Mining…………………………………………………………………………. 58
- 6.3 Apache Mahout……………………………………………………………………. 60
Big Data Utilization………………………………………………………………………. 61
Appendix ……………………………………………………………………………………. 63
- 8.1 Table of Figures ……………………………………………………………………. 63
- 8.2 Table of Listings……………………………………………………………………. 64
References …………………………………………………………………………………. 65