Foundations
Grids and Virtualization
Service-Oriented Architecture
Enterprise Service Bus
Enterprise Message Bus
The Cloud
The Hadoop Ecosystem
HDFS: Hadoop Distributed File System
Resource Negotiators: YARN, Mesos, and Spark; ZooKeeper
Hadoop Map/Reduce
Spark
Hadoop Ecosystem Distributions: Cloudera, Hortonworks, OpenSource
Big Data, NOSQL, and ETL
Big Data vs. RDBMS
NOSQL: Not Only SQL
Relational Databases: Oracle, MariaDB, DB/2, SQL Server, PostGreSQL
Key/Value Databases: JBoss Infinispan, Terracotta, Dynamo, Voldemort
Columnar Databases: Cassandra, HBase, BigTable
Document Databases: MongoDB, CouchDB/CouchBase
Graph Databases: Giraph, Neo4J, GraphX
Apache Hive
Common Data Formats
Leveraging SQL and SQL variants
ETL: Exchange, Transform, Load
Data Ingestion, Transformation, and Loading
Exporting Data
Sqoop, Flume, Informatica, and other tools
Enterprise Integration Patterns and Message Busses
Enterprise Integration Patterns: Apache Camel and Spring Integration
Enterprise Message Busses: Apache Kafka, ActiveMQ, and other tools
Developing in Hadoop Ecosystem
Languages: R, Python, Java, Scala, Pig, and BPMN
Libraries and Frameworks
Development, Testing, and Deployment
Artificial Intelligence and Business Systems
Artificial Intelligence: Myths, Legends, and Reality
The Math
Statistics
Probability
Clustering Algorithms, Mahout, MLLib, SciKit, and Madlib
Business Rule Systems: Drools, JRules, Pegasus
The Team
Agile Data Science
NOSQL Data Architects and Administrators
Developers
Grid Administrators
Business and Data Analysts
Management
Evolving your Team
Growing your Infrastructure
Reviews
There are no reviews yet.