Apache Spark
Unified analytics engine for large-scale data processing with built-in modules for streaming, SQL, machine learning and graph processing
Distributed Join Algorithms
Sort-merge, hash, and broadcast joins in distributed systems
MapReduce Fundamentals
Understanding the map-reduce programming model for big data
