Apache SparkUnified analytics engine for large-scale data processing with built-in modules for streaming, SQL, machine learning and graph processing