Spark Script
-
Apache SparkDistributedSystem/Spark 2019. 9. 20. 00:55
1. Overview An open-source distributed general-purpose cluster computing framework with mostly in-memory data processing engine that can do ETL, analytics, machine learning, and graph processing on large volumes of data at rest(batch processing) or in motion(streaming processing) with rich concise high-level APIs for the programming languages: Scala, Python, Java, R, and SQL 2. Description 2.1 A..