DSM
-
Difference between RDD and DSMDistributedSystem/Spark 2019. 9. 25. 04:30
1. Overview The RDD (resilient distributed DataSet) elastic distributed data set is the core data structure of spark. DSM (distributed shared memory) is a common memory data abstraction. In DSM, applications can read and write to any location in the global address space. The main difference between RDD and DSM is that not only can the RDD be created by bulk conversion (i.e. "write"), but it can ..