Apache Spark Basics

Which of the following is not true about RDD?

  • RDD is divided into partitions
  • RDD contains records which are divided amongst partitions
  • Each partition of RDD could be on different machines
  • Each partition of RDD can be processed by a different CPU in parallel
  • If a partition of RDD is lost, it is recreated automatically by Spark
  • RDD is loaded entirely into memory
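
To make the partition-related statements concrete, here is a minimal toy sketch in plain Python. It does not use Spark or its API: the helper names split_into_partitions and process_partition are hypothetical, and worker threads stand in for separate CPUs or machines. It only illustrates records being divided among partitions, partitions being processed independently, and a lost result being rebuilt from its input.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_partitions(records, num_partitions):
    """Divide records into contiguous, roughly equal partitions (toy model)."""
    size, extra = divmod(len(records), num_partitions)
    partitions, start = [], 0
    for i in range(num_partitions):
        # The first `extra` partitions each take one additional record.
        end = start + size + (1 if i < extra else 0)
        partitions.append(records[start:end])
        start = end
    return partitions


def process_partition(partition):
    """Hypothetical per-partition work: square every record."""
    return [x * x for x in partition]


records = list(range(10))
partitions = split_into_partitions(records, 4)

# Each partition is handled by its own worker, independently of the others.
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(process_partition, partitions))

# Simulate losing one partition's result, then rebuild it from its input
# by re-running the same computation on that partition only.
results[2] = None
results[2] = process_partition(partitions[2])

print(partitions)
print(results)
```

In real Spark the rebuilding step is driven by the RDD's recorded lineage rather than by an explicit call like the one above; the sketch only mirrors the idea.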