Apache Spark Basics

35 / 69
Apache Spark - Is this method of creating rdd correct: val myrdd =...

Is this method of creating rdd correct: val myrdd = sc.parallelize(scala.io.Source.fromFile("./myfile").getLines.toList) ?

  • Yes
  • No, This method first reads data into the memory which would overflow if the file is big