Slides - Adv Spark Programming (1)
Slides - Adv Spark Programming (2)
1 Adv Spark Programming - Understanding Persistence (Python)
2 Spark Programming - RDD caching is used for avoiding re-computation of an RDD:...
3 Spark Programming - With persistence we can replicate the RDD so that spark...
4 Spark Programming - RDD caching is basically persisting an RDD into the RAM?...
5 Adv Spark Programming - Persistence StorageLevel (Python)
6 Spark Programming - The persist() method accepts an argument which is an object...
7 Spark Programming - Say, you are creating an RDD rdd1 after a lot...
8 Spark Programming - Which of the following is not true for MEMORY_AND_DISK_2 storage...
9 Spark Programming - StorageLevel does not specify the following configuration?...
10 Adv Spark Programming - Data Partitioning
11 Adv Spark Programming - Partitioning HandsOn (Python)
12 Adv Spark Programming - Data Partitioning Example
13 Spark Programming - Apache Spark's Data partitioning in RDD is useful if...
14 Spark Programming - Which of the following is not true about Data partitioning?...
15 Spark Programming - Operations that cannot benefit from Partitioning?...
16 Adv Spark Programming - Custom Partitioner (Python)
17 Spark Programming - You can create your own partitioner?...
18 Adv Spark Programming - Shared Variables
19 Spark Programming - When we pass a function say f to map or...
20 Spark Programming - If we have some data, which of the following is...
21 Spark Programming - If we have some data of a few kilobytes in size, which...
22 Adv Spark Programming - Accumulators (Python)
23 Spark Programming - Accumulators are used for...
24 Spark Programming - Accumulators will give incorrect results in the cases where:...
25 Spark Programming - Which of the use-cases is not right for an accumulator:...
26 How to create custom Accumulators in Apache Spark (Python)
27 Adv Spark Programming - Broadcast Variables
28 Spark Programming - Broadcast variables are used because...
29 Spark Programming - To share a few bytes of data with workers we use:...
30 Adv Spark Programming - Broadcast Variables Example (Python)
31 Adv Spark Programming - Key Performance Considerations - Parallelism
32 Spark Programming - If you have an RDD with 1000 records. The function...
33 Adv Spark Programming - Key Performance Considerations - Partitions
34 Spark Programming - If we have a text file of 8GB in HDFS...
35 Spark Programming - If we have 1GB data as an array and we...
36 Spark Programming - Which of the following is not the right way of controlling...
37 Adv Spark Programming - Serialization Format
38 Spark Programming - Serialization happens when...
39 Spark Programming - Performance of the serializer is important because in Spark the serialization...
40 Spark Programming - Which serializer is slowest?...
41 Adv Spark Programming - Memory Management
42 Spark Programming - How much memory is consumed for loading an RDD?...
43 Spark Programming - Spark reserves 20% of memory to shuffle and aggregation buffers....
44 Spark Programming - Spark reserves last 20% of memory to code. What does...
45 Adv Spark Programming - Hardware Provisioning
46 Spark Programming - Which of the following hardware parameters do not matter for...
47 Spark Programming - Spark is able to achieve linear scaling. What does it...
48 Spark Programming - Why is memory beyond 64GB discouraged?...
49 Adv Spark Programming - Slides