Spark Project - Log Parsing

Spark - Project - Apache log parsing - Top 10 requested URLs

Problem 1 -

Write Spark code to find the top 10 requested URLs, along with the number of times each one has been requested. (This information will help the company identify its most popular pages and how frequently they are accessed.)

Sample output -

    URL Count

    shuttle/missions/sts-71/mission-sts-71.html 549
    shuttle/resources/orbiters/enterprise.html 145
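
One possible approach, sketched in PySpark, is shown below. It is a starting point under stated assumptions rather than the reference solution: it assumes each log line follows the Apache Common Log Format with a quoted request such as "GET /path HTTP/1.0", and it strips the leading slash so the URLs look like those in the sample output. The names `extract_url`, `top_urls`, and `main`, and the input path passed to `main`, are hypothetical and chosen only for illustration.

```python
import re
from typing import Optional

# Matches the quoted request part of a Common Log Format line, e.g.
#   "GET /shuttle/countdown/ HTTP/1.0"
# The HTTP version is optional because some clients omit it.
REQUEST_RE = re.compile(
    r'"(?:GET|POST|HEAD|PUT|DELETE|OPTIONS)\s+(\S+)(?:\s+HTTP/[\d.]+)?\s*"'
)


def extract_url(line: str) -> Optional[str]:
    """Return the requested URL without its leading slash, or None if no request is found."""
    match = REQUEST_RE.search(line)
    if match is None:
        return None
    return match.group(1).lstrip("/")


def top_urls(lines_rdd, n=10):
    """Return the n most requested URLs as (url, count) pairs, most frequent first."""
    return (
        lines_rdd.map(extract_url)
        # Assumption: drop malformed lines. This also drops requests for the
        # site root, which become an empty string after stripping the slash.
        .filter(lambda url: url)
        .map(lambda url: (url, 1))
        .reduceByKey(lambda a, b: a + b)
        # Negate the count so takeOrdered returns the largest counts first.
        .takeOrdered(n, key=lambda pair: -pair[1])
    )


def main(path):
    # Imported here so the parsing helper can be tried without Spark installed.
    from pyspark import SparkContext

    sc = SparkContext.getOrCreate()
    print("URL Count")
    for url, count in top_urls(sc.textFile(path)):
        print(url, count)
```

Calling `main` with the location of the access log (for example a path on HDFS) prints one URL and its count per line. Keeping `extract_url` as a plain function makes it easy to check the regular expression against a few sample lines before running the full job.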