Spark Project - Log Parsing


Spark - Project - Apache log parsing - Top 10 requested URLs

Problem 1

Write Spark code to find the top 10 requested URLs, along with the number of times each has been requested. (This information helps the company identify its most popular pages and how frequently they are accessed.)
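
One possible approach to this problem can be sketched in PySpark as below. It is only a sketch under stated assumptions: the log lines are assumed to follow the Apache Common Log Format, with the request in double quotes (for example "GET /some/page.html HTTP/1.0"), and the leading slash is stripped so the URLs look like those in the sample output. The names `REQUEST_RE`, `extract_url`, `top_urls` and the `path` argument are illustrative and do not come from the project itself.

```python
import re

# Assumption: each line holds a quoted request such as "GET /path HTTP/1.0".
# The HTTP version may be missing, so the URL stops at whitespace or a quote.
REQUEST_RE = re.compile(r'"(?:GET|POST|HEAD|PUT|DELETE|OPTIONS)\s+([^\s"]+)')


def extract_url(line):
    """Return the requested URL without its leading slash, or None if no request is found."""
    match = REQUEST_RE.search(line)
    if match is None:
        return None
    # Note: a request for the root path "/" becomes an empty string here.
    return match.group(1).lstrip("/")


def top_urls(sc, path, n=10):
    """Return the n most requested URLs as (url, count) pairs, highest count first.

    sc   -- an existing SparkContext
    path -- location of the log file (hypothetical; supply your own)
    """
    return (
        sc.textFile(path)
        .map(extract_url)                          # one URL (or None) per log line
        .filter(lambda url: url is not None)       # drop lines that did not parse
        .map(lambda url: (url, 1))
        .reduceByKey(lambda a, b: a + b)           # count requests per URL
        .takeOrdered(n, key=lambda pair: -pair[1]) # keep the n largest counts
    )
```

Given an existing SparkContext `sc`, a call such as `top_urls(sc, "access.log")` would return a list of (URL, count) pairs that can be printed one per line; the file name here is a placeholder. Counting with `reduceByKey` and then calling `takeOrdered` avoids sorting the full set of URLs when only the first 10 are needed.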

Sample output

    URL                                            Count
    shuttle/missions/sts-71/mission-sts-71.html    549
    shuttle/resources/orbiters/enterprise.html     145