An Application that calculates page rank of URLs from an input stream.
A Spark program that calculates page rank.
PageRankWorkflow which connect a Spark program followed by a MapReduce
MapReduce job which counts the total number of pages for every unique page rank
A reducer that sums up the counts for each key.
A mapper that emits each url's page rank with a value of 1.
Spark PageRank program
Copyright © 2018 Cask Data, Inc.. All rights reserved.