This one has been deprecated since the demise of Gadgets on sites.
This was also part of each evening's scheduled run. The site was serialized and each analytics entry was matched up to a page on the site. The quality of the site links in the analytics data varies, so not every entry points cleanly at a real page.
Here’s how the pages are matched. Each URL is matched to as deep a level as possible. When the matching stops, it falls back to the last successful match. That way every page can be allocated somewhere on the site within the topic to which it belongs, even when an exact page match is not possible.
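To make the fallback concrete, here is a minimal Python sketch of deepest-level matching. It is not the original code: the names `PageNode`, `build_tree` and `match_deepest`, and the example paths, are all made up for illustration, and it assumes the serialized site can be reduced to a list of page paths.

```python
from urllib.parse import urlparse


class PageNode:
    """One node in the (hypothetical) site tree."""

    def __init__(self, name, path):
        self.name = name
        self.path = path
        self.children = {}


def split_path(path):
    # Drop empty segments so leading/trailing slashes don't matter.
    return [segment for segment in path.split("/") if segment]


def build_tree(page_paths):
    """Build a tree of pages from paths such as '/topics/analytics'."""
    root = PageNode("", "/")
    for page_path in page_paths:
        node = root
        for segment in split_path(page_path):
            if segment not in node.children:
                child_path = node.path.rstrip("/") + "/" + segment
                node.children[segment] = PageNode(segment, child_path)
            node = node.children[segment]
    return root


def match_deepest(root, url):
    """Walk the URL path segment by segment; when a segment has no
    matching child, stop and return the last node that did match."""
    node = root
    for segment in split_path(urlparse(url).path):
        child = node.children.get(segment)
        if child is None:
            break  # fall back to the last successful match
        node = child
    return node


site = build_tree([
    "/topics/analytics",
    "/topics/analytics/reports",
    "/topics/charts",
])
hit = match_deepest(site, "https://example.com/topics/analytics/reports/old-page?x=1")
print(hit.path)
```

In this sketch a URL that matches nothing at all lands on the root, and intermediate levels such as `/topics` exist as nodes even if they were not listed as pages, so every entry ends up somewhere in the tree.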
On completion I had a nice tree structure with one of these for each site page.
