Follow @BikerJaredThe phrase “big data” is used so often it’s almost trite. These days, nearly all large enterprises have established a data science or data integration practice that is used for analysis projects. In my experience, however, many smaller companies (or often smaller teams within large enterprises) have yet to adopt any sort of big data […]
Author: Jared Stevens
Using Secondary Sort to Enhance Adobe Data Feed Processing in Hadoop
In my last post, I described the basics for processing Adobe Analytics Click Stream Data Feeds using Hadoop. While the solutions outlined there will scale remarkably well, there is a more memory efficient way to do it. Having this flexibility is nice if you have lots of CPU cores available but not as much ram. […]
Introduction to Processing Click Stream Data Feeds with Hadoop and Map/Reduce
In an earlier post, Matt Moss showed how to process data feed data using an SQL database. This can be useful in a pinch when you have a smaller amount of data and need an answer quickly. What happens though when you now need to process the data at a large scale? For example, you […]
Parsing Products and Events in ClickStream Data Feeds
A lot of companies that I’ve worked with are initially confused when processing Adobe Analytics Data Feeds. The data comes out of Adobe Analytics in TSV format and you’d naturally expect that the data is ‘flat’ (meaning just rows and columns). Unfortunately, this isn’t the case. Columns like ‘post_product_list’ and ‘event_list’ are lists of data that […]
Dealing with Special Characters When Parsing Adobe Analytics Data Feeds
Adobe Clickstream Data Feeds are the most granular way to view your analytics data. They effectively contain all the information that Adobe Analytics needs to build its reports. Having a good understanding of how to use these feeds will allow you to use Analytics data in ways that aren’t possible through LiveStream, the Web Services […]
First Post
This is the first post on this blog. I’m hoping that a year from now, we’ll look back on this post and remember what our perspectives were like at the beginning. The intent for this blog is to provide easy to understand information that helps marketers (and possibly others) get the most out of their […]