This example goes beyond the simple word count program and introduces the following concepts:
- Mapping data fields to case classes
- Incorporating JSON-based configuration
- Using the 'RichDate' library to perform range scans over dated HDFS directories and to handle common date-related tasks
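As a rough illustration of the first concept, the sketch below maps the fields of one log line onto a case class using plain Scala. The `Impression` field names, the tab-separated layout, and the `fromLine` helper are assumptions made for this example; the actual schema of `impression.log` and the mapping code used by the project are not shown here.

```scala
// Hypothetical record layout: the real impression.log schema may differ.
case class Impression(timestamp: Long, userId: String, adId: String)

object Impression {
  // Convert a string to Long, returning None instead of throwing on bad input.
  private def parseLong(s: String): Option[Long] =
    try Some(s.toLong) catch { case _: NumberFormatException => None }

  // Parse one tab-separated line into an Impression.
  // Lines with the wrong number of fields or a non-numeric timestamp yield None.
  def fromLine(line: String): Option[Impression] =
    line.split("\t", -1) match {
      case Array(ts, user, ad) => parseLong(ts).map(Impression(_, user, ad))
      case _                   => None
    }
}
```

Returning an `Option` lets the caller decide whether to drop or count malformed lines rather than failing the whole job on the first bad record.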
This example must be built and run against Scala 2.9.2 (newer versions require extra configuration).
To generate Eclipse project files and build the assembly jar:
sbt/sbt eclipse
sbt/sbt assembly
To run the example in local mode:
sbt/sbt \
"run example.analytics.Main \
--impressions data/input/impression.log \
--clicks data/input/click.log \
--output data/output \
--date 2012-10-01 2012-10-02 \
--local"
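The `--date` argument above takes a start date and an end date. The sketch below shows, using only the JDK, how such a range might be expanded into one directory per day for a range scan. It is not the 'RichDate' API: the `DatedPaths` object, the `dailyPaths` helper, the `basePath/yyyy/MM/dd` layout, and the choice to treat the end date as inclusive are all assumptions made for illustration.

```scala
import java.text.SimpleDateFormat
import java.util.{Calendar, TimeZone}
import scala.collection.mutable.ListBuffer

object DatedPaths {
  // Expand an inclusive [start, end] range of yyyy-MM-dd dates into one path per day.
  // The basePath/yyyy/MM/dd directory layout is hypothetical.
  def dailyPaths(basePath: String, start: String, end: String): List[String] = {
    val utc = TimeZone.getTimeZone("UTC")
    val in = new SimpleDateFormat("yyyy-MM-dd")
    in.setTimeZone(utc)
    in.setLenient(false) // reject impossible dates such as 2012-13-40
    val out = new SimpleDateFormat("yyyy/MM/dd")
    out.setTimeZone(utc)

    val cal = Calendar.getInstance(utc)
    cal.setTime(in.parse(start))
    val last = in.parse(end)

    val paths = ListBuffer[String]()
    // Step one day at a time until the cursor passes the end date.
    while (!cal.getTime.after(last)) {
      paths += basePath + "/" + out.format(cal.getTime)
      cal.add(Calendar.DAY_OF_MONTH, 1)
    }
    paths.toList
  }
}
```

For the dates in the command above, `DatedPaths.dailyPaths("data/input", "2012-10-01", "2012-10-02")` yields two paths, one for each day. A start date later than the end date yields an empty list.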