commit	7c0363ff8d810a7108c2287aed51cedcb638d1b5	[log] [tgz]
author	Claudio Pacchiega <claudio.pacchiega@gmail.com>	Sun Aug 27 22:33:56 2017 +0200
committer	Claudio Pacchiega <claudio.pacchiega@gmail.com>	Sat Sep 30 16:18:05 2017 +0100
tree	2624fce9eb14f8e6893fb32015b9010707d2bdc4
parent	099ab16e888090f5b53abea2ac15a57f57cfcd65 [diff]

tree: 2624fce9eb14f8e6893fb32015b9010707d2bdc4

README.md

spark-gerrit-analytics-etl

Spark ETL to extra analytics data from Gerrit Projects.

Job can be launched with the following parameters:

bin/spark-submit \
    --conf spark.es.nodes=es.mycompany.com \
    --conf spark.es.net.http.auth.user=elastic \
    --conf spark.es.net.http.auth.pass=changeme \
    $JARS/SparkAnalytics-assembly-1.0.jar \
    --since 2000-06-01 \
    --aggregate email_hour \
    --url http://gerrit.mycompany.com \
    -e gerrit/analytics

Parameters

since, until, aggregate are the same defined in Gerrit Analytics plugin see: https://gerrit.googlesource.com/plugins/analytics/+/master/README.md
-u --url Gerrit server URL with the analytics plugins installed
-e --elasticIndex specify as / to be loaded in Elastic Search if not provided no ES export will be performed
-o --out folder location for storing the output as JSON files if not provided data is saved to /analytics- where is the system temporary directory