commit | 7c0363ff8d810a7108c2287aed51cedcb638d1b5 | [log] [tgz] |
---|---|---|
author | Claudio Pacchiega <claudio.pacchiega@gmail.com> | Sun Aug 27 22:33:56 2017 +0200 |
committer | Claudio Pacchiega <claudio.pacchiega@gmail.com> | Sat Sep 30 16:18:05 2017 +0100 |
tree | 2624fce9eb14f8e6893fb32015b9010707d2bdc4 | |
parent | 099ab16e888090f5b53abea2ac15a57f57cfcd65 [diff] |
Extract organization field to ElasticSearch. Obtained by the @domain part of committer email. Change-Id: Ib5b3b9996ec78780007cd6f95a47e2dccd570364
Spark ETL to extra analytics data from Gerrit Projects.
Job can be launched with the following parameters:
bin/spark-submit \ --conf spark.es.nodes=es.mycompany.com \ --conf spark.es.net.http.auth.user=elastic \ --conf spark.es.net.http.auth.pass=changeme \ $JARS/SparkAnalytics-assembly-1.0.jar \ --since 2000-06-01 \ --aggregate email_hour \ --url http://gerrit.mycompany.com \ -e gerrit/analytics