Use RoaringBitmap instead of BitSet in TagSet

Leverage the RoaringBitmap libary [1] to optimize usage and storage
of TagSets. This makes a significant improvement (see below)
particularly for repos with a combination of many tags and many
non-tag/non-change refs.

Also increment the git_tags cache version since the storage format is
changed.

This change intends to only do a straightforward replacement of BitSet
and does not attempt any further optimizations RoaringBitmap may
provide.

Performance comparison:

These numbers were gathered after running `git ls-remote` against a
very large repo with 1.3M refs (including ~375K tags and ~26.5K other
non-skippable refs).
|                               | BitSets  | RoaringBitmap (this change) |
| h2 PageStore heap memory used | 2.5 GB   | 155 MB                      |
| disk size of git_tags cache   | 1.5 GB   | 58 MB                       |
| runtime                       | 5.75 min | 5.5 min                     |

Further, with Gerrit's heapLimit (java -Xmx) setting set to 26g, the
BitSet implementation was not able to successfully execute the
ls-remote, whereas the RoaringBitmap implementation continued to perform
with a consistent runtime down to at least 16g and even still worked at
10g (lowest tested) but runtime then increased to 6.5 min.

[1] https://github.com/RoaringBitmap/RoaringBitmap

Change-Id: Ief36b7bb2312899a64efb9a10565b3d095ed5584
Release-Notes: Reduced memory and storage needs for TagSet by using RoaringBitmaps
10 files changed
tree: b278cfa9fb4b1cd26573fe864ed8e51d4067a712
  1. .settings/
  2. .ts-out/
  3. antlr3/
  4. contrib/
  5. Documentation/
  6. e2e-tests/
  7. java/
  8. javatests/
  9. lib/
  10. modules/
  11. plugins/
  12. polygerrit-ui/
  13. prolog/
  14. prologtests/
  15. proto/
  16. resources/
  17. tools/
  18. webapp/
  19. .bazelignore
  20. .bazelproject
  21. .bazelrc
  22. .bazelversion
  23. .editorconfig
  24. .git-blame-ignore-revs
  25. .gitignore
  26. .gitmodules
  27. .gitreview
  28. .mailmap
  29. .pydevproject
  30. .zuul.yaml
  31. BUILD
  32. COPYING
  33. INSTALL
  34. Jenkinsfile
  35. package.json
  36. README.md
  37. SUBMITTING_PATCHES
  38. version.bzl
  39. web-dev-server.config.mjs
  40. WORKSPACE
  41. yarn.lock
README.md

Gerrit Code Review

Gerrit is a code review and project management tool for Git based projects.

Build Status Maven Central

Objective

Gerrit makes reviews easier by showing changes in a side-by-side display, and allowing inline comments to be added by any reviewer.

Gerrit simplifies Git based project maintainership by permitting any authorized user to submit changes to the master Git repository, rather than requiring all approved changes to be merged in by hand by the project maintainer.

Documentation

For information about how to install and use Gerrit, refer to the documentation.

Source

Our canonical Git repository is located on googlesource.com. There is a mirror of the repository on Github.

Reporting bugs

Please report bugs on the issue tracker.

Contribute

Gerrit is the work of hundreds of contributors. We appreciate your help!

Please read the contribution guidelines.

Note that we do not accept Pull Requests via the Github mirror.

Getting in contact

The Developer Mailing list is repo-discuss on Google Groups.

License

Gerrit is provided under the Apache License 2.0.

Build

Install Bazel and run the following:

    git clone --recurse-submodules https://gerrit.googlesource.com/gerrit
    cd gerrit && bazel build release

Install binary packages (Deb/Rpm)

The instruction how to configure GerritForge/BinTray repositories is here

On Debian/Ubuntu run:

    apt-get update && apt-get install gerrit=<version>-<release>

NOTE: release is a counter that starts with 1 and indicates the number of packages that have been released with the same version of the software.

On CentOS/RedHat run:

    yum clean all && yum install gerrit-<version>[-<release>]

On Fedora run:

    dnf clean all && dnf install gerrit-<version>[-<release>]

Use pre-built Gerrit images on Docker

Docker images of Gerrit are available on DockerHub

To run a CentOS 8 based Gerrit image:

    docker run -p 8080:8080 gerritcodereview/gerrit[:version]-centos8

To run a Ubuntu 20.04 based Gerrit image:

    docker run -p 8080:8080 gerritcodereview/gerrit[:version]-ubuntu20

NOTE: release is optional. Last released package of the version is installed if the release number is omitted.