Uploaded image for project: 'Infinispan'
  1. Infinispan
  2. ISPN-4841

TopologyAwareConsistentHashFactory is slow for large cluster

XMLWordPrintable

    • Icon: Enhancement Enhancement
    • Resolution: Unresolved
    • Icon: Major Major
    • None
    • 7.0.0.CR1
    • Core
    • None

      A user observed 100% CPU usage for a long time on coordinator node when booting 500 nodes with 500 caches defined.

      It looks like the TopologyAwareConsistentHashFactory performs O(n^2), it has double loop for all Machines. It takes 50 sec to compute rebalance with 1 cache 500 nodes. This calculation is performed on every cache, so it eats 25000 sec CPU times with 500 nodes 500 caches.

      The hprof shows 90% of the time is consumed in the TopologyInfo.computeMaxSegmentsForMachine().

            Unassigned Unassigned
            rhn-support-tkimura Takayoshi Kimura
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

              Created:
              Updated: