  1. JGroups
  2. JGRP-2361

JGroups-related errors and database connection getting reset


Details

    • Bug
    • Resolution: Cannot Reproduce
    • Major
    • None
    • 3.6.11
    • None

      clustermode=true
      cluster.broadcast.methods=jgroups
      cluster.id=0
      cluster.maxid=4
      cluster.broadcast.method.jgroups.tcp.bind_addr=xxx.yy.qq.vv
      cluster.broadcast.method.jgroups.channel.name=hybris-broadcast
      cluster.broadcast.method.jgroups.configuration=jgroups-tcp.xml
      cluster.broadcast.method.jgroups=de.hybris.platform.cluster.jgroups.JGroupsBroadcastMethod
      cluster.broadcast.method.jgroups.tcp.bind_port=65000
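
Since each node binds to the `bind_addr` and `bind_port` configured above, a first check when nodes cannot join is whether that TCP endpoint is reachable from the other nodes. The following is a minimal sketch in Python, not part of the reported setup; the function name `can_connect` and the placeholder address are assumptions made for illustration. A successful connect only shows that the port is open, not that JGroups itself is healthy.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port opens within `timeout` seconds."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, unreachable hosts, and timeouts.
        return False


if __name__ == "__main__":
    # Hypothetical placeholder: replace with the real bind_addr of each peer node.
    peers = ["127.0.0.1"]
    for peer in peers:
        state = "reachable" if can_connect(peer, 65000) else "NOT reachable"
        print(f"{peer}:65000 is {state}")
```

Running this from every node against every other node gives a quick reachability matrix for the configured port.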


    Description

      Hi,

      We are facing an issue with our cluster configuration that also makes the JVM slow to respond. After clearing the cache or restarting all nodes, the application works as expected again.
      When the issue occurs, one of the cores sits at 100% CPU utilization and the server has to be restarted; otherwise it never processes any request. Our configuration in local.properties is included in this issue, and the error logs are attached. The logs show errors related to JGroups blocking and to connections between nodes being terminated.
      Please let us know what exactly is causing the slowness and then blocking the whole server.

      The cluster configuration for each node and the error logs are attached.
      In addition, we get the following warnings while deploying or restarting the servers:

      WARN [localhost-startStop-1] [GMS] hybrisnode-0: JOIN(hybrisnode-0) sent to hybrisnode-2 timed out (after 3000 ms), on try 3
      WARN [pool-3-thread-1] [GMS] hybrisnode-3: JOIN(hybrisnode-3) sent to hybrisnode-1 timed out (after 3000 ms), on try 4
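
To see which nodes fail to answer JOIN requests across larger logs, warnings of this shape can be tallied per target node. The sketch below is one possible approach, assuming the log lines follow exactly the format of the two warnings above; the names `JoinTimeout`, `parse_join_timeout`, and `count_by_target` are hypothetical helpers, not part of JGroups.

```python
import re
from collections import Counter
from typing import Iterable, NamedTuple, Optional

# Matches GMS warnings shaped like the two lines quoted in this issue.
JOIN_TIMEOUT = re.compile(
    r"\[GMS\]\s+(?P<node>\S+): JOIN\([^)]+\) sent to (?P<target>\S+) "
    r"timed out \(after (?P<timeout_ms>\d+) ms\), on try (?P<attempt>\d+)"
)


class JoinTimeout(NamedTuple):
    node: str        # node that tried to join
    target: str      # node the JOIN request was sent to
    timeout_ms: int  # timeout reported in the warning
    attempt: int     # retry counter reported in the warning


def parse_join_timeout(line: str) -> Optional[JoinTimeout]:
    """Parse one log line; return None if it is not a JOIN timeout warning."""
    m = JOIN_TIMEOUT.search(line)
    if m is None:
        return None
    return JoinTimeout(m["node"], m["target"], int(m["timeout_ms"]), int(m["attempt"]))


def count_by_target(lines: Iterable[str]) -> Counter:
    """Count JOIN timeouts per target node, ignoring unrelated lines."""
    parsed = (parse_join_timeout(line) for line in lines)
    return Counter(p.target for p in parsed if p is not None)
```

A target that dominates the tally is the node whose logs and thread dumps are worth inspecting first.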

      Attachments

        1. error Jgroups.txt (3 kB)
        2. Jgroup error in preprod-000.txt (14 kB)
        3. Jgroup node configuration.txt (1.0 kB)
        4. Jgroups blocking and terminating connection.txt (0.7 kB)
        5. Jgroups error in console.txt (30 kB)
        6. jgroups-tcp.xml (3 kB)

          People

            Assignee: rhn-engineering-bban Bela Ban
            Reporter: karthiklevis karthikeyan Aruljothi (Inactive)
            Votes: 0
            Watchers: 2

            Dates

              Created:
              Updated:
              Resolved: