Uploaded image for project: 'JBeret'
  1. JBeret
  2. JBERET-154

Support restarting killed or crashed job execution

    XMLWordPrintable

Details

    • Enhancement
    • Resolution: Done
    • Major
    • 1.1.0.Final
    • 1.1.0.Beta1
    • jberet-core
    • None

    Description

      When a job execution is killed or server crashed, the batch runtime does not have enough time to update its batch status to FAILED in job repository, so its status is left as STARTING or STARTED or STOPPING. When user tries to restart the failed job execution, the batch runtime will not restart it, because only FAILED or STOPPED job execution can be restarted.

      jberet-core should be able to detect such failed job executions, update its status in job repository, and properly restart them.

      Attachments

        Issue Links

          Activity

            People

              cfang@redhat.com Cheng Fang
              cfang@redhat.com Cheng Fang
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: