Uploaded image for project: 'Infinispan'
  1. Infinispan
  2. ISPN-5575

Shared write-behind store can read stale entries on joiner

This issue belongs to an archived project. You can view it, but you can't modify it. Learn more

    • Icon: Bug Bug
    • Resolution: Obsolete
    • Icon: Major Major
    • None
    • 8.0.0.Alpha2, 7.2.3.Final
    • Core, Loaders and Stores
    • None

      The AsyncCacheWriter modification queue is not sent with state transfer when the store is shared. A joiner can then read from the shared store a stale version of entries that have updates in the modification queue but are no longer in memory (because they were either removed explicitly, or evicted).

            [ISPN-5575] Shared write-behind store can read stale entries on joiner

            One possible fix would be to delay the end of state transfer until the modifications that have not reached the joiner (i.e. with an old topology id) are actually written to the shared store.

            AsyncCacheWriter could have a new method CompletableFuture<Object> getSyncFuture(), and when the returned future completes, the current state of AsyncCacheWriter is guaranteed to be written to the store OR overwritten by a newer update. The state provider would call this method when starting an outbound transfer, and wait on the future before sending the last state chunk.

            To avoid concurrency issues, we may also have to hold StateTransferLock's exclusive topology lock while calling getSyncFuture(), and its shared topology lock while calling AsyncCacheWriter.store().

            Dan Berindei (Inactive) added a comment - One possible fix would be to delay the end of state transfer until the modifications that have not reached the joiner (i.e. with an old topology id) are actually written to the shared store. AsyncCacheWriter could have a new method CompletableFuture<Object> getSyncFuture() , and when the returned future completes, the current state of AsyncCacheWriter is guaranteed to be written to the store OR overwritten by a newer update. The state provider would call this method when starting an outbound transfer, and wait on the future before sending the last state chunk. To avoid concurrency issues, we may also have to hold StateTransferLock's exclusive topology lock while calling getSyncFuture(), and its shared topology lock while calling AsyncCacheWriter.store().

            It doesn't matter if the shared store is configured with shared="true" or with singleton="true". Both options will allow a joiner to read keys that it doesn't have in memory directly from the store, after state transfer is over, and state transfer can finish before AsyncCacheWriter writes all the modifications to the store.

            Dan Berindei (Inactive) added a comment - It doesn't matter if the shared store is configured with shared="true" or with singleton="true" . Both options will allow a joiner to read keys that it doesn't have in memory directly from the store, after state transfer is over, and state transfer can finish before AsyncCacheWriter writes all the modifications to the store.

              Unassigned Unassigned
              dberinde@redhat.com Dan Berindei (Inactive)
              Archiver:
              rhn-support-adongare Amol Dongare

                Created:
                Updated:
                Resolved:
                Archived: