mesos-reviews mailing list archives

From Joseph Wu <>
Subject Re: Review Request 69980: Modified when master responds to operation status updates.
Date Wed, 20 Feb 2019 00:47:34 GMT

This is an automatically generated e-mail. To reply, visit:

(Updated Feb. 19, 2019, 4:47 p.m.)

Review request for mesos, Benno Evers, Gastón Kleiman, and Greg Mann.


Modified comment per suggestion.

Bugs: MESOS-9542

Repository: mesos


When handling orphaned operation status updates, the master must
cover two cases:
- The simple case is when the master knows the framework is completed.
  These status updates can be acknowledged by the master.
- However, a completed framework can be rotated out of the master's
  memory.  In addition, after master failover, if an agent reregisters
  before the framework, an operation can appear to be orphaned until
  the framework reregisters.

This adds a fixed delay between agent reregistration and the point at
which the master acknowledges operation status updates from unknown
frameworks. The delay should give frameworks ample time to reregister.

The delay is measured from the agent's reregistration, which limits
how long acknowledgements are held back for frameworks that were
rotated out of the completed frameworks buffer.
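
The decision described above can be sketched roughly as follows. This
is an illustration only, not the code in src/master/master.cpp: the
names FrameworkState, kOrphanAckDelay, and masterShouldAcknowledge are
hypothetical, the 10-minute value is a placeholder (the real constant
in src/master/constants.hpp is not shown in this message), and the
handling of registered frameworks is an assumption.

```cpp
#include <chrono>

using Clock = std::chrono::steady_clock;

// Placeholder value; the actual delay defined by the patch is not
// stated in this message.
constexpr std::chrono::minutes kOrphanAckDelay{10};

// How the master currently sees the framework that owns the operation.
enum class FrameworkState { Registered, Completed, Unknown };

// Returns true if the master itself should acknowledge the operation
// status update instead of waiting for a framework to do so.
bool masterShouldAcknowledge(
    FrameworkState state,
    Clock::time_point agentReregisteredAt,
    Clock::time_point now)
{
  switch (state) {
    case FrameworkState::Registered:
      // Assumption: not an orphan, the framework acknowledges it.
      return false;
    case FrameworkState::Completed:
      // Simple case: the master knows the framework is completed.
      return true;
    case FrameworkState::Unknown:
      // Either rotated out of the completed frameworks buffer or not
      // yet reregistered after failover. Wait a fixed delay, measured
      // from agent reregistration, before acknowledging.
      return now - agentReregisteredAt >= kOrphanAckDelay;
  }
  return false;
}
```

Because the delay is measured from agent reregistration rather than
from receipt of each update, an agent that reregistered long ago gets
its orphaned updates acknowledged right away in this sketch.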

Diffs (updated)

  src/master/constants.hpp b0ab9187b8c672180e2ffb8b63cb7349dbe43ac4 
  src/master/master.cpp 106d924bf16231b3bda3fb719db68c01d73644ee 




TODO: This case needs unit tests.


Joseph Wu
