mesos-reviews mailing list archives

From Greg Mann <g...@mesosphere.io>
Subject Re: Review Request 70325: Updated the master to allocate recovered orphan operation resources.
Date Wed, 27 Mar 2019 19:57:46 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/70325/
-----------------------------------------------------------

(Updated March 27, 2019, 7:57 p.m.)


Review request for mesos, Benjamin Mahler, Gastón Kleiman, Joseph Wu, and Meng Zhu.


Bugs: MESOS-9635
    https://issues.apache.org/jira/browse/MESOS-9635


Repository: mesos


Description (updated)
-------

This patch updates the master's framework recovery code to use
the allocator's `addAgentResources()` method rather than
`updateSlave()` when recovering orphan operations. This way the
allocator tracks the resources consumed by those operations as
allocated, so they are not incorrectly offered to frameworks
while the operations are still pending.
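
The accounting difference can be illustrated with a toy model. This is
only a sketch, not code from the patch: `ToyAllocator`, `growTotalOnly()`,
`addUsedResources()`, and `offerableAfterRecovery()` are hypothetical names
and do not correspond to the Mesos allocator interface. It assumes, as the
description implies, that one recovery path adds resources to the agent's
total without recording them as allocated, while the other records them as
allocated at the same time.

```cpp
#include <cassert>

// Toy model only: these names are hypothetical and are NOT the Mesos
// allocator API. Resources are reduced to a single CPU count.
struct ToyAllocator {
  double total = 0.0;      // CPUs the allocator believes the agent has.
  double allocated = 0.0;  // CPUs the allocator has recorded as in use.

  // Assumed "untracked" path: grows the total, records no usage.
  void growTotalOnly(double cpus) { total += cpus; }

  // Assumed "tracked" path: grows the total and records the same
  // amount as already allocated.
  void addUsedResources(double cpus) {
    total += cpus;
    allocated += cpus;
  }

  // CPUs the allocator would consider free to offer to frameworks.
  double offerable() const { return total - allocated; }
};

// Recovers one orphan operation that consumes `cpus` and returns how
// many CPUs the toy allocator would consider offerable afterwards.
inline double offerableAfterRecovery(bool trackAllocation, double cpus) {
  ToyAllocator allocator;
  if (trackAllocation) {
    allocator.addUsedResources(cpus);
  } else {
    allocator.growTotalOnly(cpus);
  }
  return allocator.offerable();
}
```

In this model, `offerableAfterRecovery(false, 2.0)` leaves the pending
operation's 2 CPUs offerable, whereas `offerableAfterRecovery(true, 2.0)`
leaves none, which is the situation the patch aims for.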


Diffs
-----

  src/master/master.cpp acc67d3763ddee9027e6cf375f1d495ff5805026 


Diff: https://reviews.apache.org/r/70325/diff/1/


Testing (updated)
-------

`make check`

To verify the flaky test fix, the following command was run both before and after applying
the patches, while `stress -c <num_cores_on_machine>` was running:
`bin/mesos-tests.sh --gtest_filter="*AgentPendingOperationAfterMasterFailover*" --gtest_repeat=-1 --gtest_break_on_failure`


Thanks,

Greg Mann

