mesos-reviews mailing list archives

From Joseph Wu <jos...@mesosphere.io>
Subject Review Request 67972: RFC: Added RetentionPolicy for task metadata and sandboxes.
Date Thu, 19 Jul 2018 12:59:09 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/67972/
-----------------------------------------------------------

Review request for mesos, Gilbert Song, Qian Zhang, and Vinod Kone.


Bugs: MESOS-6285 and MESOS-7947
    https://issues.apache.org/jira/browse/MESOS-6285
    https://issues.apache.org/jira/browse/MESOS-7947


Repository: mesos


Description
-------

This adds a protobuf message that tells the agent to garbage collect
more directories than it currently does.  The agent currently garbage
collects directories at the executor level, which is not ideal for
certain types of long-lived executors that launch many tasks or
nested containers over their lifetimes.

Each task launched under the same executor results in a checkpointed
TaskInfo in the agent's metadata.  With enough tasks, this slows agent
recovery and, as described in MESOS-6285, can cause the agent to be
OOM-killed.

For the default executor, each task will be launched as a nested
container, which will include a sandbox directory (under the executor's
sandbox). If too many nested containers are launched without removing
the associated sandboxes, the agent may run out of disk space.
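
To make the intended behavior concrete, here is a rough sketch of how a
retention policy might pick which terminated tasks to clean up under a
still-running executor.  It is illustrative only: the names
TerminatedTask, RetentionPolicy.max_tasks, RetentionPolicy.max_age_secs,
and select_for_gc are hypothetical and are not taken from the proposed
protobuf or from the agent code.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TerminatedTask:
    """A task (or nested container) that has reached a terminal state."""
    task_id: str
    finished_at: float  # seconds since epoch


@dataclass
class RetentionPolicy:
    """Hypothetical policy fields; the real proto in this review may differ."""
    max_tasks: int       # keep at most this many terminated tasks
    max_age_secs: float  # drop terminated tasks older than this


def select_for_gc(tasks: List[TerminatedTask],
                  policy: RetentionPolicy,
                  now: float) -> List[str]:
    """Return IDs of tasks whose metadata and sandbox could be removed.

    Tasks are ranked newest first. A task is selected if it falls outside
    the newest `max_tasks` entries or is older than `max_age_secs`.
    """
    newest_first = sorted(tasks, key=lambda t: t.finished_at, reverse=True)
    selected = []
    for index, task in enumerate(newest_first):
        too_many = index >= policy.max_tasks
        too_old = (now - task.finished_at) > policy.max_age_secs
        if too_many or too_old:
            selected.append(task.task_id)
    return selected


if __name__ == "__main__":
    finished = [
        TerminatedTask("a", 100.0),
        TerminatedTask("b", 200.0),
        TerminatedTask("c", 300.0),
        TerminatedTask("d", 400.0),
    ]
    policy = RetentionPolicy(max_tasks=2, max_age_secs=250.0)
    print(select_for_gc(finished, policy, now=500.0))
```

Applying the limit per executor, rather than waiting for the executor
itself to terminate, is what would bound both the number of checkpointed
TaskInfos and the number of nested sandboxes for a long-lived executor.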


Diffs
-----

  include/mesos/agent/agent.proto 74488e873cbf99ca487403b70691912cf3788288 
  include/mesos/mesos.proto 5a985fca39cdfb7e9b4775650a7e5dbe68c3b8ae 


Diff: https://reviews.apache.org/r/67972/diff/1/


Testing
-------


Thanks,

Joseph Wu

