mesos-reviews mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mesos Reviewbot Windows <revi...@mesos.apache.org>
Subject Re: Review Request 66001: MESOS-6575: Add soft limit and kill to disk/xfs.
Date Thu, 15 Mar 2018 14:36:14 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/66001/#review199261
-----------------------------------------------------------



FAIL: Some of the unit tests failed. Please check the relevant logs.

Reviews applied: `['66001']`

Failed command: `Start-MesosCITesting`

All the build artifacts available at: http://dcos-win.westus.cloudapp.azure.com/mesos-build/review/66001

Relevant logs:

- [libprocess-tests-stdout.log](http://dcos-win.westus.cloudapp.azure.com/mesos-build/review/66001/logs/libprocess-tests-stdout.log):

```
[ RUN      ] TimeTest.Output
[       OK ] TimeTest.Output (0 ms)
[----------] 4 tests from TimeTest (4 ms total)

[----------] 3 tests from TimeSeriesTest
[ RUN      ] TimeSeriesTest.Set
[       OK ] TimeSeriesTest.Set (1 ms)
[ RUN      ] TimeSeriesTest.Sparsify
[       OK ] TimeSeriesTest.Sparsify (1 ms)
[ RUN      ] TimeSeriesTest.Truncate
[       OK ] TimeSeriesTest.Truncate (2 ms)
[----------] 3 tests from TimeSeriesTest (6 ms total)

[----------] 3 tests from JWTTest
[ RUN      ] JWTTest.Parse
[       OK ] JWTTest.Parse (9 ms)
[ RUN      ] JWTTest.Create
[       OK ] JWTTest.Create (1 ms)
[ RUN      ] JWTTest.Stringify
[       OK ] JWTTest.Stringify (1 ms)
[----------] 3 tests from JWTTest (12 ms total)

[----------] 1 test from SSL
[ RUN      ] SSL.Disabled
[       OK ] SSL.Disabled (10 ms)
[----------] 1 test from SSL (12 ms total)

[----------] 17 tests from SSLTest
[ RUN      ] SSLTest.SSLSocket
```

- [libprocess-tests-stderr.log](http://dcos-win.westus.cloudapp.azure.com/mesos-build/review/66001/logs/libprocess-tests-stderr.log):

```
ABORT: (D:\DCOS\mesos\mesos\3rdparty\libprocess\include\process/ssl/gtest.hpp:171): Could
not generate certificate: Failed to set common name: X509_NAME_add_entry_by_txt
```

- [mesos-tests-stdout.log](http://dcos-win.westus.cloudapp.azure.com/mesos-build/review/66001/logs/mesos-tests-stdout.log):

```
[       OK ] ContentType/SchedulerTest.Revive/1 (250 ms)
[ RUN      ] ContentType/SchedulerTest.Suppress/0
[       OK ] ContentType/SchedulerTest.Suppress/0 (261 ms)
[ RUN      ] ContentType/SchedulerTest.Suppress/1
[       OK ] ContentType/SchedulerTest.Suppress/1 (249 ms)
[ RUN      ] ContentType/SchedulerTest.NoOffersWithAllRolesSuppressed/0
[       OK ] ContentType/SchedulerTest.NoOffersWithAllRolesSuppressed/0 (238 ms)
[ RUN      ] ContentType/SchedulerTest.NoOffersWithAllRolesSuppressed/1
[       OK ] ContentType/SchedulerTest.NoOffersWithAllRolesSuppressed/1 (258 ms)
[ RUN      ] ContentType/SchedulerTest.NoOffersOnReregistrationWithAllRolesSuppressed/0
[       OK ] ContentType/SchedulerTest.NoOffersOnReregistrationWithAllRolesSuppressed/0 (311
ms)
[ RUN      ] ContentType/SchedulerTest.NoOffersOnReregistrationWithAllRolesSuppressed/1
[       OK ] ContentType/SchedulerTest.NoOffersOnReregistrationWithAllRolesSuppressed/1 (289
ms)
[ RUN      ] ContentType/SchedulerTest.Message/0
[       OK ] ContentType/SchedulerTest.Message/0 (287 ms)
[ RUN      ] ContentType/SchedulerTest.Message/1
[       OK ] ContentType/SchedulerTest.Message/1 (326 ms)
[ RUN      ] ContentType/SchedulerTest.Request/0
[       OK ] ContentType/SchedulerTest.Request/0 (99 ms)
[ RUN      ] ContentType/SchedulerTest.Request/1
[       OK ] ContentType/SchedulerTest.Request/1 (110 ms)
[ RUN      ] ContentType/SchedulerTest.SchedulerReconnect/0
[       OK ] ContentType/SchedulerTest.SchedulerReconnect/0 (92 ms)
[ RUN      ] ContentType/SchedulerTest.SchedulerReconnect/1
[       OK ] ContentType/SchedulerTest.SchedulerReconnect/1 (76 ms)
[----------] 32 tests from ContentType/SchedulerTest (15497 ms total)

[----------] 4 tests from ContentTypeAndSSLConfig/SchedulerSSLTest
[ RUN      ] ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/0
```

- [mesos-tests-stderr.log](http://dcos-win.westus.cloudapp.azure.com/mesos-build/review/66001/logs/mesos-tests-stderr.log):

```
I0315 14:36:01.583261  7064 master.cpp:1678] Recovering from registrar
I0315 14:36:01.584307  7544 registrar.cpp:391] Successfully fetched the registry (0B) in 1.046016ms
I0315 14:36:01.584307  7544 registrar.cpp:495] Applied 1 operations in 0ns; attempting to
update the registry
I0315 14:36:01.585307  9948 registrar.cpp:552] Successfully updated the registry in 999936ns
I0315 14:36:01.585307  9948 registrar.cpp:424] Successfully recovered registrar
I0315 14:36:01.586302  5980 master.cpp:1792] Recovered 0 agents from the registry (239B);
allowing 10mins for agents to reregister
I0315 14:36:01.594303  1316 scheduler.cpp:188] Version: 1.6.0
I0315 14:36:01.594303  7396 scheduler.cpp:311] Using default 'basic' HTTP authenticatee
I0315 14:36:01.595307  5480 scheduler.cpp:494] New master detected at master@10.3.1.9:56443
I0315 14:36:01.603309  7064 scheduler.cpp:468] Re-detecting master
I0315 14:36:01.604285  7064 scheduler.cpp:494] New master detected at master@10.3.1.9:56443
I0315 14:36:01.611311  6384 scheduler.cpp:472] Lost leading master
I0315 14:36:01.615309  1316 master.cpp:1136] Master terminating
I0315 14:36:01.637310  1316 cluster.cpp:172] Creating default 'local' authorizer
I0315 14:36:01.643285  6460 master.cpp:463] Master 0836bedd-10c5-4d09-ad32-8679ddb6502e (win-bld-srv-02.zq4gs31qjdiunm1ryi1452nvnh.dx.internal.cloudapp.net)
started on 10.3.1.9:56443
I0315 14:36:01.643285  6460 master.cpp:465] Flags at startup: --acls="" --agent_ping_timeout="15secs"
--agent_reregister_timeout="10mins" --allocation_interval="1secs" --allocator="HierarchicalDRF"
--authenticate_agents="true" --authenticate_frameworks="true" --authenticate_http_frameworks="true"
--authenticate_http_readonly="true" --authenticate_http_readwrite="true" --authenticators="crammd5"
--authorizers="local" --credentials="C:\Users\mesos\AppData\Local\Temp\lmBqsF\credentials"
--filter_gpu_resources="true" --framework_sorter="drf" --help="false" --hostname_lookup="true"
--http_authenticators="basic" --http_framework_authenticators="basic" --initialize_driver_logging="true"
--log_auto_initialize="true" --logbufsecs="0" --logging_level="INFO" --max_agent_ping_timeouts="5"
--max_completed_frameworks="50" --max_completed_tasks_per_framework="1000" --max_unreachable_tasks_per_framework="1000"
--port="5050" --quiet="false" --recovery_agent_removal_limit="100%" --registry="in_memory"
  --registry_fetch_timeout="1mins" --registry_gc_interval="15mins" --registry_max_agent_age="2weeks"
--registry_max_agent_count="102400" --registry_store_timeout="100secs" --registry_strict="false"
--require_agent_domain="false" --root_submissions="true" --user_sorter="drf" --version="false"
--webui_dir="/webui" --work_dir="C:\Users\mesos\AppData\Local\Temp\lmBqsF\master" --zk_session_timeout="10secs"
I0315 14:36:01.645282  6460 master.cpp:514] Master only allowing authenticated frameworks
to register
I0315 14:36:01.646283  6460 master.cpp:520] Master only allowing authenticated agents to register
I0315 14:36:01.646283  6460 master.cpp:526] Master only allowing authenticated HTTP frameworks
to register
I0315 14:36:01.647286  6460 credentials.hpp:37] Loading credentials for authentication from
'C:\Users\mesos\AppData\Local\Temp\lmBqsF\credentials'
I0315 14:36:01.648310  6460 master.cpp:570] Using default 'crammd5' authenticator
I0315 14:36:01.649309  6460 http.cpp:957] Creating default 'basic' HTTP authenticator for
realm 'mesos-master-readonly'
I0315 14:36:01.649309  6460 http.cpp:957] Creating default 'basic' HTTP authenticator for
realm 'mesos-master-readwrite'
I0315 14:36:01.650328  6460 http.cpp:957] Creating default 'basic' HTTP authenticator for
realm 'mesos-master-scheduler'
I0315 14:36:01.650328  6460 master.cpp:649] Authorization enabled
I0315 14:36:01.659286  7544 master.cpp:2119] Elected as the leading master!
I0315 14:36:01.659286  7544 master.cpp:1678] Recovering from registrar
I0315 14:36:01.660295  7064 registrar.cpp:391] Successfully fetched the registry (0B) in 1.00992ms
I0315 14:36:01.660295  7064 registrar.cpp:495] AppliedABORT: (D:\DCOS\mesos\mesos\3rdparty\libprocess\include\process/ssl/gtest.hpp:171):
Could not generate certificate: Failed to set common name: X509_NAME_add_entry_by_txt
```

- Mesos Reviewbot Windows


On March 15, 2018, 1:48 p.m., Harold Dost wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/66001/
> -----------------------------------------------------------
> 
> (Updated March 15, 2018, 1:48 p.m.)
> 
> 
> Review request for mesos and James Peach.
> 
> 
> Bugs: MESOS-6575
>     https://issues.apache.org/jira/browse/MESOS-6575
> 
> 
> Repository: mesos
> 
> 
> Description
> -------
> 
> New Flags for disk/xfs isolator
> - This patch adds a number of flags to handle switching the limit in the
>   `disk/xfs` isolator to allow apps to go over their limit and for mesos
> to kill them if they have gone over their limit.
> 
> New Flags:
> - xfs_disk_hard_limit_offset_pct - Use the `disk` as the soft limit and
>   set the hard limit to be some percentage above the soft limit.
> Allowing
>   for containers to surpass a desired allocation and making them
> killable.
> - xfs_disk_hard_limit_offset - Use the `disk` as the soft limit and set
>   the hard limit to some number of bytes specified above the
>   applications to be a soft limit instead of a hard limit.
> - xfs_kill_after_grace_period - This will kill tasks if they breach the
>   grace period configured using `xfs_quota -x -c "timer -p <time>"`
> - xfs_kill_check_interval - The frequency with which a container will be
>   checked for soft limit violations.
> 
> Functionality
> - Add head room to the hard limit as a percentage or specified amount
>   for each container. This is specified at a flag level and not
>   customizable on a per container basis.
> - Provide the ability for an application to be killed after the
>   configured grace period for projects is violated.
> - Add an interval between which the watcher will check for violations.
> 
> 
> Diffs
> -----
> 
>   src/slave/containerizer/mesos/isolators/xfs/disk.hpp 07e68a777aefba4dd35066f2eb207bba7f199d83

>   src/slave/containerizer/mesos/isolators/xfs/disk.cpp 8d9f8f846866f9de377c59cb7fb311041283ba70

>   src/slave/containerizer/mesos/isolators/xfs/utils.hpp e034133629a9c1cf58b776f8da2a93421332cee0

>   src/slave/containerizer/mesos/isolators/xfs/utils.cpp 2708524add1ff693b616d4fb241c4a0a3070520b

>   src/slave/flags.hpp 0c67bf214ceb93ae7ff088bec2648fa26ddac59e 
>   src/slave/flags.cpp 962b07c1d701f4ab819b14730fbc116b981433bb 
>   src/tests/containerizer/xfs_quota_tests.cpp 64c3e1c3f0bc435897626cb0a13bc19c7cb1a4fe

> 
> 
> Diff: https://reviews.apache.org/r/66001/diff/5/
> 
> 
> Testing
> -------
> 
> 
> Thanks,
> 
> Harold Dost
> 
>


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message