mesos-reviews mailing list archives

From Mesos Reviewbot Windows <revi...@mesos.apache.org>
Subject Re: Review Request 66001: MESOS-6575: Add soft limit and kill to disk/xfs.
Date Thu, 15 Mar 2018 17:18:05 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/66001/#review199270
-----------------------------------------------------------



FAIL: Some of the unit tests failed. Please check the relevant logs.

Reviews applied: `['66001']`

Failed command: `Start-MesosCITesting`

All the build artifacts available at: http://dcos-win.westus.cloudapp.azure.com/mesos-build/review/66001

Relevant logs:

- [libprocess-tests-stdout.log](http://dcos-win.westus.cloudapp.azure.com/mesos-build/review/66001/logs/libprocess-tests-stdout.log):

```
[ RUN      ] TimeTest.Output
[       OK ] TimeTest.Output (1 ms)
[----------] 4 tests from TimeTest (5 ms total)

[----------] 3 tests from TimeSeriesTest
[ RUN      ] TimeSeriesTest.Set
[       OK ] TimeSeriesTest.Set (0 ms)
[ RUN      ] TimeSeriesTest.Sparsify
[       OK ] TimeSeriesTest.Sparsify (1 ms)
[ RUN      ] TimeSeriesTest.Truncate
[       OK ] TimeSeriesTest.Truncate (1 ms)
[----------] 3 tests from TimeSeriesTest (7 ms total)

[----------] 3 tests from JWTTest
[ RUN      ] JWTTest.Parse
[       OK ] JWTTest.Parse (7 ms)
[ RUN      ] JWTTest.Create
[       OK ] JWTTest.Create (1 ms)
[ RUN      ] JWTTest.Stringify
[       OK ] JWTTest.Stringify (1 ms)
[----------] 3 tests from JWTTest (12 ms total)

[----------] 1 test from SSL
[ RUN      ] SSL.Disabled
[       OK ] SSL.Disabled (11 ms)
[----------] 1 test from SSL (13 ms total)

[----------] 17 tests from SSLTest
[ RUN      ] SSLTest.SSLSocket
```

- [libprocess-tests-stderr.log](http://dcos-win.westus.cloudapp.azure.com/mesos-build/review/66001/logs/libprocess-tests-stderr.log):

```
ABORT: (D:\DCOS\mesos\mesos\3rdparty\libprocess\include\process/ssl/gtest.hpp:171): Could not generate certificate: Failed to set common name: X509_NAME_add_entry_by_txt
```

- [mesos-tests-stdout.log](http://dcos-win.westus.cloudapp.azure.com/mesos-build/review/66001/logs/mesos-tests-stdout.log):

```
[       OK ] ContentType/SchedulerTest.Revive/1 (261 ms)
[ RUN      ] ContentType/SchedulerTest.Suppress/0
[       OK ] ContentType/SchedulerTest.Suppress/0 (270 ms)
[ RUN      ] ContentType/SchedulerTest.Suppress/1
[       OK ] ContentType/SchedulerTest.Suppress/1 (291 ms)
[ RUN      ] ContentType/SchedulerTest.NoOffersWithAllRolesSuppressed/0
[       OK ] ContentType/SchedulerTest.NoOffersWithAllRolesSuppressed/0 (285 ms)
[ RUN      ] ContentType/SchedulerTest.NoOffersWithAllRolesSuppressed/1
[       OK ] ContentType/SchedulerTest.NoOffersWithAllRolesSuppressed/1 (294 ms)
[ RUN      ] ContentType/SchedulerTest.NoOffersOnReregistrationWithAllRolesSuppressed/0
[       OK ] ContentType/SchedulerTest.NoOffersOnReregistrationWithAllRolesSuppressed/0 (343 ms)
[ RUN      ] ContentType/SchedulerTest.NoOffersOnReregistrationWithAllRolesSuppressed/1
[       OK ] ContentType/SchedulerTest.NoOffersOnReregistrationWithAllRolesSuppressed/1 (342 ms)
[ RUN      ] ContentType/SchedulerTest.Message/0
[       OK ] ContentType/SchedulerTest.Message/0 (336 ms)
[ RUN      ] ContentType/SchedulerTest.Message/1
[       OK ] ContentType/SchedulerTest.Message/1 (349 ms)
[ RUN      ] ContentType/SchedulerTest.Request/0
[       OK ] ContentType/SchedulerTest.Request/0 (108 ms)
[ RUN      ] ContentType/SchedulerTest.Request/1
[       OK ] ContentType/SchedulerTest.Request/1 (112 ms)
[ RUN      ] ContentType/SchedulerTest.SchedulerReconnect/0
[       OK ] ContentType/SchedulerTest.SchedulerReconnect/0 (84 ms)
[ RUN      ] ContentType/SchedulerTest.SchedulerReconnect/1
[       OK ] ContentType/SchedulerTest.SchedulerReconnect/1 (87 ms)
[----------] 32 tests from ContentType/SchedulerTest (17470 ms total)

[----------] 4 tests from ContentTypeAndSSLConfig/SchedulerSSLTest
[ RUN      ] ContentTypeAndSSLConfig/SchedulerSSLTest.RunTaskAndTeardown/0
```

- [mesos-tests-stderr.log](http://dcos-win.westus.cloudapp.azure.com/mesos-build/review/66001/logs/mesos-tests-stderr.log):

```
I0315 17:17:49.873275  8584 registrar.cpp:391] Successfully fetched the registry (0B) in 996864ns
I0315 17:17:49.873275  8584 registrar.cpp:495] Applied 1 operations in 0ns; attempting to update the registry
I0315 17:17:49.874284 12560 registrar.cpp:552] Successfully updated the registry in 1.008128ms
I0315 17:17:49.874284 12560 registrar.cpp:424] Successfully recovered registrar
I0315 17:17:49.875576  7364 master.cpp:1792] Recovered 0 agents from the registry (239B); allowing 10mins for agents to reregister
I0315 17:17:49.883285  7708 scheduler.cpp:188] Version: 1.6.0
I0315 17:17:49.883285  9820 scheduler.cpp:311] Using default 'basic' HTTP authenticatee
I0315 17:17:49.884280  9788 scheduler.cpp:494] New master detected at master@10.3.1.8:54159
I0315 17:17:49.893291  1996 scheduler.cpp:468] Re-detecting master
I0315 17:17:49.895265  1996 scheduler.cpp:494] New master detected at master@10.3.1.8:54159
I0315 17:17:49.903272 12560 scheduler.cpp:472] Lost leading master
I0315 17:17:49.907268  7708 master.cpp:1136] Master terminating
I0315 17:17:49.933271  7708 cluster.cpp:172] Creating default 'local' authorizer
I0315 17:17:49.941269  7364 master.cpp:463] Master b0ec0bf1-9a9c-4a89-a7bd-e7d3af2795ae (win-bld-srv-01.zq4gs31qjdiunm1ryi1452nvnh.dx.internal.cloudapp.net) started on 10.3.1.8:54159
I0315 17:17:49.941269  7364 master.cpp:465] Flags at startup: --acls="" --agent_ping_timeout="15secs" --agent_reregister_timeout="10mins" --allocation_interval="1secs" --allocator="HierarchicalDRF" --authenticate_agents="true" --authenticate_frameworks="true" --authenticate_http_frameworks="true" --authenticate_http_readonly="true" --authenticate_http_readwrite="true" --authenticators="crammd5" --authorizers="local" --credentials="C:\Users\mesos\AppData\Local\Temp\kt97A9\credentials" --filter_gpu_resources="true" --framework_sorter="drf" --help="false" --hostname_lookup="true" --http_authenticators="basic" --http_framework_authenticators="basic" --initialize_driver_logging="true" --log_auto_initialize="true" --logbufsecs="0" --logging_level="INFO" --max_agent_ping_timeouts="5" --max_completed_frameworks="50" --max_completed_tasks_per_framework="1000" --max_unreachable_tasks_per_framework="1000" --port="5050" --quiet="false" --recovery_agent_removal_limit="100%" --registry="in_memory" --registry_fetch_timeout="1mins" --registry_gc_interval="15mins" --registry_max_agent_age="2weeks" --registry_max_agent_count="102400" --registry_store_timeout="100secs" --registry_strict="false" --require_agent_domain="false" --root_submissions="true" --user_sorter="drf" --version="false" --webui_dir="/webui" --work_dir="C:\Users\mesos\AppData\Local\Temp\kt97A9\master" --zk_session_timeout="10secs"
I0315 17:17:49.944269  7364 master.cpp:514] Master only allowing authenticated frameworks to register
I0315 17:17:49.944269  7364 master.cpp:520] Master only allowing authenticated agents to register
I0315 17:17:49.944269  7364 master.cpp:526] Master only allowing authenticated HTTP frameworks to register
I0315 17:17:49.945269  7364 credentials.hpp:37] Loading credentials for authentication from 'C:\Users\mesos\AppData\Local\Temp\kt97A9\credentials'
I0315 17:17:49.946270  7364 master.cpp:570] Using default 'crammd5' authenticator
I0315 17:17:49.947270  7364 http.cpp:959] Creating default 'basic' HTTP authenticator for realm 'mesos-master-readonly'
I0315 17:17:49.947270  7364 http.cpp:959] Creating default 'basic' HTTP authenticator for realm 'mesos-master-readwrite'
I0315 17:17:49.948555  7364 http.cpp:959] Creating default 'basic' HTTP authenticator for realm 'mesos-master-scheduler'
I0315 17:17:49.948555  7364 master.cpp:649] Authorization enabled
I0315 17:17:49.958271  5860 master.cpp:2119] Elected as the leading master!
I0315 17:17:49.958271  5860 master.cpp:1678] Recovering from registrar
I0315 17:17:49.960283  1996 registrar.cpp:391] Successfully fetched the registry (0B) in 2.011904ms
I0315 17:17:49.960283  1996 registrar.cpp:495] Applied 1 operations in 0ns; attempting to update the registry
I0315 17:17:49.961282  8584 registrar.cpp:552] SuccessfABORT: (D:\DCOS\mesos\mesos\3rdparty\libprocess\include\process/ssl/gtest.hpp:171): Could not generate certificate: Failed to set common name: X509_NAME_add_entry_by_txt
```

- Mesos Reviewbot Windows


On March 15, 2018, 4:14 p.m., Harold Dost wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/66001/
> -----------------------------------------------------------
> 
> (Updated March 15, 2018, 4:14 p.m.)
> 
> 
> Review request for mesos and James Peach.
> 
> 
> Bugs: MESOS-6575
>     https://issues.apache.org/jira/browse/MESOS-6575
> 
> 
> Repository: mesos
> 
> 
> Description
> -------
> 
> New Flags for disk/xfs isolator
> - This patch adds flags that turn the limit enforced by the `disk/xfs`
>   isolator into a soft limit, so that apps can go over their limit and
>   Mesos can kill them once they have.
> 
> New Flags:
> - xfs_disk_hard_limit_offset_pct - Use the `disk` as the soft limit and
>   set the hard limit to be some percentage above the soft limit. This
>   allows containers to surpass their desired allocation and makes them
>   killable.
> - xfs_disk_hard_limit_offset - Use the `disk` as the soft limit and set
>   the hard limit to a specified number of bytes above the soft limit.
> - xfs_kill_after_grace_period - This will kill tasks if they breach the
>   grace period configured using `xfs_quota -x -c "timer -p <time>"`
> - xfs_kill_check_interval - The frequency with which a container will be
>   checked for soft limit violations.
> 
> Functionality
> - Add headroom to the hard limit as a percentage or specified amount
>   for each container. This is specified at the flag level and is not
>   customizable on a per-container basis.
> - Provide the ability for an application to be killed after it violates
>   the grace period configured for projects.
> - Add a configurable interval at which the watcher checks for violations.
> 
> 
> Diffs
> -----
> 
>   src/slave/containerizer/mesos/isolators/xfs/disk.hpp 07e68a777aefba4dd35066f2eb207bba7f199d83
>   src/slave/containerizer/mesos/isolators/xfs/disk.cpp 8d9f8f846866f9de377c59cb7fb311041283ba70
>   src/slave/containerizer/mesos/isolators/xfs/utils.hpp e034133629a9c1cf58b776f8da2a93421332cee0
>   src/slave/containerizer/mesos/isolators/xfs/utils.cpp 2708524add1ff693b616d4fb241c4a0a3070520b
>   src/slave/flags.hpp 0c67bf214ceb93ae7ff088bec2648fa26ddac59e
>   src/slave/flags.cpp 962b07c1d701f4ab819b14730fbc116b981433bb
>   src/tests/containerizer/xfs_quota_tests.cpp 64c3e1c3f0bc435897626cb0a13bc19c7cb1a4fe
> 
> 
> Diff: https://reviews.apache.org/r/66001/diff/6/
> 
> 
> Testing
> -------
> 
> 
> Thanks,
> 
> Harold Dost
> 
>
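
The two offset flags in the quoted description both derive a hard limit from the `disk` soft limit. The sketch below only illustrates that arithmetic and is not the patch's code. The function name `hard_limit_bytes`, its parameters, the integer rounding, and the rule that only one offset may be given at a time are all assumptions made for this example.

```python
from typing import Optional


def hard_limit_bytes(
    soft_limit: int,
    offset_pct: Optional[int] = None,
    offset_bytes: Optional[int] = None,
) -> int:
    """Derive a hard limit (in bytes) from a soft limit.

    Hypothetical helper: `offset_pct` mirrors the idea behind
    xfs_disk_hard_limit_offset_pct and `offset_bytes` the idea behind
    xfs_disk_hard_limit_offset. Treating them as mutually exclusive is
    an assumption of this sketch, not something the review states.
    """
    if soft_limit < 0:
        raise ValueError("soft_limit must be non-negative")
    if offset_pct is not None and offset_bytes is not None:
        raise ValueError("specify at most one offset")

    if offset_pct is not None:
        if offset_pct < 0:
            raise ValueError("offset_pct must be non-negative")
        # Integer arithmetic; the headroom is rounded down to a whole byte.
        return soft_limit + soft_limit * offset_pct // 100

    if offset_bytes is not None:
        if offset_bytes < 0:
            raise ValueError("offset_bytes must be non-negative")
        return soft_limit + offset_bytes

    # No offset configured: the hard limit equals the soft limit.
    return soft_limit
```

Under these assumptions, a 1000-byte soft limit with a 10 percent offset gives a 1100-byte hard limit, and the same soft limit with a 256-byte offset gives 1256 bytes.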

