Skip to content

[SYSTEMDS-3273] Federated Timeout#1508

Merged
Baunsgaard merged 1 commit intoapache:mainfrom
Baunsgaard:FederatedTimeOut
Jan 15, 2022
Merged

[SYSTEMDS-3273] Federated Timeout#1508
Baunsgaard merged 1 commit intoapache:mainfrom
Baunsgaard:FederatedTimeOut

Conversation

@Baunsgaard
Copy link
Contributor

This commit adds two things to the federated handler.

  1. A timeout (default unlimited) that is enabled for tests to
    ensure that the tests finish instead of having a controller waiting
    infinitely on a crashed worker.
  2. Log output of errors on the workers to enable debugging if you have
    access to the worker process output. This ensures that if there is sensitive
    information in the error message it is contained to the worker but still
    logged.

This commit adds two things to the federated handler.
1. A timeout (default unlimited) that is enabled for tests to
ensure that the tests finish instead of having a controller waiting
infinitely on a crashed worker.
2. Log output of errors on the workers to enable debugging if you have
access to the worker process output. This ensures that if there is sensitive
information in the error message it is contained to the worker but still
logged.
3. config file overwrite scratch_space if non existing in config file for tests.
This is a bug where if the config file used by the test does not contain
the scratch_space location it will use default, making it a race condition
between parallel tests to create and delete the same scratch_space locations.
@Baunsgaard Baunsgaard merged commit fbdca4f into apache:main Jan 15, 2022
@github-pages github-pages bot temporarily deployed to github-pages January 15, 2022 19:45 Inactive
@Baunsgaard Baunsgaard deleted the FederatedTimeOut branch January 28, 2022 12:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant