Use sackd for the login nodes #2979
base: develop
Replace slurmd with the sackd daemon on the login nodes, so that an x-login partition is no longer needed.
What is the behavior of sackd when slurmctld is unresponsive? We currently configure the slurmd service unit to restart when a configless slurmd fails to retrieve its configuration.
I added a new commit so that sackd is restarted on reconfigure. You have a valid point about the on-failure restart defined in https://github.com/GoogleCloudPlatform/slurm-gcp/blob/master/ansible/roles/slurm/templates/systemd/slurmd_overrides.j2; I will also open a PR on slurm-gcp for that.
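For illustration, a systemd drop-in along these lines could give sackd a restart-on-failure policy similar to the one discussed for slurmd. This is only a sketch: the unit name `sackd.service`, the drop-in path, and the `RestartSec` value are assumptions, and none of it is copied from `slurmd_overrides.j2`.

```ini
# Hypothetical drop-in path: /etc/systemd/system/sackd.service.d/overrides.conf
# Assumes the service unit is named sackd.service.
[Service]
# Restart the daemon whenever it exits with a failure status.
Restart=on-failure
# Illustrative back-off between restart attempts; tune as needed.
RestartSec=5s
```

This only helps if sackd actually exits with a failure status when it cannot retrieve its configuration from slurmctld, which is the open question raised above.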
IMO this should not be merged until new images are published using
Waiting for images based on 6.7.0 to be released. DO_NOT_MERGE until then.
/gcbrun
Submission Checklist
Please take the following actions before submitting this pull request.