sdcc_users-l AT lists.bnl.gov
Subject: Scientific Data & Computing Center
List archive
Re: [Sdcc_users-l] The slurm system on IC cluster seems not working
- From: "Wang, Xuelong" <xueang AT bnl.gov>
- To: caramarc <caramarc AT rcf.rhic.bnl.gov>
- Cc: "sdcc_users-l AT lists.bnl.gov" <sdcc_users-l AT lists.bnl.gov>
- Subject: Re: [Sdcc_users-l] The slurm system on IC cluster seems not working
- Date: Fri, 12 Jul 2019 23:49:12 +0000
From: caramarc <caramarc AT rcf.rhic.bnl.gov>
Sent: Friday, July 12, 2019 7:48:01 PM
To: Wang, Xuelong
Cc: sdcc_users-l AT lists.bnl.gov
Subject: Re: [Sdcc_users-l] The slurm system on IC cluster seems not working
Sent: Friday, July 12, 2019 7:48:01 PM
To: Wang, Xuelong
Cc: sdcc_users-l AT lists.bnl.gov
Subject: Re: [Sdcc_users-l] The slurm system on IC cluster seems not working
Hi Xuelong Wang,
The filesystem that holds the slurm job state information was hanging
and all slurm operations timed out. It should be back to normal.
I suggest using the ticketing system we have in place next time. The
user list email is just for announcements:
https://www.sdcc.bnl.gov/#support
Regards,
Costin
On 2019-07-12 19:28, Wang, Xuelong wrote:
> Hi everyone~
>
> When I try to check the status of submitted task on ic cluster and
> submit new job, it says "Socket timed out on send/recv operation". It
> seems like the slurm system stops working. Has anyone run into the
> same situation with me? Does anyone know how to fix this?
>
> Thanks!
>
> _
> _
>
> _Xuelong Wang_
>
> _Chemistry Department_
> _Brookhaven National Laboratory_
> _2 Center Street, Bldg 555_
> _Chemistry Department +1 5712509485_
> _NY, 11973, U.S. xueang AT bnl.gov_
> _______________________________________________
> Sdcc_users-l mailing list
> Sdcc_users-l AT lists.bnl.gov
> https://lists.bnl.gov/mailman/listinfo/sdcc_users-l
The filesystem that holds the slurm job state information was hanging
and all slurm operations timed out. It should be back to normal.
I suggest using the ticketing system we have in place next time. The
user list email is just for announcements:
https://www.sdcc.bnl.gov/#support
Regards,
Costin
On 2019-07-12 19:28, Wang, Xuelong wrote:
> Hi everyone~
>
> When I try to check the status of submitted task on ic cluster and
> submit new job, it says "Socket timed out on send/recv operation". It
> seems like the slurm system stops working. Has anyone run into the
> same situation with me? Does anyone know how to fix this?
>
> Thanks!
>
> _
> _
>
> _Xuelong Wang_
>
> _Chemistry Department_
> _Brookhaven National Laboratory_
> _2 Center Street, Bldg 555_
> _Chemistry Department +1 5712509485_
> _NY, 11973, U.S. xueang AT bnl.gov_
> _______________________________________________
> Sdcc_users-l mailing list
> Sdcc_users-l AT lists.bnl.gov
> https://lists.bnl.gov/mailman/listinfo/sdcc_users-l
-
[Sdcc_users-l] The slurm system on IC cluster seems not working,
Wang, Xuelong, 07/12/2019
-
Message not available
- Re: [Sdcc_users-l] The slurm system on IC cluster seems not working, Wang, Xuelong, 07/12/2019
-
Message not available
Archive powered by MHonArc 2.6.24.