sphenix-software-l AT lists.bnl.gov
Subject: sPHENIX discussion of software
List archive
Re: [Sphenix-software-l] condor and - unrelated - sphnx01 rebooted
- From: pinkenburg <pinkenburg AT bnl.gov>
- To: sphenix-software-l AT lists.bnl.gov
- Subject: Re: [Sphenix-software-l] condor and - unrelated - sphnx01 rebooted
- Date: Fri, 1 Dec 2023 09:58:19 -0500
sorry misleading subject line - condor wasn't restarted, just sphnx01
Chris
On 12/1/2023 9:57 AM, pinkenburg via sPHENIX-software-l wrote:
Hi folks,
as of yesterday noon condor doesn't start our jobs on the sPHENIX farm anymore. This is being actively looked at but there is no idea so far what is wrong with it. For the time being - you can submit jobs from the PHENIX interactive nodes (rcas2061-rcas2068) which will then run on the shared pool. Just a reminder the submission host configuration decides where a job is being run, it is not in your specific job description or account (means: you can submit sPHENIX jobs from your sPHENIX account on those hosts)
sphnx01 lost its connection to gpfs yesterday - this was found when looking into logfiles under /sphenix/user but it is likely unrelated. Sadly sphnx01 needed to be rebooted to recover.
Chris
--
*************************************************************
Christopher H. Pinkenburg ; pinkenburg AT bnl.gov
; http://www.phenix.bnl.gov/~pinkenbu
Brookhaven National Laboratory ; phone: (631) 344-5692
Physics Department Bldg 510 C ; fax: (631) 344-3253
Upton, NY 11973-5000
*************************************************************
-
[Sphenix-software-l] condor and - unrelated - sphnx01 rebooted,
pinkenburg, 12/01/2023
- Re: [Sphenix-software-l] condor and - unrelated - sphnx01 rebooted, pinkenburg, 12/01/2023
Archive powered by MHonArc 2.6.24.