atlas-connect-l AT lists.bnl.gov
Subject: Atlas-connect-l mailing list
Re: [Atlas-connect-l] Another heads up - 30k job submission failed; resubmitted a 20k batch
- From: Bob Ball <ball AT umich.edu>
- To: Shawn McKee <smckee AT umich.edu>, Rob Gardner <rwg AT hep.uchicago.edu>
- Cc: atlas-connect-l <atlas-connect-l AT lists.bnl.gov>
- Subject: Re: [Atlas-connect-l] Another heads up - 30k job submission failed; resubmitted a 20k batch
- Date: Wed, 29 Jan 2014 10:58:56 -0500
Looks like AGLT2 peaked at around 380 out of a possible max
allocation of 500 before the supply of jobs ran out.
bob
On 1/28/2014 9:13 PM, Shawn McKee wrote:
Hi Rob,
Glad to see things are working. Let us know if you see any issues with AGLT2.
Thanks,
Shawn
On Tue, Jan 28, 2014 at 9:09 PM, Rob Gardner <rwg AT hep.uchicago.edu> wrote:
All,
Just sending along a scale test: 30k jobs submitted.
(We probably need to add the rcc.uchicago.edu pool to the cycle
server.)
Current snapshot,
[rwg@login ~]$ condor_q | grep R | wc
2462 29540 201882
[rwg@login ~]$
which seems to be the current “reach” of Atlas
Connect, given our present priority settings. I
can’t wait until we get Stampede connected.
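A note on the snapshot above: `grep R` matches any line that contains a capital R, so the line count from `wc` is only approximate. Below is a minimal sketch of a stricter count. It assumes the classic condor_q column layout, in which the job state (ST) is the sixth whitespace-separated field; the function name `count_running` is hypothetical and not part of the original message.

```shell
# Count jobs whose state column is exactly "R" (running).
# Assumes the classic condor_q layout:
#   ID OWNER SUBMITTED(date time) RUN_TIME ST PRI SIZE CMD
# so ST is the 6th whitespace-separated field. Adjust if your layout differs.
count_running() {
    awk '$6 == "R" { n++ } END { print n + 0 }'
}

# Usage (reads condor_q output on stdin):
#   condor_q | count_running
```

If the installed condor_q prints different columns, the field number in the awk test would need to change accordingly.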
The only snag I hit was the number of log files
written to my home directory (I was asking for 90k files
in a single directory). Once I reduced it to 30k, all
proceeded smoothly.
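One way to sidestep very large single directories is to spread per-job log files over subdirectories. The following is a minimal sketch, assuming each job has an integer index; the function name `log_dir_for`, the `logs` prefix, and the bucket size of 1000 are illustrative choices, not taken from the original submission.

```shell
# Print the log subdirectory for a given job index, bucketing jobs so that
# each directory holds the files of at most BUCKET_SIZE jobs.
# BUCKET_SIZE and the "logs" prefix are illustrative assumptions.
BUCKET_SIZE=1000

log_dir_for() {
    job_index=$1
    bucket=$(( job_index / BUCKET_SIZE ))
    printf 'logs/%03d\n' "$bucket"
}

# Example: create the directory for job 12345 before submitting it.
#   mkdir -p "$(log_dir_for 12345)"
```

With these assumed values, a 30k-job batch would use 30 subdirectories rather than a single one.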
Some plots with the distribution:
This shows we’re getting about 280 jobs running on
AGLT2 at the moment, not bad.
Both UC3 and Fresno have topped out:
On Jan 28, 2014, at 6:59 PM, Rob Gardner <rwg AT hep.uchicago.edu>
wrote:
---
Rob Gardner • Twitter: @rwg • Skype: rwg773 • g+: rob.rwg • +1 312-804-0859 • University of Chicago
Attachment: pngiKjZhdYqGP.png (PNG image)
Attachment: pngcmIUmS6bFc.png (PNG image)
Attachment: png01N1S4ZDvf.png (PNG image)
Attachment: pngIMDFfzGl6T.png (PNG image)