sphenix-magnet-l AT lists.bnl.gov
Subject: sPHENIX discussion of the superconducting solenoid
List archive
Re: [Sphenix-magnet-l] It's the "watch dog timer" (which open the dump resistor contactor) !!!
- From: "Yip, Kin" <kinyip AT bnl.gov>
- To: "sphenix-magnet-l AT lists.bnl.gov" <sphenix-magnet-l AT lists.bnl.gov>
- Cc: "Sandberg, Jon N" <jsandberg AT bnl.gov>, "Schoenfeld, Ralph" <ralphs AT bnl.gov>, "Morris, John" <jtm AT bnl.gov>, "Costanzo, Michael" <mcostanzo AT bnl.gov>
- Subject: Re: [Sphenix-magnet-l] It's the "watch dog timer" (which open the dump resistor contactor) !!!
- Date: Tue, 20 Jun 2023 02:18:21 +0000
Hello,
I communicated with Carl, Chung and Ralph after my last two emails in the last few days.
Our sPHENIX colleagues are hoping that maybe, we can turn the magnet back to full-field after Wednesday’s Maintenance Day (12 hours). I am not sure whether this can be achieved but it may be a good goal …
Kin
From: Yip,
Kin
Hello, Carl and all,
Up to last Friday, I’ve always mistakenly treated your “Watch Dog Timer” as some (unnecessary) companion to the quench-interlock signal.
I’ve been thinking since Friday … Yesterday late afternoon or evening, I started to remember seeing the “watch dog timer” being >1 second earlier than “quench detector” (interlock) signal on the last 4600 A crash on June 11. Then, I realized that last Fri., I/we didn’t check the PLC when we observed the delay of the Labview stopping after we heard the “loud” sound of dump-resistor contactor closing. I realized that it could be ~15 s late (as we were “playing” with 15 s Timeout/delay accidentally).
This morning, I couldn’t wait any more and I just drove to 1008B to look at the PLC history (in front of the Magnet Power Supply).
I include a screenshot of the PLC history on June 16 (last Friday) when we were experiencing the 15 s delay. You can see that there were 2 such instances. Please ignore the lines with the status of “Fault Cleared” which was when I cleared the interlocks on June 16. ( Just look at the lines of “Fault”. )
But for both instances, you can see the “Quench Watch Dog Timer” is ~14 or 15 s earlier than “Quench Detector” !! The “Dump Resistor Contactor Opened” happened at the same second as the “Watch Dog Timer” (though the screen is not big enough to capture the earlier “Dump … Opened”).
I also attach the screenshot of June 11 (last 4600 A crash). Then, at 9:57 am, the “Watch Dog Timer” actually appeared actually at 9:57:28 before “Dump … Opened” at 9:57:29 and “Quench Detector” at 9:57:30. ( On June 7, we changed the code at that time to prevent the PLL-LOSS from stopping the program but there was still the 1 or 0.9xx second delay. )
It seems clear to me that the “quench-detector (interlock) signal” really didn’t get to the PLC/Power-Supply to initiate the fast discharge but it’s your “Watch Dog Timer” (that I have mentally ignored until now) that had opened the Dump Resistor Contactor. The quench-interlock delay was not just on the logged data but real. It’s the “Watch Dog Timer” that has “saved the day”.
Your “Watch Dog Timer” is probably a bit similar to the PLL-loss check in the Labview but it doesn’t have the “Timeout”/delay. We don’t have “Watch Dog Timer” in the logged data but we should add ?
Speculation/wishful thinking: maybe, the “Watch Dog Timer” could be the signal that went down to open Dump Resistor Contactor before everything else ? In some instances, the delay due to the PLL-loss check was long enough (when the Labview couldn’t read the clock signal) after “Watch Dog Timer” already opened Dump Resistor Contactor that “neg_90_deg_offset_jnt” changed enough (>50 mV) to cause the Labview to stop (and finally pull the “interlock” signal). This wouldn’t happen for the 100-150 A test as there wasn’t enough change, but it would for 4600 A. { Maybe, Chung and Chris can help us figure out. }
Kin
From:
Yip, Kin
Hello, Chung and Degen (and all),
This email has two subjects and the first one may be the answer to the question Chung, Degen, Carl and I have been wondering in the last couple of days.
suddenly opened (or otherwise bad) and Labview just couldn’t read the data any more at that time !
Only one cable contains “clock” signal and we swapped the bad one with the other one (as the other one doesn’t carry or need the clock signal).
Labview has been running OK. Let’s wait to see …
As I said, this is clearly shown in the “gif” file of the LogView ! We noticed this comparatively easily this afternoon because the Timeout was 15 s !!!
Therefore, our observation that the voltage-tap signal changed before the quench-interlock might not be true
?! 🤔
This may have important implication ?! Kin
From:
Ho, Chung <chungh AT bnl.gov>
From: Degen, Christopher <degen AT bnl.gov>
How could we have under-run our sample buffer with a 5.0 second timeout??? It’s almost as if our clock stopped.
That is the Questions we need the answer
Should the sample clock source be set to Slot2/PFI1 for all three boards? I’m not sure. I believe all u need is connected it to one of the board . -Chris
From:
Ho, Chung <chungh AT bnl.gov>
From: Degen, Christopher <degen AT bnl.gov>
I have the following observations/questions (incoherent ramblings?) about 24-Ch Loop in Loop v14.vi. The software is complicated, and I don’t have a good understanding of how it works yet.
There appear to be different sample clock configurations for the three analog input cards. Is this intentional?
Carl forgot do all 3 I will fixed it
Do these values ever get out of sync? Don’t know Charlie told me just watch the Avg loop time < 17 ms
This morning’s error seem to indicate that all 3 boards maintained sync, yet we under-ran the buffer. I’m interested in the history of these values prior to a failure. They could be plotted in a waveform chart, to provide a simple post mortum of their values.
Why do we read the three analog input boards sequentially, as opposed to simultaneously? Don’t know Design by Charlie and Zenyep
Also, the timeout for the read of board 1 is 5.0 seconds, and board 2 & 3 are 0.1 seconds. This probably doesn’t matter due to the sequential board reads.
Yet we appear to have under-run the sample buffer, even with a 5.0 second timeout! What happened to our sample clock?? Remain the same 730 Hz
How long did this run until it failed? Somtime days sometime hours
Regards, Chris
Christopher M. Degen Brookhaven National Laboratory Building 924 Upton NY 11973 USA Voice: (631) 344-2492
|
-
[Sphenix-magnet-l] bad ribbon cable ... and quench-interlock-signal in Logged data is delayed by the "Timeout" but not to power supply,
Yip, Kin, 06/16/2023
- Message not available
-
Message not available
- Re: [Sphenix-magnet-l] It's the "watch dog timer" (which open the dump resistor contactor) !!!, Yip, Kin, 06/19/2023
Archive powered by MHonArc 2.6.24.