Subject: AT LAST: THE GOOD NEWS!!!
From: Michael J Mulhearn
Date: Mon, 30 Jan 2006 21:23:22 -0500 (EST)
To: D0 Run2b L1Cal Trigger
CC: zmuda@fnal.gov, Li Zhang , Jaroslav Ban

Hello!

It appears that decabling a large number of ADF->TAB cables is unnecessary!

This afternoon, following the suggestions of many of you, Ted Zmuda and I made some substantial progress on our LVDS issues. As you might expect, it appears we had several problems that needed to be addressed as an "AND" before progress could be observed.

SUMMARY:

1)  The TAB timing with respect to the ADF was too tight.  This was of course the very first thing I tried two weeks ago, but, for reasons I don't understand yet--possibly involving the new firmware and the initial master reset--it wasn't having the desired effect.  In any event, the timing has now been backed off to good effect.  For now I have moved it exactly 132 ns, we will have to see if we can get by with less as this is comparable to our latency budget.

2)  The desquewing sequence is *essential*.

3)  One of the four ADF crates (ADF crate B) is misbehaving, we have
    powered it down, and will keep it so until Dan arrives on wednesday.

4)  There are then several single channel issues:
    -> bent pins on TAB/GAB backplane (not yet fixed!)
        -> some ATC/ADF outputs appear to be invalid,
           and have been swapped with their neighbors.
        -> some cables appear to be bad, and have been
           replaced with spare cables, run informally.

We test after 10 seconds on one TAB the following on board status bits for each input:

 -> parity:  xor of data equals transmitted parity word
 -> sync:    bit is high for least significant bit
 -> bc err:  transmitted BC equals BC TAB receives from SCL.

After all of these fixes, we finally see some nice results! The only errors (non zero status) on one TAB are sync and BC errors, all confirmed as coming from the one ADF crate which is powered down, or from unplugged cables with visible bent pins on the TAB/GAB backplane.  There are no parity errors.

Here we show the parity, sync and BC ERR status after 10 seconds, for each of the three inputs of each of the 10 sliding windows chips of a TAB in slot 3 (count from 1):

CHIP  PARITY  SYNC    BC ERR
0:    0 0 0,  0 0 0,  0 0 0
1:    0 0 0,  0 0 0,  0 0 0
2:    0 0 0,  0 1 1,  0 1 1    <- 2 bent pins
3:    0 0 0,  1 1 1,  1 1 1    <- down ATC crate
4:    0 0 0,  0 0 0,  0 0 0
5:    0 0 0,  0 0 0,  0 0 0
6:    0 0 0,  0 0 1,  0 0 1    <- down ATC crate
7:    0 0 0,  1 1 1,  1 1 1    <- down ATC crate
8:    0 0 0,  0 0 0,  0 0 0
9:    0 0 0,  0 0 0,  0 0 0

We have tested two TABs in this slot with identical results.  Also, no new errors emerge after 100 s instead of 10 s.  Even longer tests will follow soon.

We have also tested another slot, there are about two problems evidently of type (4) above that we will need to address.

CONCLUSION/PROJECTION:

We seem to have found the systematic problem(s)!

However, there still appear to be problems on a few individual channels per TAB that will have to fixed individually.  We will get firmer numbers, but it appears that the 30 spares that have been ordered will be sufficient.

If anyone has any questions, I will be in a Chicago pub trying to forget the past two weeks!

cheers,
mike

















master reset is not needed before the SCL delay can be adjusted.