Adaptec AIC-79xx Problems - Debian

This is a discussion on Adaptec AIC-79xx Problems - Debian ; Dear List, We recently purchased a Supermicro server with an onboard adaptec scsi controller. 0000:03:02.0 SCSI storage controller: Adaptec AIC-7902B U320 (rev 10) 0000:03:02.1 RAID bus controller: Adaptec AIC-7902(B) U320 w/HostRAID (rev 10) After seeing *** Dump Card State Ends ...

+ Reply to Thread
Results 1 to 6 of 6

Thread: Adaptec AIC-79xx Problems

  1. Adaptec AIC-79xx Problems

    Dear List,

    We recently purchased a Supermicro server with an onboard adaptec
    scsi controller.

    0000:03:02.0 SCSI storage controller: Adaptec AIC-7902B U320 (rev 10)
    0000:03:02.1 RAID bus controller: Adaptec AIC-7902(B) U320 w/HostRAID
    (rev 10)

    After seeing *** Dump Card State Ends *** the box crashes.

    Does anyone else have these problems?


    Thanks

    Andrew


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  2. Re: Adaptec AIC-79xx Problems

    Andrew Miehs wrote:
    > Dear List,
    >
    > We recently purchased a Supermicro server with an onboard adaptec scsi
    > controller.
    >
    > 0000:03:02.0 SCSI storage controller: Adaptec AIC-7902B U320 (rev 10)
    > 0000:03:02.1 RAID bus controller: Adaptec AIC-7902(B) U320 w/HostRAID
    > (rev 10)
    >
    > After seeing *** Dump Card State Ends *** the box crashes.


    Look at termination, cable length, cable type (did you use LVD cables)

    Remove all devices, and try to boot the system (to see if you can go
    beyond this point...) if it still fails -> look at the setup of the
    adapter...

    and as last: does the adapter shares it's interrupt with an other device?

    --
    JJ van Gorkum Knowledge Zone
    If UNIX isn't the solution, you've got the wrong problem


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  3. Re: Adaptec AIC-79xx Problems

    The machine boots but as soon as we start writing heavily to the
    disks...

    Jun 28 12:00:53 localhost kernel: scsi0: SCBPTR == 0xff0d, SCB_NEXT
    == 0xff00, SCB_NEXT2 == 0x0
    Jun 28 12:00:53 localhost kernel: CDB d 1 0 0 0 0
    Jun 28 12:00:53 localhost kernel: STACK: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
    Jun 28 12:00:53 localhost kernel: <<<<<<<<<<<<<<<<< Dump Card State
    Ends >>>>>>>>>>>>>>>>>>
    Jun 28 12:00:53 localhost kernel: DevQ(0:0:0): 0 waiting
    Jun 28 12:00:53 localhost kernel: DevQ(0:1:0): 0 waiting
    Jun 28 12:00:53 localhost kernel: DevQ(0:2:0): 0 waiting
    Jun 28 12:00:53 localhost kernel: DevQ(0:8:0): 0 waiting
    Jun 28 12:00:53 localhost kernel: Recovery SCB completes
    Jun 28 12:00:53 localhost kernel: scsi0:0:1:0: Cmd aborted from QINFIFO


    As for the interrupts:

    :~# lspci
    0000:00:00.0 Host bridge: Intel Corp. Server Memory Controller Hub
    (rev 0c)
    0000:00:00.1 ff00: Intel Corp. Memory Controller Hub Error Reporting
    Register (rev 0c)
    0000:00:01.0 System peripheral: Intel Corp. Memory Controller Hub DMA
    Controller (rev 0c)
    0000:00:02.0 PCI bridge: Intel Corp. Memory Controller Hub PCI
    Express Port A0 (rev 0c)
    0000:00:03.0 PCI bridge: Intel Corp. Memory Controller Hub PCI
    Express Port A1 (rev 0c)
    0000:00:04.0 PCI bridge: Intel Corp. Memory Controller Hub PCI
    Express Port B0 (rev 0c)
    0000:00:06.0 PCI bridge: Intel Corp. Memory Controller Hub PCI
    Express Port C0 (rev 0c)
    0000:00:1d.0 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB
    UHCI #1 (rev 02)
    0000:00:1d.1 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB
    UHCI #2 (rev 02)
    0000:00:1d.2 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB
    UHCI #3 (rev 02)
    0000:00:1d.3 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB
    UHCI #4 (rev 02)
    0000:00:1d.7 USB Controller: Intel Corp. 82801EB/ER (ICH5/ICH5R) USB2
    EHCI Controller (rev 02)
    0000:00:1e.0 PCI bridge: Intel Corp. 82801 PCI Bridge (rev c2)
    0000:00:1f.0 ISA bridge: Intel Corp. 82801EB/ER (ICH5/ICH5R) LPC
    Bridge (rev 02)
    0000:00:1f.1 IDE interface: Intel Corp. 82801EB/ER (ICH5/ICH5R) Ultra
    ATA 100 Storage Controller (rev 02)
    0000:00:1f.3 SMBus: Intel Corp. 82801EB/ER (ICH5/ICH5R) SMBus
    Controller (rev 02)
    0000:02:00.0 PCI bridge: Intel Corp. PCI Bridge Hub A (rev 09)
    0000:02:00.1 PIC: Intel Corp. PCI Bridge Hub I/OxAPIC Interrupt
    Controller A (rev 09)
    0000:02:00.2 PCI bridge: Intel Corp. PCI Bridge Hub B (rev 09)
    0000:02:00.3 PIC: Intel Corp. PCI Bridge Hub I/OxAPIC Interrupt
    Controller B (rev 09)
    0000:03:02.0 SCSI storage controller: Adaptec AIC-7902B U320 (rev 10)
    0000:03:02.1 SCSI storage controller: Adaptec AIC-7902B U320 (rev 10)
    0000:04:02.0 Ethernet controller: Intel Corp. 82546GB Gigabit
    Ethernet Controller (rev 03)
    0000:04:02.1 Ethernet controller: Intel Corp. 82546GB Gigabit
    Ethernet Controller (rev 03)
    0000:07:01.0 VGA compatible controller: ATI Technologies Inc Rage XL
    (rev 27)



    On 28/06/2006, at 12:37 PM, JJ van Gorkum wrote:
    > Look at termination, cable length, cable type (did you use LVD cables)


    This is all 'out of the box' SuperMicro equipment with cables etc
    already in place with a supermicro 8 drive hard disk cage.

    >
    > Remove all devices, and try to boot the system (to see if you can go
    > beyond this point...) if it still fails -> look at the setup of the
    > adapter...
    >
    > and as last: does the adapter shares it's interrupt with an other
    > device?
    >



    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  4. Re: Adaptec AIC-79xx Problems

    Andrew Miehs wrote:
    > Dear List,
    >
    > We recently purchased a Supermicro server with an onboard adaptec scsi
    > controller.
    >
    > 0000:03:02.0 SCSI storage controller: Adaptec AIC-7902B U320 (rev 10)
    > 0000:03:02.1 RAID bus controller: Adaptec AIC-7902(B) U320 w/HostRAID
    > (rev 10)
    >
    > After seeing *** Dump Card State Ends *** the box crashes.
    >
    > Does anyone else have these problems?
    >
    >
    > Thanks
    >
    > Andrew
    >
    >
    > --To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    > with a subject of "unsubscribe". Trouble? Contact
    > listmaster@lists.debian.org
    >
    >

    Yes I do!

    Lots! Been struggling for ages to find a reason.

    Haven't found the exact cause yet, replaced EVERY single component
    barring cpus and chassis and it still happened.
    recently been putting all the disks in this box on separate power
    supplies by jerry-rigging them to work without being plugged into mobo.
    The more power supplies I use, the less errors, I'm now running 13 SCSI
    disks off 3x550W psus + 1x300W and I haven't had the error!
    This may not be your problem.

    Also look at cabling/termination. I thought mine was this but after
    changing many disks and many cables, I figure mine is nothing to do with
    this.

    just my 2 cents.

    -h


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  5. Re: Adaptec AIC-79xx Problems

    I have a second box now up and running, and this does not seem to
    have the problem. I am starting to get the feeling that its the
    backplane on the SuperMicro SCA cage...

    Andrew


    On 28/06/2006, at 3:18 PM, Hari Sekhon wrote:

    > Andrew Miehs wrote:
    >> Dear List,
    >>
    >> We recently purchased a Supermicro server with an onboard adaptec
    >> scsi controller.
    >>
    >> 0000:03:02.0 SCSI storage controller: Adaptec AIC-7902B U320 (rev 10)
    >> 0000:03:02.1 RAID bus controller: Adaptec AIC-7902(B) U320 w/
    >> HostRAID (rev 10)
    >>
    >> After seeing *** Dump Card State Ends *** the box crashes.
    >>
    >> Does anyone else have these problems?
    >>
    >>
    >> Thanks
    >>
    >> Andrew
    >>
    >>
    >> --To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    >> with a subject of "unsubscribe". Trouble? Contact
    >> listmaster@lists.debian.org
    >>
    >>

    > Yes I do!
    >
    > Lots! Been struggling for ages to find a reason.
    >
    > Haven't found the exact cause yet, replaced EVERY single component
    > barring cpus and chassis and it still happened.
    > recently been putting all the disks in this box on separate power
    > supplies by jerry-rigging them to work without being plugged into
    > mobo.
    > The more power supplies I use, the less errors, I'm now running 13
    > SCSI disks off 3x550W psus + 1x300W and I haven't had the error!
    > This may not be your problem.
    >
    > Also look at cabling/termination. I thought mine was this but after
    > changing many disks and many cables, I figure mine is nothing to do
    > with this.
    >
    > just my 2 cents.
    >
    > -h
    >
    >
    > --
    > To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    > with a subject of "unsubscribe". Trouble? Contact
    > listmaster@lists.debian.org
    >



    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  6. Re: Adaptec AIC-79xx Problems

    On Wed, 28 Jun 2006, Andrew Miehs wrote:
    > I have a second box now up and running, and this does not seem to
    > have the problem. I am starting to get the feeling that its the
    > backplane on the SuperMicro SCA cage...


    I have seen bad backplanes do very nasty stuff that resulted in disk errors
    and data loss. However, if the driver is crashing the box instead of just
    killing all IO to the affected SCSI bus, then either the driver is buggy
    crap (and crashes the kernel because of it), or the hardware is crap that
    doesn't have a place in a server room.

    I had issues with an onboard AIC7901A that looked very much like the one
    you're having (but the kernel didn't hang, it just oopsed). They were
    solved in 2.6.12, but I haven't tried any newer kernels on that particular
    box (which is in an isolated, secure lan).

    --
    "One disk to rule them all, One disk to find them. One disk to bring
    them all and in the darkness grind them. In the Land of Redmond
    where the shadows lie." -- The Silicon Valley Tarot
    Henrique Holschuh


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

+ Reply to Thread