ide problems? - Debian

This is a discussion on ide problems? - Debian ; Hi All, One of my servers locked up last night. I was not able to ssh in, but luckily I had a terminal open which was already connected to it. Any command I tried gave me an I/O error (which ...

+ Reply to Thread
Results 1 to 11 of 11

Thread: ide problems?

  1. ide problems?

    Hi All,

    One of my servers locked up last night. I was not able to ssh in, but
    luckily I had a terminal open which was already connected to it. Any
    command I tried gave me an I/O error (which I should have written down
    because I forget what it said).

    Once I rebooted the machine, It came back up without problems, and has
    been working fine since, but I'd like to figure out what went wrong.

    Here's what syslog had to say about it:
    ------------------------------------------------------------------------
    Jan 9 03:10:48 hostname kernel: hde: dma_timer_expiry: dma status == 0x21
    Jan 9 03:10:58 hostname kernel: hde: DMA timeout error
    Jan 9 03:10:58 hostname kernel: hde: dma timeout error: status=0xd0 {
    Busy }
    Jan 9 03:10:58 hostname kernel: ide: failed opcode was: unknown
    Jan 9 03:10:58 hostname kernel: hde: DMA disabled
    Jan 9 03:10:58 hostname kernel: PDC202XX: Primary channel reset.
    Jan 9 03:11:05 hostname kernel: ide2: reset: success
    Jan 9 03:11:26 hostname kernel: hde: dma_timer_expiry: dma status == 0x21
    Jan 9 03:12:53 hostname kernel: hde: dma_timer_expiry: dma status == 0x21
    Jan 9 03:13:03 hostname kernel: hde: DMA timeout error
    Jan 9 03:13:03 hostname kernel: hde: dma timeout error: status=0xd0 {
    Busy }
    Jan 9 03:13:03 hostname kernel: ide: failed opcode was: unknown
    Jan 9 03:13:03 hostname kernel: hde: DMA disabled
    Jan 9 03:13:03 hostname kernel: PDC202XX: Primary channel reset.
    Jan 9 03:13:38 hostname kernel: ide2: reset timed-out, status=0xd0
    Jan 9 03:13:38 hostname kernel: hde: status timeout: status=0xd0 { Busy }
    Jan 9 03:13:38 hostname kernel: ide: failed opcode was: unknown
    Jan 9 03:13:38 hostname kernel: PDC202XX: Primary channel reset.
    Jan 9 03:13:38 hostname kernel: hde: drive not ready for command
    Jan 9 03:14:07 hostname kernel: ide2: reset: success
    Jan 9 03:14:27 hostname kernel: hde: dma_timer_expiry: dma status == 0x21
    Jan 9 03:14:37 hostname kernel: hde: DMA timeout error
    Jan 9 03:14:37 hostname kernel: hde: dma timeout error: status=0xd0 {
    Busy }
    Jan 9 03:14:37 hostname kernel: ide: failed opcode was: unknown
    Jan 9 03:14:37 hostname kernel: hde: DMA disabled
    Jan 9 03:14:37 hostname kernel: PDC202XX: Primary channel reset.
    Jan 9 03:14:52 hostname kernel: ide2: reset: success
    ------------------------------------------------------------------------

    I'm trying to determine weather its the drive, or a problem with the IDE
    controller or motherboard.

    Anyone have any ideas?

    thanks!
    ~Jack

    (Running Sarge, w/2.6.13.4 vanilla kernel)


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  2. Re: ide problems?



    --On January 9, 2006 12:44:40 PM -0800 jack wrote:

    > I'm trying to determine weather its the drive, or a problem with the IDE
    > controller or motherboard.
    >
    > Anyone have any ideas?



    In my experience this is indicative of a drive on it's last legs. IDE
    drives usually take out the system (hard lock) when they go (even in
    mirrors with MD RAID, often even with 'hardware' RAIDs that aren't hardware
    RAID at all).


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  3. Re: ide problems?

    I had similar errors, ignored them for 24 hours and then wowed our users
    with 24 hours of downtime restoring from backup. It was the drive.

    apt-get install smartmontools

    ....may allow the hard drive to tell you that is failing.



    ############
    Michael Loftis wrote:
    >
    >
    > --On January 9, 2006 12:44:40 PM -0800 jack wrote:
    >
    >> I'm trying to determine weather its the drive, or a problem with the IDE
    >> controller or motherboard.
    >>
    >> Anyone have any ideas?

    >
    >
    >
    > In my experience this is indicative of a drive on it's last legs. IDE
    > drives usually take out the system (hard lock) when they go (even in
    > mirrors with MD RAID, often even with 'hardware' RAIDs that aren't
    > hardware RAID at all).
    >
    >



    --
    Dan MacNeil
    Fearless Leader, Community Software Lab
    http://thecsl.org/go/vol

    God has no hands or feet or voice except ours, and
    through these, God works. (St. Teresa of Avila)


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  4. Re: ide problems?



    --On January 9, 2006 10:20:07 PM -0500 Dan MacNeil wrote:

    > I had similar errors, ignored them for 24 hours and then wowed our users
    > with 24 hours of downtime restoring from backup. It was the drive.
    >
    > apt-get install smartmontools
    >
    > ...may allow the hard drive to tell you that is failing.


    Never have I seen S.M.A.R.T. predict nor provide any hint of an impending
    failure. Usually the first signs were the DMA failures, all the while
    SMART reporting everything A/OK, and no errors logged until just before
    they finally died, then it'd usually log some.


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  5. Re: ide problems?

    Unfortunately SATA drives are not yet covered by smartmontools AFAIK, and
    right now, I dont think there is a method of querying the drives.

    ----- Original Message -----
    From: "Michael Loftis"
    To:
    Sent: Tuesday, 10 January, 2006 11:48 AM
    Subject: Re: ide problems?


    >
    >
    > --On January 9, 2006 10:20:07 PM -0500 Dan MacNeil

    wrote:
    >
    > > I had similar errors, ignored them for 24 hours and then wowed our

    users
    > > with 24 hours of downtime restoring from backup. It was the drive.
    > >
    > > apt-get install smartmontools
    > >
    > > ...may allow the hard drive to tell you that is failing.

    >
    > Never have I seen S.M.A.R.T. predict nor provide any hint of an

    impending
    > failure. Usually the first signs were the DMA failures, all the while
    > SMART reporting everything A/OK, and no errors logged until just before
    > they finally died, then it'd usually log some.
    >
    >
    > --
    > To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    > with a subject of "unsubscribe". Trouble? Contact

    listmaster@lists.debian.org
    >
    >



    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  6. Re: ide problems?

    From: "jack"
    > Hi All,
    >
    > One of my servers locked up last night. I was not able to ssh in, but
    > luckily I had a terminal open which was already connected to it. Any
    > command I tried gave me an I/O error (which I should have written down
    > because I forget what it said).
    >
    > Once I rebooted the machine, It came back up without problems, and has
    > been working fine since, but I'd like to figure out what went wrong.
    >
    > Here's what syslog had to say about it:
    > ------------------------------------------------------------------------
    > Jan 9 03:10:48 hostname kernel: hde: dma_timer_expiry: dma status ==
    > 0x21
    > Jan 9 03:10:58 hostname kernel: hde: DMA timeout error
    > Jan 9 03:10:58 hostname kernel: hde: dma timeout error: status=0xd0
    > { Busy }
    > Jan 9 03:10:58 hostname kernel: ide: failed opcode was: unknown
    > Jan 9 03:10:58 hostname kernel: hde: DMA disabled
    > Jan 9 03:10:58 hostname kernel: PDC202XX: Primary channel reset.
    > Jan 9 03:11:05 hostname kernel: ide2: reset: success
    > Jan 9 03:11:26 hostname kernel: hde: dma_timer_expiry: dma status ==
    > 0x21
    > Jan 9 03:12:53 hostname kernel: hde: dma_timer_expiry: dma status ==
    > 0x21
    > Jan 9 03:13:03 hostname kernel: hde: DMA timeout error
    > Jan 9 03:13:03 hostname kernel: hde: dma timeout error: status=0xd0
    > { Busy }
    > Jan 9 03:13:03 hostname kernel: ide: failed opcode was: unknown
    > Jan 9 03:13:03 hostname kernel: hde: DMA disabled
    > Jan 9 03:13:03 hostname kernel: PDC202XX: Primary channel reset.
    > Jan 9 03:13:38 hostname kernel: ide2: reset timed-out, status=0xd0
    > Jan 9 03:13:38 hostname kernel: hde: status timeout: status=0xd0 {
    > Busy }
    > Jan 9 03:13:38 hostname kernel: ide: failed opcode was: unknown
    > Jan 9 03:13:38 hostname kernel: PDC202XX: Primary channel reset.
    > Jan 9 03:13:38 hostname kernel: hde: drive not ready for command
    > Jan 9 03:14:07 hostname kernel: ide2: reset: success
    > Jan 9 03:14:27 hostname kernel: hde: dma_timer_expiry: dma status ==
    > 0x21
    > Jan 9 03:14:37 hostname kernel: hde: DMA timeout error
    > Jan 9 03:14:37 hostname kernel: hde: dma timeout error: status=0xd0
    > { Busy }
    > Jan 9 03:14:37 hostname kernel: ide: failed opcode was: unknown
    > Jan 9 03:14:37 hostname kernel: hde: DMA disabled
    > Jan 9 03:14:37 hostname kernel: PDC202XX: Primary channel reset.
    > Jan 9 03:14:52 hostname kernel: ide2: reset: success
    > ------------------------------------------------------------------------
    >
    > I'm trying to determine weather its the drive, or a problem with the
    > IDE controller or motherboard.
    >
    > Anyone have any ideas?


    Hi Jack,

    I've got the same problem. My first try was to disable DMA, then
    everything works fine. After enabling DMA they appear again, I solved
    this problem with new cables, the old looks fine, but they wasn't.

    Greetings
    Daniel


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  7. Re: ide problems?

    Oh it does, it'll catch a failing drive about 20-30% of the time (from
    someone who works in a returns department)

    Michael Loftis wrote:

    >
    >
    > --On January 9, 2006 10:20:07 PM -0500 Dan MacNeil
    > wrote:
    >
    >> I had similar errors, ignored them for 24 hours and then wowed our users
    >> with 24 hours of downtime restoring from backup. It was the drive.
    >>
    >> apt-get install smartmontools
    >>
    >> ...may allow the hard drive to tell you that is failing.

    >
    >
    > Never have I seen S.M.A.R.T. predict nor provide any hint of an
    > impending failure. Usually the first signs were the DMA failures, all
    > the while SMART reporting everything A/OK, and no errors logged until
    > just before they finally died, then it'd usually log some.
    >
    >



    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  8. Re: ide problems?

    Dave Watkins wrote:
    > Oh it does, it'll catch a failing drive about 20-30% of the time (from
    > someone who works in a returns department)
    >
    > Michael Loftis wrote:
    >
    >
    >>
    >>--On January 9, 2006 10:20:07 PM -0500 Dan MacNeil
    >>wrote:
    >>
    >>
    >>>I had similar errors, ignored them for 24 hours and then wowed our users
    >>>with 24 hours of downtime restoring from backup. It was the drive.
    >>>
    >>>apt-get install smartmontools
    >>>
    >>>...may allow the hard drive to tell you that is failing.

    >>
    >>
    >>Never have I seen S.M.A.R.T. predict nor provide any hint of an
    >>impending failure. Usually the first signs were the DMA failures, all
    >>the while SMART reporting everything A/OK, and no errors logged until
    >>just before they finally died, then it'd usually log some.
    >>
    >>

    >
    >
    >


    Thanks for all your suggestions!


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  9. Re: ide problems?

    On 2006-01-10, Dan MacNeil wrote:
    >
    > apt-get install smartmontools
    >
    > ...may allow the hard drive to tell you that is failing.
    >

    Or, on a desktop system, install smart-notifier and keep it running all
    the time.

    AFAIK it uses D-BUS to display warnings from within X11.


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  10. Re: ide problems?

    On Friday 23 June 2006 07:35, Jason Spiro wrote:
    > On 2006-01-10, Dan MacNeil wrote:
    > > apt-get install smartmontools
    > >
    > > ...may allow the hard drive to tell you that is failing.

    >
    > Or, on a desktop system, install smart-notifier and keep it running
    > all the time.


    Is this an apt-gettable package?

    -A


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

  11. Re: ide problems?

    Alexandros Papadopoulos wrote:
    > On Friday 23 June 2006 07:35, Jason Spiro wrote:
    >
    >>On 2006-01-10, Dan MacNeil wrote:
    >>
    >>>apt-get install smartmontools
    >>>
    >>>...may allow the hard drive to tell you that is failing.

    >>
    >>Or, on a desktop system, install smart-notifier and keep it running
    >>all the time.

    >
    >
    > Is this an apt-gettable package?
    >
    > -A
    >
    >



    Hi,

    This is the result of a simple search in http://packages.debian.org:


    SMART-NOTIFIER:

    Testing:
    http://packages.debian.org/testing/utils/smart-notifier

    Unstable:
    http://packages.debian.org/unstable/...smart-notifier


    SMARTMONTOOLS:

    Stable:
    http://packages.debian.org/stable/utils/smartmontools

    Testing:
    http://packages.debian.org/testing/utils/smartmontools

    Unstable:
    http://packages.debian.org/unstable/utils/smartmontools



    Greetings,

    --
    José Marcos Chalmés Garc*a
    mailto:chalmes_jos@gva.es
    Conselleria de Sanitat
    UIT - Sistemes
    Av. Campanar, 21. 46009 - Valencia


    --
    To UNSUBSCRIBE, email to debian-isp-REQUEST@lists.debian.org
    with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

+ Reply to Thread