Problem with dump stalling - FreeBSD

This is a discussion on Problem with dump stalling - FreeBSD ; Hi If I have the wrong list please feel free to redirect me. I'm running 7.0-RELEASE-p4 and trying to backup to an external USB drive. I'm using the following command dump -a0Lf /backup/diskimages/root /dev/mfid0s1a Where df: Filesystem 1K-blocks Used Avail ...

+ Reply to Thread
Results 1 to 7 of 7

Thread: Problem with dump stalling

  1. Problem with dump stalling

    Hi



    If I have the wrong list please feel free to redirect me.



    I'm running 7.0-RELEASE-p4 and trying to backup to an external USB
    drive.



    I'm using the following command

    dump -a0Lf /backup/diskimages/root /dev/mfid0s1a



    Where df:

    Filesystem 1K-blocks Used Avail Capacity Mounted on

    /dev/mfid0s1a 507630 208436 258584 45% /

    /dev/da1s1d 709513458 9853800 642898582 2% /backup



    The following processes start

    75399 p0 I+ 0:00.04 dump -a0Lf /backup/diskimages/root
    /dev/mfid0s1a (dump)

    75402 p0 S+ 0:00.71 dump: /dev/mfid0s1a: pass 4: 77.05% done,
    finished in 0:00 at Wed Oct 8 08:25:06 2008 (dump)

    75403 p0 S+ 0:00.85 dump -a0Lf /backup/diskimages/root
    /dev/mfid0s1a (dump)

    75404 p0 S+ 0:00.96 dump -a0Lf /backup/diskimages/root
    /dev/mfid0s1a (dump)

    75405 p0 S+ 0:00.86 dump -a0Lf /backup/diskimages/root
    /dev/mfid0s1a (dump)



    But it just stops at a random percentage, the system continues to run
    and the processes are killable?



    Perhaps I'm using dump incorrectly if anyone could help would be greatly
    appreciated.



    Regards



    --

    David Peall :: IT Manager

    e-Schools' Network :: http://www.esn.org.za/

    Phone +27 (021) 674-9140

    _______________________________________________
    freebsd-stable@freebsd.org mailing list
    http://lists.freebsd.org/mailman/lis...freebsd-stable
    To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"


  2. Re: Problem with dump stalling

    On Wed, Oct 08, 2008 at 10:29:00AM +0200, David Peall wrote:
    > Hi
    >
    > If I have the wrong list please feel free to redirect me.
    >
    > I'm running 7.0-RELEASE-p4 and trying to backup to an external USB
    > drive.
    >
    > I'm using the following command
    >
    > dump -a0Lf /backup/diskimages/root /dev/mfid0s1a
    >
    > Where df:
    >
    > Filesystem 1K-blocks Used Avail Capacity Mounted on
    > /dev/mfid0s1a 507630 208436 258584 45% /
    > /dev/da1s1d 709513458 9853800 642898582 2% /backup
    >
    > The following processes start
    >
    > 75399 p0 I+ 0:00.04 dump -a0Lf /backup/diskimages/root /dev/mfid0s1a (dump)
    > 75402 p0 S+ 0:00.71 dump: /dev/mfid0s1a: pass 4: 77.05% done, finished in 0:00 at Wed Oct 8 08:25:06 2008 (dump)
    > 75403 p0 S+ 0:00.85 dump -a0Lf /backup/diskimages/root /dev/mfid0s1a (dump)
    > 75404 p0 S+ 0:00.96 dump -a0Lf /backup/diskimages/root /dev/mfid0s1a (dump)
    > 75405 p0 S+ 0:00.86 dump -a0Lf /backup/diskimages/root /dev/mfid0s1a (dump)
    >
    > But it just stops at a random percentage, the system continues to run
    > and the processes are killable?
    >
    > Perhaps I'm using dump incorrectly if anyone could help would be greatly
    > appreciated.


    This is a known problem with dump on UFS2 filesystems. See
    "dump/restore" here:

    http://wiki.freebsd.org/JeremyChadwi...eported_issues

    There is no fix, AFAIK.

    My recommendation is to use something else. I'm particularly fond of
    rsnapshot, but be aware that rsync will cause file atimes to be lost
    (on the source) when copying; this can impact classic UNIX mail
    spools (mbox), where people use clients like mutt/pine which utilise
    atimes to determine if there's new mail or not.

    --
    | Jeremy Chadwick jdc at parodius.com |
    | Parodius Networking http://www.parodius.com/ |
    | UNIX Systems Administrator Mountain View, CA, USA |
    | Making life hard for others since 1977. PGP: 4BD6C0CB |

    _______________________________________________
    freebsd-stable@freebsd.org mailing list
    http://lists.freebsd.org/mailman/lis...freebsd-stable
    To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"


  3. Re: Problem with dump stalling

    On Thu, Oct 09, 2008 at 12:08:17PM +0200, Christoph Schug wrote:
    > On Wed, Oct 08, 2008, David Peall wrote:
    >
    > > I'm running 7.0-RELEASE-p4 and trying to backup to an external USB
    > > drive.
    > >
    > > I'm using the following command
    > >
    > > dump -a0Lf /backup/diskimages/root /dev/mfid0s1a

    > [...]
    > > But it just stops at a random percentage, the system continues to run
    > > and the processes are killable?

    >
    > This is a know bug which mainly affects multi-core CPU systems. It is
    > fixed in RELENG_7 see also PR bin/121684 [1] where the relevant patch
    > is referenced.
    >
    > [1] http://www.freebsd.org/cgi/query-pr.cgi?pr=bin/121684


    This was fixed on HEAD in revision 1.48 (March 13th), with the comment
    "MFC: 1 week".

    The commit to RELENG_7 did not happen until April 19th, see revision
    1.39.2.3:

    http://www.freebsd.org/cgi/cvsweb.cg...r_sleepqueue.c

    The PR referenced in the CVS commit is PR 117603.

    David, can you verify you're using a version of
    src/sys/kern/subr_sleepqueue.c that is 1.39.2.3 or newer?

    If so, the problem David is experiencing is different. If not, David
    will need to csup and then rebuild world and kernel (do NOT just do
    one; do both) to pick up the changes.

    --
    | Jeremy Chadwick jdc at parodius.com |
    | Parodius Networking http://www.parodius.com/ |
    | UNIX Systems Administrator Mountain View, CA, USA |
    | Making life hard for others since 1977. PGP: 4BD6C0CB |

    _______________________________________________
    freebsd-stable@freebsd.org mailing list
    http://lists.freebsd.org/mailman/lis...freebsd-stable
    To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"


  4. RE: Problem with dump stalling

    > -----Original Message-----
    > From: owner-freebsd-stable@freebsd.org [mailtowner-freebsd-
    > stable@freebsd.org] On Behalf Of Jeremy Chadwick
    > Sent: 09 October 2008 12:45 PM
    > To: Christoph Schug
    > Cc: freebsd-stable@freebsd.org; David Peall
    > Subject: Re: Problem with dump stalling
    >
    >
    > This was fixed on HEAD in revision 1.48 (March 13th), with the comment
    > "MFC: 1 week".
    >
    > The commit to RELENG_7 did not happen until April 19th, see revision
    > 1.39.2.3:
    >
    > http://www.freebsd.org/cgi/cvsweb.cg...r_sleepqueue.c
    >
    > The PR referenced in the CVS commit is PR 117603.
    >
    > David, can you verify you're using a version of
    > src/sys/kern/subr_sleepqueue.c that is 1.39.2.3 or newer?
    >
    > If so, the problem David is experiencing is different. If not, David
    > will need to csup and then rebuild world and kernel (do NOT just do
    > one; do both) to pick up the changes.


    I have two identical machines the second one is displaying the same
    problems. I'm going to do a cvsup update and build work and kernel, the
    only partition on this server that is UFS is the /boot/ partition as its
    running ZFS on the others yet is still fails the dump that.

    If there is anyone willing to dedicate some time to this problem I will
    assist where I can?

    Regards
    --
    David Peall :: IT Manager
    e-Schools' Network :: http://www.esn.org.za/
    Phone +27 (021) 674-9140
    _______________________________________________
    freebsd-stable@freebsd.org mailing list
    http://lists.freebsd.org/mailman/lis...freebsd-stable
    To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"


  5. Re: Problem with dump stalling

    On Thu, Oct 09, 2008 at 01:58:39PM +0200, David Peall wrote:
    > > -----Original Message-----
    > > From: owner-freebsd-stable@freebsd.org [mailtowner-freebsd-
    > > stable@freebsd.org] On Behalf Of Jeremy Chadwick
    > > Sent: 09 October 2008 12:45 PM
    > > To: Christoph Schug
    > > Cc: freebsd-stable@freebsd.org; David Peall
    > > Subject: Re: Problem with dump stalling
    > >
    > > This was fixed on HEAD in revision 1.48 (March 13th), with the comment
    > > "MFC: 1 week".
    > >
    > > The commit to RELENG_7 did not happen until April 19th, see revision
    > > 1.39.2.3:
    > >
    > > http://www.freebsd.org/cgi/cvsweb.cg...r_sleepqueue.c
    > >
    > > The PR referenced in the CVS commit is PR 117603.
    > >
    > > David, can you verify you're using a version of
    > > src/sys/kern/subr_sleepqueue.c that is 1.39.2.3 or newer?
    > >
    > > If so, the problem David is experiencing is different. If not, David
    > > will need to csup and then rebuild world and kernel (do NOT just do
    > > one; do both) to pick up the changes.
    > >

    >
    > I'm using stable:
    > FreeBSD ochre.school.za 7.0-RELEASE-p4 FreeBSD 7.0-RELEASE-p4 #0: Tue
    > Sep 2 18:48:24 UTC 2008
    > root@amd64-builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC amd64
    >
    > I don't update via source only via binaries, how would I tell?


    I'm not sure. I don't use freebsd-update, so I don't do binary-only
    upgrades. Someone else will have to chime in here for assistance. But
    I can tell you that freebsd-update does not get all the updates to
    certain things.

    --
    | Jeremy Chadwick jdc at parodius.com |
    | Parodius Networking http://www.parodius.com/ |
    | UNIX Systems Administrator Mountain View, CA, USA |
    | Making life hard for others since 1977. PGP: 4BD6C0CB |

    _______________________________________________
    freebsd-stable@freebsd.org mailing list
    http://lists.freebsd.org/mailman/lis...freebsd-stable
    To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"


  6. RE: Problem with dump stalling

    > -----Original Message-----
    > From: owner-freebsd-stable@freebsd.org [mailtowner-freebsd-
    > stable@freebsd.org] On Behalf Of Gary Palmer
    > Sent: 09 October 2008 05:36 PM
    > To: David Peall
    > Cc: freebsd-stable@freebsd.org; Jeremy Chadwick; Christoph Schug
    > Subject: Re: Problem with dump stalling
    >
    > > > -----Original Message-----
    > > > From: owner-freebsd-stable@freebsd.org [mailtowner-freebsd-
    > > > stable@freebsd.org] On Behalf Of Jeremy Chadwick
    > > > Sent: 09 October 2008 12:45 PM
    > > > To: Christoph Schug
    > > > Cc: freebsd-stable@freebsd.org; David Peall
    > > > Subject: Re: Problem with dump stalling
    > > >
    > > > David, can you verify you're using a version of
    > > > src/sys/kern/subr_sleepqueue.c that is 1.39.2.3 or newer?
    > > >
    > > > If so, the problem David is experiencing is different. If not,
    > > > David
    > > > will need to csup and then rebuild world and kernel (do NOT just

    do
    > > > one; do both) to pick up the changes.
    > > >

    >
    > Try running:
    >
    > ident /boot/kernel/kernel | grep subr_sleepqueue


    Thank you Gary:
    $FreeBSD: src/sys/kern/subr_sleepqueue.c,v 1.39.4.1 2008/01/29
    16:37:04 jhb Exp $

    Which is indeed newer that 1.39.2.3 and dump on UFS2 is still a problem.


    --
    David Peall :: IT Manager
    e-Schools' Network :: http://www.esn.org.za/
    Phone +27 (021) 674-9140
    _______________________________________________
    freebsd-stable@freebsd.org mailing list
    http://lists.freebsd.org/mailman/lis...freebsd-stable
    To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"


  7. RE: Problem with dump stalling

    After some digging it would appear the patch has not been committed ?

    From
    /src/sys/kern/subr_sleepqueue.c
    /*
    * Lock sleepq chain before unlocking proc
    * without this, we could lose a race.
    */
    mtx_lock_spin(&sc->sc_lock);
    PROC_UNLOCK(p);
    thread_lock(td);
    if (ret == 0) {

    Should read
    /*
    + * Lock the per-process spinlock prior to dropping the PROC_LOCK
    + * to avoid a signal delivery race. PROC_LOCK, PROC_SLOCK, and
    + * thread_lock() are currently held in tdsignal().
    */
    + PROC_SLOCK(p);
    mtx_lock_spin(&sc->sc_lock);
    PROC_UNLOCK(p);
    thread_lock(td);
    + PROC_SUNLOCK(p);
    if (ret == 0) {

    Also :
    __FBSDID("$FreeBSD: src/sys/kern/subr_sleepqueue.c,v 1.39.4.1 2008/01/29
    16:37:04 jhb Exp $");

    Regards

    --
    David Peall :: IT Manager
    e-Schools' Network :: http://www.esn.org.za/
    Phone +27 (021) 674-9140

    _______________________________________________
    freebsd-stable@freebsd.org mailing list
    http://lists.freebsd.org/mailman/lis...freebsd-stable
    To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org"


+ Reply to Thread