Package: xen-linux-system-2.6.26-1-xen-686
Version: 2.6.26-9
Severity: normal

Hi,

Our Heartbeat cluster consists of two Xen hosts in an active-backup
setup, monitoring each other by an Ethernet and a serial link (ttyS0).
Since Oct 23, we experienced several reboots, usually at 1:0X am, when
the debsums checks are scheduled (on the dom0 as well as on some domUs).
For the latest, serial console (hvc0 routed to ttyS1) output is available,
although garbled, as the hypervisor-emulated serial ports seemingly can't
do flow control (or there is some more serious problem):

[312448.032681] ttyS0: 1 input overrun(s)
[317045.074847] ttyS0: 1 input overrun(s)
[350105.941354] ttyS0: 1 input overrun(s)
[353002.736250] iret exception: 0000 [#1] SMP
[353002.736250] Modules linked in: video output ac battery bridge 8021q bonding ip6t_REJECT ip6t_LOG nf_conntrack_ipv6 ip6table_filter ip6_tables ipv6 ipt_recent ipt_REJECT nf_conntrack_ipv4 xt_state nf_conntrack xt_tcpudp ipt_LOG xt_limit xt_multiport iptable_filter ip_tables x_tables ext2 mbcache dm_round_robin dm_emc dm_multipath loop evdev psmouse serio_raw shpchp pci_hotplug pcspkr i2c_piix4 i2c_core button sworks_agp agpgart dcdbas xfs dm_mirror dm_log dm_snapshot dm_mod raid1 md_mod sg ide_cd_mod cdrom sd_mod ata_generic libata dock aic7xxx scsi_transport_spi tg3 floppy qla2xxx firmware_class scsi_transport_fc scsi_tgt scsi_mod ohci_hcd usbcore serverworks ide_pci_generic ide_core thermal processor fan thermal_sys
[353002.738157]
[353002.738157] Pid: 0, comm: swapper Not tainted (2.6.26-1-xen-686 #1)
[353002.738157] EIP: 0061:[] EFLAGS: 00000002 CPU: 0
[353002.738157] EIP is at serial_in+0x5f/0x65
[353002.738157] EAX: c0414264 EBX: 000003f8 ECX: 00000000 EDX: 000003f8
[353002.738157] ESI: c0414320 EDI: cfe8dc00 EBP: 00000069 ESP: c0387ec8
[353002.738157] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0069
[353002.738157] Process swapper (pid: 0, ti=c0386000 task=c03591a0 task.ti=c0386000)
[353002.738157] Stack: cdcbe401 c0414320 c0235766 c0387f08 cfe8dc00 000000ff 00000001 69006e6e
[353002.738157] c0414320 c0413340 c04143dc 00000000 c0236418 00000004 00000000 00000000
[353002.738157] 00000001 ce30e1e0 00000000 00000000 00000004 c0149885 c03773a0 00000004
[353002.738157] Call Trace:
[353002.738157] [] receive_chars+0x28/0x228
[353002.738157] [] serial8250_interrupt+0x66/0x120
[353002.738157] [] handle_IRQ_event+0x36/0x6e
[353002.738157] [] handle_level_irq+0x90/0xed
[353002.738157] [] do_IRQ+0x4d/0x65
[353002.738157] [] evtchn_do_upcall+0xfa0x191
[353002.8157] [] hypervisor_clback+0x46/0x4e[353002.738157][] xesafe_halt+0x9f/b3
[353002.7387] [xen_idle+0x1b/06
[353002.7381] []pu_idle+0xa8/0x
[353002.73815 =====================
[3530.738157] Code: 03 eb 27 03 5e0 8b 03 eb 23 83b 02 8b 46 08 70c 8b 00 c1 e8 25 ff 00 00 00b 0f 01 d8 8a 0eb 06 03 5e 04 da ec <0f> b6 5b 5e c3 57 569 ce 53 0f b6 419 89 c3 89 d0 e0
[353002.7157] EIP: [] serial_inx5f/0x65 SS:ESP069:c0387ec8
[3002.738157] Keel panic - not ncing: Fatal exption in interrt
(XEN) Domain crashed: reboong machine in 5seconds.

The kernel came from Sid, but these are otherwise up-to-date Lenny
systems. I'm about to set up netconsole which will hopefully provide
better output on the next crash. But maybe the above also rings
some bells.

Thanks,
Feri.

-- System Information:
Debian Release: 4.0
APT prefers stable
APT policy: (500, 'stable')
Architecture: i386 (i686)
Shell: /bin/sh linked to /bin/bash
Kernel: Linux 2.6.26-1-686
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)



--
To UNSUBSCRIBE, email to debian-bugs-dist-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org