[suse-oracle] Problem with OCFS2 - Heartbeat write timeout to device

João Daniel

2006-01-17

Hi,

I intermittently hit an OCFS2 problem on one node of my two-node Oracle RAC.
The server hangs completely after logging the errors listed below.

Can anyone help me with this problem?

ERROR MESSAGES:

-> o2hb_write_timeout:165 ERROR: Heartbeat write timeout to device dm-1 after 12000 milliseconds

-> o2hb_stop_all_regions:1674 ERROR: stopping heartbeat on all active regions.

-> Kernel panic: ocfs2 is very sorry to be fencing this system by panicing
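
The 12000 ms figure in the first message is consistent with how the OCFS2 heartbeat derives its write timeout: (dead threshold - 1) multiplied by a 2000 ms heartbeat interval, which would correspond to a threshold of 7 (the value set as O2CB_HEARTBEAT_THRESHOLD in the o2cb configuration). The following is a minimal Python sketch under that assumption. The helper names parse_write_timeout and implied_threshold are hypothetical and only illustrate the arithmetic; they are not part of any OCFS2 tool.

```python
import re

# Assumption: the o2hb heartbeat interval is 2000 ms, so that
# write timeout = (dead threshold - 1) * 2000 ms.
HEARTBEAT_INTERVAL_MS = 2000

# Matches the o2hb_write_timeout message quoted above.
LOG_PATTERN = re.compile(
    r"Heartbeat write timeout to device (?P<device>\S+) "
    r"after (?P<ms>\d+) milliseconds"
)


def parse_write_timeout(line):
    """Return (device, timeout_ms) from a log line, or None if it does not match."""
    match = LOG_PATTERN.search(line)
    if match is None:
        return None
    return match.group("device"), int(match.group("ms"))


def implied_threshold(timeout_ms, interval_ms=HEARTBEAT_INTERVAL_MS):
    """Invert timeout = (threshold - 1) * interval to recover the threshold."""
    return timeout_ms // interval_ms + 1


line = (
    "-> o2hb_write_timeout:165 ERROR: Heartbeat write timeout "
    "to device dm-1 after 12000 milliseconds"
)
device, timeout_ms = parse_write_timeout(line)
print(device, timeout_ms, implied_threshold(timeout_ms))  # dm-1 12000 7
```

Under these assumptions the node fences itself once heartbeat writes to dm-1 have stalled for about 12 seconds. A larger threshold would lengthen that window, but whether that is appropriate depends on why the writes stall in the first place.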

Device dm-1 is a multipath device on my EMC storage, formatted with OCFS2.
It holds the Voting Disk file (CRS) and the Oracle Cluster Registry (OCR).
The storage itself is working fine.

HARDWARE:

- DELL 2850
- 1x QLogic 2340 (BIOS 1.47) to access my storage (EMC Clariion CX300)

SOFTWARE:

- SLES9 (32 bit) + SP2 (with Multipath and OCFS2 configured)
- Oracle Clusterware Release 2 (10.2.0.1.0)
- Oracle Database 10g Release 2 (10.2.0.1.0)
- QLogic driver: qlafc-linux-8.00.03b-1-install.tgz
- Naviagent CLI: naviagentcli-6.16.0.4.63-1.i386.rpm
- SANsurfer: emc_sansurfer2.0.30b32_linux_install.bin

João Daniel

IT Support
Sistema de Informação do Instituto Politécnico de Setúbal
<mailto:jd-apinf@(protected)>

Instituto Politécnico de Setúbal
Esc. Sup. de Tec. de Setúbal
Campus do IPS, Estefanilha
2910-761 Setúbal

tel: +351 265 790 133
     +351 265 790 119
