[suse-oracle] ocfs2 network problem on SLES9, SP3, 32 bit

Marcin Przyczyna

2006-03-08

Hi,

I have set up a two-node cluster on SLES9 SP3 (x86), with lvm2 on top
of multipath, for testing purposes. I'm using a qla2200 HBA.

I can mount my partition on the first node, but not on the second one.
The problem is symmetrical: no matter which node is activated first,
the second one cannot mount the shared partition. A node cannot join
the cluster if the partition is already mounted elsewhere.

The errors I get follow.

On the second node I get a mount error:

lt2:/ # mount /ora01
mount.ocfs2: Transport endpoint is not connected while mounting /dev/ocfs2/handel on /ora01

/var/log/messages :

Mar 4 13:00:23 lt2 kernel: OCFS2 1.1.7-SLES Tue Nov 1 14:45:27 PST 2005 (build sles)
Mar 4 13:00:23 lt2 kernel: (12867,0):ocfs2_initialize_super:1332 max_slots for this device: 4
Mar 4 13:00:23 lt2 kernel: (12867,0):ocfs2_fill_local_node_info:1011 I am node 1
Mar 4 13:00:28 lt2 kernel: (12845,0):o2net_connect_expired:1442 ERROR: no connection established with node 0 after 10 seconds, giving up and returning errors.
Mar 4 13:00:28 lt2 kernel: (12867,0):dlm_request_join:751 ERROR: status = -107
Mar 4 13:00:28 lt2 kernel: (12867,0):dlm_try_to_join_domain:899 ERROR: status = -107
Mar 4 13:00:28 lt2 kernel: (12867,0):dlm_join_domain:1144 ERROR: status = -107
Mar 4 13:00:28 lt2 kernel: (12867,0):dlm_register_domain:1334 ERROR: status = -107
Mar 4 13:00:28 lt2 kernel: (12867,0):ocfs2_dlm_init:1706 ERROR: status = -107
Mar 4 13:00:28 lt2 kernel: (12867,0):ocfs2_mount_volume:1043 ERROR: status = -107
Mar 4 13:00:28 lt2 kernel: ocfs2: Unmounting device (253,13) on (node 1)
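
For reference, the "status = -107" values in the log are negated Linux
errno codes. On Linux, errno 107 is ENOTCONN, whose message is the same
"Transport endpoint is not connected" text that mount.ocfs2 prints. A
minimal sketch for decoding such a status follows; the helper name
describe_status is hypothetical, and the numeric mapping comes from the
platform the script runs on, so it should be run on a Linux host.

```python
import errno
import os


def describe_status(status: int) -> str:
    """Turn a kernel-style negative status into 'NAME: message'.

    describe_status is a hypothetical helper, not part of any OCFS2 tool.
    The errno table is the local platform's, so run this on Linux.
    """
    code = abs(status)
    name = errno.errorcode.get(code, "UNKNOWN")
    return f"{name}: {os.strerror(code)}"


if __name__ == "__main__":
    # Value taken from the dlm_* and ocfs2_* lines in the log above.
    print(describe_status(-107))
```

This only confirms that every DLM error in the chain is the same
"not connected" condition reported by o2net; it does not say why the
connection failed.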

----------------------------

On the node with the mounted partition I get this log entry
in /var/log/messages:

Mar 4 13:00:28 lt1 kernel: (13086,0):o2net_connect_expired:1442 ERROR: no connection established with node 1 after 10 seconds, giving up and returning errors.

---------------------------

"o2cb status" returns "checking cluster ocfs2 online" on both nodes.
It seems I have a problem with the network interfaces. I'm using
one NIC per server, an old 100 Mbit/s copper card in a switched LAN. My
netstat shows listening ocfs2 daemons on both nodes:

tcp 0 0 0.0.0.0:7777  0.0.0.0:*   LISTEN
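
A LISTEN entry only shows that the local daemon has bound port 7777; it
does not show that the other node can actually reach it. A minimal
sketch of a reachability check, to be run on each node against the
other, follows. The function name can_connect and the 3-second timeout
are assumptions for illustration; port 7777 is taken from the netstat
output above.

```python
import socket


def can_connect(host: str, port: int = 7777, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    can_connect is a hypothetical helper. The default port 7777 is the
    one shown as LISTEN in the netstat output; the timeout is arbitrary.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution errors.
        return False
```

For example, calling can_connect("lt1") on lt2 and can_connect("lt2")
on lt1 (assuming those host names resolve to the addresses the cluster
is configured to use) would show whether a plain TCP connection works
in each direction. A False result points at name resolution, routing or
packet filtering rather than at ocfs2 itself.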

When I attempt to mount the ocfs2 partition on both nodes concurrently,
I get the "transport endpoint is not connected" error on both nodes,
and the mount fails.

Because many of you use ocfs2 for various purposes every day,
I assume I made a simple configuration or mental mistake.
Well, I hope so at least.

PS. Does any of you know the approximate date of certification of
ocfs2 for RAC on Linux?

Help of any kind will be appreciated.

Regards,
mpr.

--
Marcin Przyczyna
it/org
http://www.citiworks.de/
+49 89 9925 75356
mpr@(protected)
