
Oracle Database RAC 11.2.0.3 for AIX 6.1 TL7 Installation Notes (3)

Updated: 2022-08-17 17:13:09


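        These are notes on the problems encountered during another installation of Oracle 11.2.0.3 RAC Database for AIX 6.1 TL7. Two earlier installments and a related article are listed here: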

"Oracle Database RAC 11.2.0.3 for AIX 6.1 TL7 Installation Notes (1)":
http://space.itpub.net/23135684/viewspace-733990

"Oracle Database RAC 11.2.0.3 for AIX 6.1 TL7 Installation Notes (2)":
http://space.itpub.net/23135684/viewspace-742017

"Some Best Practices for Networking When Running RAC on AIX":
https://blogs.oracle.com/Database4CN/entry/%E5%9C%A8aix%E4%B8%8A%E8%BF%90%E8%A1%8Crac%E6%97%B6%E7%BD%91%E7%BB%9C%E6%96%B9%E9%9D%A2%E7%9A%84%E4%B8%80%E4%BA%9B%E6%9C%80%E4%BD%B3%E7%BB%8F%E9%AA%8C


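Problem 1:
        Error screenshot:

[Screenshot]
 
        During the Grid Infrastructure installation, the graphical installer calls CVU to verify the installation environment. The network parameter checks can fail even though, at the operating system level, the command no -a | grep ipqmaxlen confirms that the parameter is already configured correctly. The solution is as follows: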
PRVE-0273 : The value of network parameter "udp_sendspace" for interface "en0" is not configured to the expected value on node "racnode1" [ID 1373242.1]
Modified: 2012-3-7    Type: REFERENCE    Status: MODERATED    Priority: 3

In this Document
  Purpose
  PRVE-0273 : The value of network parameter "udp_sendspace" for interface "en0" is not configured to the expected value on node "racnode1"
      
     bug 13077654 - AIX specific
     bug 13531373 - AIX specific 




This document is being delivered to you via Oracle Support's Rapid Visibility (RaV) process and therefore has not been subject to an independent technical review.



Applies to:

Oracle Server - Enterprise Edition - Version: 11.2.0.1 and later   [Release: 11.2 and later ]
Information in this document applies to any platform.

Purpose

This note lists typical causes and solutions for the following cluvfy error:

PRVE-0273 : The value of network parameter "rfc1323" for interface "en2" is not configured to the expected value on node "racnode1"

PRVE-0273 : The value of network parameter "udp_sendspace" for interface "en0" is not configured to the expected value on node "racnode1"

bug 13077654 - AIX specific

On AIX, runInstaller complains about the network parameter settings ipqmaxlen, rfc1323, sb_max, tcp_sendspace, udp_sendspace, and udp_recvspace:

INFO: *********************************************
INFO: Network parameter - rfc1323: Checks if the network parameter is set correctly on the system
INFO: Severity:IGNORABLE
INFO: OverallStatus:VERIFICATION_FAILED
INFO: -----------------------------------------------
INFO: Verification Result for Node:racnode1
INFO: Expected Value:1
INFO: Actual Value:en2=0
INFO: Error Message:PRVE-0273 : The value of network parameter "rfc1323" for interface "en2" is not configured to the expected value on node "racnode1".[Expected="1"; Found="en2=0"]

Manual verification with "ifconfig" and "/usr/sbin/no" shows that the settings are as expected.
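
The manual check can be scripted so that each parameter CVU reports is compared against what the OS actually holds. The sketch below is illustrative only: the helper name get_no_param is hypothetical, and it assumes that "no -a" prints one "name = value" pair per line.

```shell
#!/bin/sh
# get_no_param NAME: read "name = value" lines on stdin and print the
# value of NAME. The "name = value" layout is an assumption about the
# output of /usr/sbin/no -a; the helper name is hypothetical.
get_no_param() {
    awk -v name="$1" '
        {
            # Split each line at "=" and trim blanks on both sides.
            n = split($0, kv, "=")
            if (n < 2) next
            key = kv[1]; val = kv[2]
            gsub(/^[ \t]+|[ \t]+$/, "", key)
            gsub(/^[ \t]+|[ \t]+$/, "", val)
            if (key == name) { print val; exit }
        }'
}

# On an AIX node (not executed here):
#   /usr/sbin/no -a | get_no_param rfc1323
```

If the value printed here matches the expected value while CVU still reports a mismatch, the failure is more likely the CVU issue described in this note than a real misconfiguration.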

This bug is fixed in 12.1 and later.

The workaround is to create a symbolic link as root:

# ln -s /usr/sbin/no /etc/no
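
If the workaround has to be applied on several nodes, it may help to make it safe to repeat. The following is a minimal sketch under that assumption; ensure_link is a hypothetical helper, not part of any Oracle or AIX tooling, and it takes the paths as arguments so it can be tried in a scratch directory first.

```shell
#!/bin/sh
# ensure_link TARGET LINK: create LINK -> TARGET only when LINK is absent.
# Returns 0 when LINK exists afterwards and resolves to an existing file.
# (Hypothetical helper for illustration.)
ensure_link() {
    target=$1
    link=$2
    if [ ! -e "$target" ]; then
        echo "target $target does not exist" >&2
        return 1
    fi
    if [ -e "$link" ] || [ -L "$link" ]; then
        # Leave whatever is already there untouched.
        echo "$link already present, not modified"
    else
        ln -s "$target" "$link" && echo "created $link -> $target"
    fi
    [ -e "$link" ]
}

# As root on the AIX node (not executed here):
#   ensure_link /usr/sbin/no /etc/no
```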


bug 13531373 - AIX specific

On AIX, runInstaller complains about network parameter settings even when they are larger than required:

INFO: *********************************************
INFO: Network parameter - tcp_sendspace: Checks if the network parameter is set correctly on the system
INFO: Severity:IGNORABLE
INFO: OverallStatus:VERIFICATION_FAILED
INFO: -----------------------------------------------
INFO: Verification Result for Node:racnode1
INFO: Expected Value:1
INFO: Actual Value:en2=0
INFO: Error Message:PRVE-0273 : The value of network parameter "tcp_sendspace" for interface "en10" is not configured to the expected value on node "racnode1".[Expected="65536";Found="en10=262144"]

As you can see, the expected value is 65536, and the current value of 262144 satisfies the requirement.

The fix is included in 11.2.0.3 GI PSU2, 11.2.0.4, and above; the error can be ignored.
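
Before ignoring the error, it is worth confirming that the reported value really is at or above the expected one. A small sketch of that comparison follows; check_min is a hypothetical helper, it assumes both values are plain integers, and the "at least" rule is only appropriate for size parameters such as tcp_sendspace, not for on/off flags such as rfc1323.

```shell
#!/bin/sh
# check_min EXPECTED FOUND: FOUND uses the "interface=value" form from the
# CVU message. Prints OK when the value is at least EXPECTED, otherwise
# TOO_LOW. (Hypothetical helper; assumes integer values.)
check_min() {
    expected=$1
    value=${2#*=}          # strip the "en10=" style prefix
    if [ "$value" -ge "$expected" ]; then
        echo OK
    else
        echo TOO_LOW
    fi
}

# Values taken from the message above:
#   check_min 65536 en10=262144
```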


Problem 2:
        Error screenshot:

[Screenshot]

        During the Grid Infrastructure installation, running the root.sh script on the first node as prompted produced the error shown above (Failed to write the checkpoint:'' with status:FAIL.Error code is 256). The solution is as follows:
AIX: 11gR2 Grid Infrastructure Installation, root.sh Error: Failed to write the checkpoint:'' with status:FAIL.Error code is 256 [ID 1382505.1]
Modified: 2011-12-5    Type: PROBLEM    Status: PUBLISHED    Priority: 3

In this Document
  Symptoms
  Cause
  Solution




Applies to:

Oracle Server - Enterprise Edition - Version: 11.2.0.1 and later   [Release: 11.2 and later ]
IBM AIX on POWER Systems (64-bit)

Symptoms

When installing Grid Infrastructure 11.2.0.3, root.sh fails with:

# root.sh
..
User ignored Prerequisites during installation
Failed to write the checkpoint:'' with status:FAIL.Error code is 256
Undefined subroutine &crsconfig_lib::dieformat called at /oracle/app/11.2.0.3/grid/crs/install/crsconfig_lib.pm line 6135.



rootcrs_<node1>.log shows:


2011-11-23 03:43:20: User ignored Prerequisites during installation
2011-11-23 03:43:24: ###### Begin DIE Stack Trace ######
2011-11-23 03:43:24: Package File Line Calling
2011-11-23 03:43:24: --------------- -------------------- ---- ----------
2011-11-23 03:43:24: 1: main rootcrs.pl 375 crsconfig_lib::dietrap
2011-11-23 03:43:24: 2: crsconfig_lib crsconfig_lib.pm 6135 main::__ANON__
2011-11-23 03:43:24: 3: crsconfig_lib crsconfig_lib.pm 6640 crsconfig_lib::set_file_perms
2011-11-23 03:43:24: 4: main rootcrs.pl 457 crsconfig_lib::run_env_setup_modules
2011-11-23 03:43:24: ####### End DIE Stack Trace #######
..
2011-11-23 03:43:24: Failed to write the checkpoint:'' with status:FAIL.Error code is 256

Cause

The problem is caused by clusterware library symlinks pointing to a nonexistent vendor clusterware library, e.g.:

$ ls -l /oracle/app/11.2.0.3/grid/lib/libskgxn*
lrwxrwxrwx 1 grid oinstall 33 Nov 23 03:08 /oracle/app/11.2.0.3/grid/lib/libskgxn2.so -> /opt/ORCLcluster/lib/libskgxn2.so
-rwxr-xr-x 1 grid oinstall 159806 Oct 20 23:55 /oracle/app/11.2.0.3/grid/lib/libskgxnr.a
lrwxrwxrwx 1 grid oinstall 33 Nov 23 09:38 /oracle/app/11.2.0.3/grid/lib/libskgxnr.so -> /opt/ORCLcluster/lib/libskgxnr.so

$ ls -l /opt/ORCLcluster
ls: 0653-341 The file /opt/ORCLcluster does not exist.
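
The same condition can be detected without reading ls output by hand. The sketch below lists symlinks whose targets cannot be resolved; list_dangling is a hypothetical helper, and the directory and pattern in the usage comment are the ones from the listing above.

```shell
#!/bin/sh
# list_dangling DIR PATTERN: print symlinks in DIR matching PATTERN whose
# target does not exist. (Hypothetical helper for illustration.)
list_dangling() {
    # $2 is deliberately unquoted so the shell expands it as a glob.
    for f in "$1"/$2; do
        # -L: the entry is a symlink; ! -e: its target cannot be resolved
        if [ -L "$f" ] && [ ! -e "$f" ]; then
            echo "$f"
        fi
    done
}

# On the affected node (not executed here):
#   list_dangling /oracle/app/11.2.0.3/grid/lib 'libskgxn*'
```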

This happens because the HACMP executables were not removed cleanly when HACMP was deinstalled. When HACMP is installed, it creates the directory /usr/sbin/cluster/utilities, among others. Oracle OUI relies on /usr/sbin/cluster/utilities/cldomain to determine whether vendor clusterware exists. If it does, a symlink $GRID_HOME/lib/libskgxn2.so is created pointing to /opt/ORCLcluster/lib/libskgxn2.so (and likewise for libskgxnr.so). The /opt/ORCLcluster directory is set up by rootpre.sh if vendor clusterware is present.

In this case, HACMP was installed first and Veritas software afterwards, which turned /usr/sbin/cluster/utilities/cldomain into a symlink pointing to the Veritas clusterware:

$ ls -l /usr/sbin/cluster/utilities/cldomain
lrwxrwxrwx    1 root     system           29 Sep 21 13:54 /usr/sbin/cluster/utilities/cldomain -> /opt/VRTSvcs/rac/bin/cldomain

When HACMP was deinstalled later, it removed all other files but left the cldomain symlink behind. Oracle therefore concluded that vendor clusterware existed and created the libskgxn2.so and libskgxnr.so symlinks during the library link phase of the OUI installation, which in turn caused root.sh to fail.

Solution

1. When deinstalling vendor clusterware, make sure all associated files are removed. In this case, remove the symlink /usr/sbin/cluster/utilities/cldomain.

2. Clean up the failed GI installation with the $GRID_HOME/deinstall/deinstall command, or clean up manually following Document 1364419.1.

3. Reinstall Grid Infrastructure 
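
Before step 3, it may be useful to confirm that the file OUI looks for is really gone. The following sketch classifies the path; vendor_marker_state is a hypothetical helper, and whether a given result is acceptable depends on whether vendor clusterware is intentionally installed on the node.

```shell
#!/bin/sh
# vendor_marker_state PATH: report whether PATH is a symlink, some other
# existing file, or absent. (Hypothetical helper for illustration.)
vendor_marker_state() {
    if [ -L "$1" ]; then
        echo "symlink"      # possibly a leftover, as in this case
    elif [ -e "$1" ]; then
        echo "present"
    else
        echo "absent"
    fi
}

# On the node, before reinstalling (not executed here):
#   vendor_marker_state /usr/sbin/cluster/utilities/cldomain
```

On a node that is not supposed to run vendor clusterware, a result other than "absent" suggests that step 1 is not complete.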


--end--
