SYMPTOM
In a 12.1.0.2 GI/RAC environment, several “su” processes each consume close to 100% CPU.
$ ps -ef|grep ifconfig
root 18941 1 0 06:25 ? 00:00:00 sh -c /bin/su -l grid -c "/usr/bin/ssh -o StrictHostKeyChecking=no -o BatchMode=yes -o ConnectTimeout=5 RACTEST2 /sbin/ifconfig -a" 2>&1
root 18942 18941 99 06:25 ? 06:07:08 /bin/su -l grid -c /usr/bin/ssh -o StrictHostKeyChecking=no -o BatchMode=yes -o ConnectTimeout=5 RACTEST2 /sbin/ifconfig -a
grid 26928 23166 0 12:32 pts/1 00:00:00 grep ifconfig
root 62153 1 0 Jan23 ? 00:00:00 sh -c /bin/su -l grid -c "/usr/bin/ssh -o StrictHostKeyChecking=no -o BatchMode=yes -o ConnectTimeout=5 RACTEST2 /sbin/ifconfig -a" 2>&1
root 62154 62153 99 Jan23 ? 14:29:31 /bin/su -l grid -c /usr/bin/ssh -o StrictHostKeyChecking=no -o BatchMode=yes -o ConnectTimeout=5 RACTEST2 /sbin/ifconfig -a
root 77170 1 0 10:30 ? 00:00:00 sh -c /bin/su -l grid -c "/usr/bin/ssh -o StrictHostKeyChecking=no -o BatchMode=yes -o ConnectTimeout=5 RACTEST2 /sbin/ifconfig -a" 2>&1
root 77171 77170 99 10:30 ? 02:02:37 /bin/su -l grid -c /usr/bin/ssh -o StrictHostKeyChecking=no -o BatchMode=yes -o ConnectTimeout=5 RACTEST2 /sbin/ifconfig -a

$ top
.. .
PID   USER PR NI VIRT  RES  SHR  S %CPU  %MEM TIME+     COMMAND
62154 root 25 0  98.8m 1392 1104 R 100.0 0.0  851:33.36 su
18942 root 25 0  98.8m 1400 1104 R 99.9  0.0  349:10.86 su
77171 root 25 0  98.8m 1404 1104 R 99.9  0.0  104:39.33 su
.. .
CAUSES
As per Oracle Doc ID 2340905.1, this is Bug 24692439: LNX64-12.2-DIAGSNAP: AUXILIARY CMDS GENERATED BY DIAGSNAP WOULD HOG CPU FOREVER.
It is fixed in 18.1.
WORKAROUND
1) As the GI owner, disable diagsnap:
$ oclumon manage -disable diagsnap
Diagsnap option is successfully Disabled on RACTEST1
Diagsnap option is successfully Disabled on RACTEST2
Successfully Disabled diagsnap
2) As root, kill the existing “su” processes:
# kill -9 77170 ....
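
On a node with many of these stuck processes, picking the PIDs out of the ps listing by hand is error-prone. The following is a minimal sketch, not part of the Oracle note, of one way to list the candidates before killing them. The function name find_runaway_su, the 90% default threshold, and the `ps -eo pid,pcpu,args` input format are all assumptions made for illustration. It only prints PIDs; it does not kill anything.

```shell
#!/bin/sh
# Hypothetical helper (not from the Oracle note): print the PIDs of
# "/bin/su ... /sbin/ifconfig -a" processes whose CPU usage is at or above
# a threshold. The default of 90 (%) is an assumed value.
# Input on stdin: lines in the format produced by `ps -eo pid,pcpu,args`,
# i.e. field 1 = PID, field 2 = %CPU, field 3 onward = command line.
find_runaway_su() {
    threshold="${1:-90}"
    awk -v t="$threshold" '
        # Match only the su process itself, not the "sh -c" wrapper or grep.
        $3 == "/bin/su" && /\/sbin\/ifconfig -a/ && $2 + 0 >= t { print $1 }
    '
}
```

It could be used as `ps -eo pid,pcpu,args | find_runaway_su 90`. Review the printed PIDs against the ps output, then kill them as root as in step 2.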