OEM Agent has stopped monitoring: Disk Full
OEM is short for Oracle Enterprise Manager.
This article covers the OEM alert "Agent has stopped monitoring. The following errors are reported : Disk Full" and shows how to use the dcli command on an Exadata server to check the agent on every compute node.
Error received
OEM Agent has stopped monitoring. Disk Full
Message from OEM alert: EM Event: Critical:OPRD_SRVR:3872 – Agent has stopped monitoring. The following errors are reported : Disk Full
Below is an example of dcli command usage on an Exadata server.
Overview of the environment
This error was received on compute node 3 of the Exadata environment.
The dcli command was therefore used to verify the status of the agents on all the other compute nodes as well.
Action Taken
Purged log files, which freed some space on the disk where the agent is installed.
After purging the log files, the agent's status was verified: it is up and running.
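The space check and the search for purge candidates can be sketched as follows. This is a minimal illustration, not the exact procedure used here: the article does not say which log files were purged, so the `AGENT_BASE` default (taken from the prefix of the emctl path used in this article), the `*.log`/`*.trc` patterns, and the 7-day threshold are all assumptions, and `fs_use_pct` and `old_logs` are hypothetical helper names. The sketch only lists files and deletes nothing.

```shell
#!/usr/bin/env bash
# Sketch only: report how full the agent's filesystem is and list old log files.
# AGENT_BASE is an assumed location; adjust it to the real agent home.
AGENT_BASE="${AGENT_BASE:-/u01/app/oracle/product/EMbase}"

# Print the use percentage (digits only) of the filesystem that contains $1.
fs_use_pct() {
    df -P "$1" | awk 'NR==2 { gsub("%", "", $5); print $5 }'
}

# List *.log and *.trc files under $1 last modified more than $2 days ago.
# Nothing is removed; review the list before purging anything.
old_logs() {
    find "$1" -type f \( -name '*.log' -o -name '*.trc' \) -mtime +"$2" -print
}

if [ -d "$AGENT_BASE" ]; then
    echo "Filesystem use: $(fs_use_pct "$AGENT_BASE")%"
    old_logs "$AGENT_BASE" 7
fi
```

Listing first and purging by hand keeps files that the agent still needs out of harm's way.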
Note: After the agent is started, the dcli output below shows "running for 129 seconds" for a few databases on node 3.
This can be ignored; it takes some time for the agent running on node 3 to refresh and upload its data to the OMS.
dcli command and sample output
[oracle@OPRD_SRVR01 ~]$ dcli -l oracle -g dbs_group "/u01/app/oracle/product/EMbase/agent12c/core/12.1.0.5.0/bin/emctl status agent | grep -i Running"
OPRD_SRVR1: Agent is Running and Ready
OPRD_SRVR2: Agent is Running and Ready
OPRD_SRVR3: rac_database.OPRD – RETRY_TARGET running for 129 seconds
OPRD_SRVR3: rac_database.OPRDW – RETRY_TARGET running for 129 seconds
OPRD_SRVR3: Dynamic property executor tasks running
OPRD_SRVR3: Agent is Running and Ready
OPRD_SRVR4: Agent is Running and Ready
OPRD_SRVR5: Agent is Running and Ready
OPRD_SRVR6: Agent is Running and Ready
OPRD_SRVR7: Agent is Running and Ready
OPRD_SRVR8: Agent is Running and Ready
[oracle@OPRD_SRVR1 ~]$
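With many compute nodes it can help to reduce this output to just the hosts that need attention. The sketch below assumes dcli prefixes every line with `hostname: ` as in the sample above; `hosts_not_ready` is a hypothetical helper name, not part of dcli or emctl. It reads the output on standard input and prints each host that produced lines but never reported "Agent is Running and Ready".

```shell
#!/usr/bin/env bash
# Sketch only: read dcli output ("host: text") on stdin and print the hosts
# that never reported "Agent is Running and Ready".
hosts_not_ready() {
    awk -F': ' '
        { seen[$1] = 1 }                                  # every host that printed a line
        /Agent is Running and Ready/ { ready[$1] = 1 }    # hosts whose agent is ready
        END { for (h in seen) if (!(h in ready)) print h }
    ' | sort
}

# Usage, with the same dcli command as above:
#   dcli -l oracle -g dbs_group "/u01/app/oracle/product/EMbase/agent12c/core/12.1.0.5.0/bin/emctl status agent | grep -i Running" | hosts_not_ready
```

For the sample output above the function prints nothing, because every node, including node 3, reports the agent as ready. A host that returns no lines at all is not detected by this sketch and would have to be compared against the group file separately.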
The complete original message from the OEM alert is shown below.
Host=OPRD_SRVR
Target type=Agent
Target name=OPRD_SRVR:3872
Categories=Availability
Message=Agent has stopped monitoring. The following errors are reported : Disk Full.
Severity=Critical
Operating System=Linux
Platform=x86_64
Associated Incident Id=164208
Associated Incident Status=New
Associated Incident Owner=
Associated Incident Acknowledged By Owner=No
Associated Incident Priority=None
Associated Incident Escalation Level=0
Event Type=Target Availability
Event name=Status
Availability status=Agent Unreachable (Disk Full)
Rule Name=Incident management rule set for all targets,Incident creation rule for down and agent unreachable availability status for agents and hosts
Rule Owner=System Generated
Update Details:
Agent has stopped monitoring. The following errors are reported : Disk Full.