[GTER] Junos snmp health-monitor, como determinar a causa raiz?

Gabriel Farias gabrielfarias03 at gmail.com
Thu Jun 23 13:05:00 -03 2016


Junos snmp health-monitor, como determinar a causa raiz?

Olá membros,

Há muitas opções para monitorar a saúde dos equipamentos Juniper com Junos,
o registro de eventos geralmente são via SNMP, as opções utilizando "snmp
health-monitor" satisfazer as minhas necessidades atuais.

A minha pergunta é consigo determinar qual processo causou esses eventos,
princialmente de utilização de CPU?, vocês podem me ajudar?, podemos
determinar o processo? Veja o seguinte exemplo:

[edit]
root at RT0001AMZ> show configuration snmp health-monitor
Jun 16 15:57:23
[edit snmp]
health-monitor {
interval 5;
rising-threshold 40;
falling-threshold 30;
}

root at RT0001AMZ> show log messages | match SNMPD_HEALTH_MON_THRESH_CROSS
Jun 16 12:47:59 RT0001AMZ snmpd[923]: SNMPD_HEALTH_MON_THRESH_CROSS: Health
Monitor: root file system utilization crossed rising threshold 40 (value:
79), (variable: jnxHrStoragePercentUsed.1)
Jun 16 12:48:07 RT0001AMZ snmpd[923]: SNMPD_HEALTH_MON_THRESH_CROSS: Health
Monitor: RE 0 memory utilization crossed rising threshold 40 (value: 80),
(variable: jnxOperatingBuffer.9.1.0.0)
Jun 16 12:49:14 RT0001AMZ snmpd[923]: SNMPD_HEALTH_MON_THRESH_CROSS: Health
Monitor: RE 0 CPU utilization crossed falling threshold 30 (value: 23),
(variable: jnxOperatingCPU.9.1.0.0)
Jun 16 13:20:14 RT0001AMZ snmpd[923]: SNMPD_HEALTH_MON_THRESH_CROSS: Health
Monitor: RE 0 CPU utilization crossed rising threshold 40 (value: 44),
(variable: jnxOperatingCPU.9.1.0.0)

Para os equipamentos Cisco podemos determinar a causa raiz (processo), por
exemplo, de High CPU conforme o exemplo á seguir

BTA#sh run | inc cpu
process cpu threshold type total rising 70 interval 5 falling 60 interval 5
snmp-server enable traps cpu threshold

BTA#sh log | i CPU
May 18 18:31:40.139: %SYS-1-CPURISINGTHRESHOLD: Threshold: Total CPU
Utilization(Total/Intr): 88%/47%, Top 3 processes(Pid/Util): 181/38%, 6/1%,
91/0%
May 18 18:32:05.140: %SYS-1-CPUFALLINGTHRESHOLD: Threshold: Total CPU
Utilization(Total/Intr) 54%/29%.

BTA#sh processes cpu | i 181
PID Runtime(ms) Invoked uSecs 5Sec 1Min 5Min TTY Process
181 66001892 805865684 81 1.91% 3.61% 4.09% 0 IP Input        >>>>> Process

Obrigado,
Gabriel Farias



More information about the gter mailing list