NETGEAR is aware of a growing number of phone and online scams. To learn how to stay safe click here.
Forum Discussion
OperatorITSP
Apr 13, 2026Guide
M4300 24x24F Stack member failure
Hello
We are having situation where 02 member of two member stack is crashing for unknown reason.
We had this crash multiple times on various firmware.
We do not run multicast and traffic is really low.
Topology is star with redundant links to access layer switches(aruba), we use SPF+ 10G
12.0.17.16
Symptoms, high CPU reported, no obvious reason for it(eg extra traffic). When logged into GUI, CLI to investigate. CPU would go back to normal and we were unable to find out what is causing it. If stack/02 member was not rebooted at this time it would eventually crash in a way where it kept interfaces up, it would eventually fully reboot and join stack.
12.0.19.12
Same as 12.0.17.16
12.0.19.19
No spike in CPU usage.
We did get some logs that look useful(attached). Crash happened out of hours with nobody in the office so traffic/usage was real low. Both members of stack were rebooted, but this did not help so right now we run on 01 only. I'll go to remote office to plug console into 02 to see if I can get anything useful.
Any idea what could be causing this?
2 Replies
<13> Jan 2 23:45:26 CORE01-2 TRAPMGR[dot1s_task]: traputil.c(795) 962 %% Spanning Tree Topology Change Received: MSTID: 0 lag 5 """
<13> Jan 2 23:45:25 CORE01-2 TRAPMGR[SNMPCfgTask]: traputil.c(795) 961 %% Cold Start: Unit: 0"""
<12> Jan 2 23:45:11 CORE01-2 DRIVER[bcmDISC]: broad_stack_mgr.c(2394) 945 %% There has been consecutive 50 mis-matches between driver stack db and app db. Discovery will not be kicked-off."
<13> Jan 2 23:44:43 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_lac.c(290) 918 %% POESW07 is up."""
<14> Jan 2 23:44:43 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_db.c(951) 917 %% Interface 2/0/9 attached to POESW07."""
<13> Jan 2 23:44:43 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_lac.c(290) 916 %% POESW05 is up."""
<14> Jan 2 23:44:43 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_db.c(951) 915 %% Interface 2/0/7 attached to POESW05."""
<13> Jan 2 23:44:42 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 914 %% Link Up: 2/0/9"""
<14> Jan 2 23:44:42 CORE01-2 SSLT[ssltTask]: sslt_util.c(796) 912 %% SSLT: Successfully loaded all required SSL PEM files"
<13> Jan 2 23:44:42 CORE01-2 BSP[intDisTask]: cpu_boxs.c(2208) 907 %% SFP Interrupt received on the unit 2"""
<13> Jan 2 23:44:42 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 909 %% Link Up: 2/0/7"""
<13> Jan 2 23:44:42 CORE01-2 TRAPMGR[dot1s_task]: traputil.c(795) 910 %% Spanning Tree Topology Change Received: MSTID: 0 lag 4 """
<13> Jan 2 23:44:40 CORE01-2 TRAPMGR[dot1s_task]: traputil.c(795) 904 %% Spanning Tree Topology Change Received: MSTID: 0 lag 4 """
<13> Jan 2 23:44:40 CORE01-2 TRAPMGR[dot1s_task]: traputil.c(795) 905 %% Spanning Tree Topology Change: 0, Unit: 1"""
<13> Jan 2 23:44:42 CORE01-2 BSP[intDisTask]: cpu_boxs.c(2208) 906 %% SFP Interrupt received on the unit 2"""
<14> Jan 2 23:44:40 CORE01-2 BONJOUR[bonjourTask]: bonjour_control.c(400) 903 %% Bonjour Responder admin mode enable is already set to the same as requetsed"""
<14> Jan 2 23:44:39 CORE01-2 CLI_WEB[tRpcsrv.01000]: config_script_api.c(1040) 901 %% Configuration script TempTxtConfigScript.scr of length 146 is compressed to 162 of size "
<13> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 899 %% SFP+ inserted in 2/0/12"""
<14> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(969) 898 %% bad rc on Send Trap call to registrar_ID 33"
<13> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 895 %% SFP inserted in 2/0/7"""
<14> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(969) 894 %% bad rc on Send Trap call to registrar_ID 33"
<13> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 891 %% SFP+ inserted in 2/0/5"""
<14> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(969) 892 %% bad rc on Send Trap call to registrar_ID 33"
<14> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(969) 888 %% bad rc on Send Trap call to registrar_ID 33"
<13> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 885 %% SFP+ inserted in 2/0/2"""
<14> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(969) 886 %% bad rc on Send Trap call to registrar_ID 33"
<14> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(969) 880 %% bad rc on Send Trap call to registrar_ID 33"
<13> Jan 2 23:44:36 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 876 %% Warm Auto-Restart has completed on unit 2."""
<14> Jan 2 23:44:36 CORE01-2 UNITMGR[unitMgrTask]: unitmgr.c(2701) 875 %% Warm Auto-Restart complete on unit 2"""
<13> Jan 2 23:44:36 CORE01-2 TRAPMGR[dot1s_task]: traputil.c(795) 874 %% Spanning Tree Topology Change Initiated: 0, Interface: lag 3"""
<13> Jan 2 23:44:36 CORE01-2 TRAPMGR[dot1s_task]: traputil.c(795) 873 %% Spanning Tree Topology Change: 0, Unit: 1"""
<13> Jan 2 23:44:36 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 872 %% Entity Database: Configuration Changed"""
<13> Jan 2 23:44:33 CORE01-2 TRAPMGR[boxs Req]: traputil.c(795) 871 %% Temperature state change alarm: Unit Number: 2 Current: Normal, Previous: None"""
<14> Jan 2 23:44:32 CORE01-2 DOT1Q[dot1qTask]: dot1q_outcalls.c(317) 870 %% Bad rc 1 in vlanNotifyRegisteredUsers for registrar 48, DOT1S"
<14> Jan 2 23:44:32 CORE01-2 DOT1Q[dot1qTask]: dot1q_outcalls.c(317) 868 %% Bad rc 1 in vlanNotifyRegisteredUsers for registrar 48, DOT1S"
<14> Jan 2 23:44:32 CORE01-2 DOT1Q[dot1qTask]: dot1q_outcalls.c(317) 865 %% Bad rc 1 in vlanNotifyRegisteredUsers for registrar 48, DOT1S"
<14> Jan 2 23:44:32 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_db.c(1014) 864 %% Interface 2/0/9 detached from POESW07."""
<13> Jan 2 23:44:32 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 862 %% Link Down: 2/0/9"""
<13> Jan 2 23:44:32 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 861 %% Link Down: 2/0/7"""
<12> Jan 2 23:44:32 CORE01-2 PTP_TC[ptpTc]: cnfgr_hw_tally.c(356) 858 %% PTP_TC reported unexpectedly for L2 hardware reconciliation"""
<14> Jan 2 23:44:32 CORE01-2 General[procLOG]: procmgr.c(879) 847 %% Application Started (appmgr, ID = 15, PID = 1893"""
<13> Jan 2 23:44:32 CORE01-2 General[procLOG]: procmgr.c(2515) 800 %% Administrative Command:app-start appmgr """
<14> Jan 2 23:44:31 CORE01-2 General[procLOG]: procmgr.c(3756) 798 %% Application Terminated (netsnmp, ID = 3, PID = 1887"""
<14> Jan 2 23:44:31 CORE01-2 General[procLOG]: procmgr.c(879) 799 %% Application Started (netsnmp, ID = 3, PID = 1889"""
<13> Jan 2 23:44:31 CORE01-2 IP[tRpcsrv.02000]: ip_api.c(8998) 797 %% OSPF instance completed NSF routes update."""
<13> Jan 2 23:44:31 CORE01-2 IP[ipMapProcessing]: vrf_util.c(1563) 796 %% Registered IPMAP-0 as a best route callback with RTO"""
<13> Jan 2 23:44:31 CORE01-2 General[procLOG]: procmgr.c(2538) 787 %% Administrative Command:app-restart netsnmp """
<14> Jan 2 23:44:30 CORE01-2 DOT1Q[dot1qTask]: dot1q_outcalls.c(317) 772 %% Bad rc 1 in vlanNotifyRegisteredUsers for registrar 48, DOT1S"
<12> Jan 2 23:44:31 CORE01-2 PVT_GROUP[nim_t]: cnfgr_hw_tally.c(356) 778 %% PRIVATE_GROUP_VLAN reported unexpectedly for L2 hardware reconciliation"""
<13> Jan 2 23:44:30 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_lac.c(290) 767 %% POESW10 is up."""
<14> Jan 2 23:44:30 CORE01-2 DOT1Q[dot1qTask]: dot1q_outcalls.c(317) 771 %% Bad rc 1 in vlanNotifyRegisteredUsers for registrar 48, DOT1S"
Logs below
<13> Jan 2 23:45:26 CORE01-2 TRAPMGR[dot1s_task]: traputil.c(795) 962 %% Spanning Tree Topology Change Received: MSTID: 0 lag 5 """
<13> Jan 2 23:45:25 CORE01-2 TRAPMGR[SNMPCfgTask]: traputil.c(795) 961 %% Cold Start: Unit: 0"""
<12> Jan 2 23:45:11 CORE01-2 DRIVER[bcmDISC]: broad_stack_mgr.c(2394) 945 %% There has been consecutive 50 mis-matches between driver stack db and app db. Discovery will not be kicked-off."
<13> Jan 2 23:44:43 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_lac.c(290) 918 %% POESW07 is up."""
<14> Jan 2 23:44:43 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_db.c(951) 917 %% Interface 2/0/9 attached to POESW07."""
<13> Jan 2 23:44:43 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_lac.c(290) 916 %% POESW05 is up."""
<14> Jan 2 23:44:43 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_db.c(951) 915 %% Interface 2/0/7 attached to POESW05."""
<13> Jan 2 23:44:42 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 914 %% Link Up: 2/0/9"""
<14> Jan 2 23:44:42 CORE01-2 SSLT[ssltTask]: sslt_util.c(796) 912 %% SSLT: Successfully loaded all required SSL PEM files"
<13> Jan 2 23:44:42 CORE01-2 BSP[intDisTask]: cpu_boxs.c(2208) 907 %% SFP Interrupt received on the unit 2"""
<13> Jan 2 23:44:42 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 909 %% Link Up: 2/0/7"""
<13> Jan 2 23:44:40 CORE01-2 TRAPMGR[dot1s_task]: traputil.c(795) 904 %% Spanning Tree Topology Change Received: MSTID: 0 lag 4 """
<13> Jan 2 23:44:40 CORE01-2 TRAPMGR[dot1s_task]: traputil.c(795) 905 %% Spanning Tree Topology Change: 0, Unit: 1"""
<13> Jan 2 23:44:42 CORE01-2 BSP[intDisTask]: cpu_boxs.c(2208) 906 %% SFP Interrupt received on the unit 2"""
<14> Jan 2 23:44:40 CORE01-2 BONJOUR[bonjourTask]: bonjour_control.c(400) 903 %% Bonjour Responder admin mode enable is already set to the same as requetsed"""
<14> Jan 2 23:44:39 CORE01-2 CLI_WEB[tRpcsrv.01000]: config_script_api.c(1040) 901 %% Configuration script TempTxtConfigScript.scr of length 146 is compressed to 162 of size "
<13> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 899 %% SFP+ inserted in 2/0/12"""
<14> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(969) 898 %% bad rc on Send Trap call to registrar_ID 33"
<13> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 895 %% SFP inserted in 2/0/7"""
<14> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(969) 894 %% bad rc on Send Trap call to registrar_ID 33"
<13> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 891 %% SFP+ inserted in 2/0/5"""
<14> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(969) 888 %% bad rc on Send Trap call to registrar_ID 33"
<13> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 885 %% SFP+ inserted in 2/0/2"""
<14> Jan 2 23:44:39 CORE01-2 TRAPMGR[trapTask]: traputil.c(969) 880 %% bad rc on Send Trap call to registrar_ID 33"
<13> Jan 2 23:44:36 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 876 %% Warm Auto-Restart has completed on unit 2."""
<14> Jan 2 23:44:36 CORE01-2 UNITMGR[unitMgrTask]: unitmgr.c(2701) 875 %% Warm Auto-Restart complete on unit 2"""
<13> Jan 2 23:44:36 CORE01-2 TRAPMGR[dot1s_task]: traputil.c(795) 874 %% Spanning Tree Topology Change Initiated: 0, Interface: lag 3"""
<13> Jan 2 23:44:36 CORE01-2 TRAPMGR[dot1s_task]: traputil.c(795) 873 %% Spanning Tree Topology Change: 0, Unit: 1"""
<13> Jan 2 23:44:36 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 872 %% Entity Database: Configuration Changed"""
<13> Jan 2 23:44:33 CORE01-2 TRAPMGR[boxs Req]: traputil.c(795) 871 %% Temperature state change alarm: Unit Number: 2 Current: Normal, Previous: None"""
<14> Jan 2 23:44:32 CORE01-2 DOT1Q[dot1qTask]: dot1q_outcalls.c(317) 865 %% Bad rc 1 in vlanNotifyRegisteredUsers for registrar 48, DOT1S"
<14> Jan 2 23:44:32 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_db.c(1014) 864 %% Interface 2/0/9 detached from POESW07."""
<13> Jan 2 23:44:32 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 862 %% Link Down: 2/0/9"""
<13> Jan 2 23:44:32 CORE01-2 TRAPMGR[trapTask]: traputil.c(753) 861 %% Link Down: 2/0/7"""
<12> Jan 2 23:44:32 CORE01-2 PTP_TC[ptpTc]: cnfgr_hw_tally.c(356) 858 %% PTP_TC reported unexpectedly for L2 hardware reconciliation"""
<14> Jan 2 23:44:32 CORE01-2 General[procLOG]: procmgr.c(879) 847 %% Application Started (appmgr, ID = 15, PID = 1893"""
<13> Jan 2 23:44:32 CORE01-2 General[procLOG]: procmgr.c(2515) 800 %% Administrative Command:app-start appmgr """
<14> Jan 2 23:44:31 CORE01-2 General[procLOG]: procmgr.c(3756) 798 %% Application Terminated (netsnmp, ID = 3, PID = 1887"""
<14> Jan 2 23:44:31 CORE01-2 General[procLOG]: procmgr.c(879) 799 %% Application Started (netsnmp, ID = 3, PID = 1889"""
<13> Jan 2 23:44:31 CORE01-2 IP[tRpcsrv.02000]: ip_api.c(8998) 797 %% OSPF instance completed NSF routes update."""
<13> Jan 2 23:44:31 CORE01-2 IP[ipMapProcessing]: vrf_util.c(1563) 796 %% Registered IPMAP-0 as a best route callback with RTO"""
<13> Jan 2 23:44:31 CORE01-2 General[procLOG]: procmgr.c(2538) 787 %% Administrative Command:app-restart netsnmp """
<14> Jan 2 23:44:30 CORE01-2 DOT1Q[dot1qTask]: dot1q_outcalls.c(317) 772 %% Bad rc 1 in vlanNotifyRegisteredUsers for registrar 48, DOT1S"
<12> Jan 2 23:44:31 CORE01-2 PVT_GROUP[nim_t]: cnfgr_hw_tally.c(356) 778 %% PRIVATE_GROUP_VLAN reported unexpectedly for L2 hardware reconciliation"""
<13> Jan 2 23:44:30 CORE01-2 DOT3AD[dot3ad_core_lac]: dot3ad_lac.c(290) 767 %% POESW10 is up."""
<14> Jan 2 23:44:30 CORE01-2 DOT1Q[dot1qTask]: dot1q_outcalls.c(317) 771 %% Bad rc 1 in vlanNotifyRegisteredUsers for registrar 48, DOT1S"
Related Content
NETGEAR Academy
Boost your skills with the Netgear Academy - Get trained, certified and stay ahead with the latest Netgear technology!
Join Us!