Greg_11
Aspirant
Mar 03, 2017

M4300 stacking problems

I'm trying to create an eight-switch spine-and-leaf stack with two XSM4324S (M4300-12X12F) and six GSM4352PB-100NES (M4300-52G-PoE+), and I'm having some major issues. The stack formed fine with seven of the switches, but when I added the 8th, it joined the stack, synchronized its firmware as expected, rebooted, displayed the appropriate stack unit number, and appeared as OK on the stack configuration page. However, aside from the stack ports, all of the ports on the switch were dead. In the device view, the stack ports were green, but the rest were greyed out. At first I thought it was a switch problem, so I swapped in a spare. Same thing. I tried new optics, same thing.

 

So far I've tried at least half a dozen SFP+ modules, and three M4300-52G-PoE+ switches. I've tried leaving the switch configured in the stack, and also deleting it. I've factory reset the switch (not the stack) from the boot menu (option 6) which works fine, but once added to the stack, same thing. If I reload the switch that's not working without the stack ports connected, it comes up fine and forwards packets, but now it's the master, along with the other master, and I get MAC/IP conflicts unless I factory reset it first.

 

At one point both spine switches got powered down, so every leaf became a stack master. When the spines came back, I had six stack masters connected to two spine switches, each with the same management IP, MAC, etc. The only way I could get it back to a working state was to physically power down all the switches and bring up the spines first, then the leaves. When it came back up, one of the leaves that had been working just fine was now in the same state the 8th member kept getting into: stack ports up, but all other ports down. I've tried all of the steps I tried with the 8th switch on this former stack member, and it still isn't working. I can get it added back to the stack just fine, but then it doesn't forward traffic.

 

The current six-switch stack carries production traffic, which I would prefer not to take down, but if I have to schedule yet another maintenance period, I will.

 

The buffered logs when switch 7 is part of the stack but isn't forwarding traffic (repeats):

 

<15> Mar 3 18:24:01 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114525 %% Destination unit 7 is not ready
<13> Mar 3 18:24:01 MD-SW-ACCESS-1 CARDMGR[cmgrInsertTask]: cardmgr_util.c(521) 114524 %% Card insert failed for unit/slot 7/0, wait for retry
<15> Mar 3 18:24:01 MD-SW-ACCESS-1 DRIVER[cmgrInsertTask]: broad_init.c(2998) 114523 %% Driver: Card insertion for unit 7, slot 0 failed. Wait for retry.
<12> Mar 3 18:24:01 MD-SW-ACCESS-1 DRIVER[cmgrInsertTask]: l7_usl_sm.c(2094) 114522 %% USL: Cold start on unit 7 failed, error code -4
<11> Mar 3 18:24:01 MD-SW-ACCESS-7 DRIVER[unitMgrTask]: l7_usl_bcm.c(1828) 114521 %% Db TRUNK(8) sync failed, error code -4 elem Trunk-id 0 App-id 1 PSC=0, DLF Index=0, MC Index=0, IPMC Index=0, LocalPref=0 Member ports in the Trunk():
<15> Mar 3 18:24:00 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114520 %% Destination unit 7 is not ready
<15> Mar 3 18:23:59 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114519 %% Destination unit 7 is not ready
<15> Mar 3 18:23:58 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114518 %% Destination unit 7 is not ready
<15> Mar 3 18:23:57 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114517 %% Destination unit 7 is not ready
<15> Mar 3 18:23:56 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114516 %% Destination unit 7 is not ready
<15> Mar 3 18:23:55 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114515 %% Destination unit 7 is not ready
<15> Mar 3 18:23:54 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114514 %% Destination unit 7 is not ready
<15> Mar 3 18:23:53 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114513 %% Destination unit 7 is not ready
<15> Mar 3 18:23:52 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114512 %% Destination unit 7 is not ready
<15> Mar 3 18:23:51 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114511 %% Destination unit 7 is not ready
<15> Mar 3 18:23:50 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114510 %% Destination unit 7 is not ready
<15> Mar 3 18:23:49 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114509 %% Destination unit 7 is not ready
<15> Mar 3 18:23:48 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114508 %% Destination unit 7 is not ready
<15> Mar 3 18:23:47 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114507 %% Destination unit 7 is not ready
<15> Mar 3 18:23:46 MD-SW-ACCESS-1 DRIVER[osapiTimer]: broad_hpc_stacking.c(608) 114506 %% Destination unit 7 is not ready
<13> Mar 3 18:23:46 MD-SW-ACCESS-1 CARDMGR[cmgrInsertTask]: cardmgr_util.c(521) 114505 %% Card insert failed for unit/slot 7/0, wait for retry
<11> Mar 3 18:23:46 MD-SW-ACCESS-7 DRIVER[unitMgrTask]: l7_usl_bcm.c(1828) 114504 %% Db TRUNK(8) sync failed, error code -4 elem Trunk-id 0 App-id 1 PSC=0, DLF Index=0, MC Index=0, IPMC Index=0, LocalPref=0 Member ports in the Trunk():
<15> Mar 3 18:23:46 MD-SW-ACCESS-1 DRIVER[cmgrInsertTask]: broad_init.c(2998) 114503 %% Driver: Card insertion for unit 7, slot 0 failed. Wait for retry.
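For anyone who wants to see which messages dominate a log like the one above, here is a rough Python sketch that counts entries by severity, component, and source location. It is not a NETGEAR tool. The regex is only inferred from the entries in this post, the names `ENTRY`, `summarize`, and `SAMPLE` are made up for illustration, and it assumes the leading `<N>` follows the usual syslog convention where severity is `N % 8`.

```python
import re
from collections import Counter

# Pattern inferred from the log entries in this post (an assumption, not a
# documented format). Entries may be run together on one line or wrapped.
ENTRY = re.compile(
    r"<(?P<pri>\d+)>\s+(?P<ts>\w{3}\s+\d+\s+[\d:]+)\s+(?P<host>\S+)\s+"
    r"(?P<comp>\w+)\[(?P<task>\w+)\]:\s+(?P<src>\S+\(\d+\))\s+(?P<seq>\d+)\s+%%\s+"
    r"(?P<msg>.*?)(?=\s*<\d+>\s+\w{3}\s|\Z)",
    re.S,
)

def summarize(log_text):
    """Count log entries by (severity, component, source file and line)."""
    counts = Counter()
    for m in ENTRY.finditer(log_text.strip()):
        # Assumes standard syslog encoding: PRI = facility * 8 + severity.
        severity = int(m["pri"]) % 8
        counts[(severity, m["comp"], m["src"])] += 1
    return counts

# Four entries copied from the log above, run together as in the raw paste.
SAMPLE = (
    "<15> Mar 3 18:24:01 MD-SW-ACCESS-1 DRIVER[osapiTimer]: "
    "broad_hpc_stacking.c(608) 114525 %% Destination unit 7 is not ready "
    "<13> Mar 3 18:24:01 MD-SW-ACCESS-1 CARDMGR[cmgrInsertTask]: "
    "cardmgr_util.c(521) 114524 %% Card insert failed for unit/slot 7/0, wait for retry "
    "<12> Mar 3 18:24:01 MD-SW-ACCESS-1 DRIVER[cmgrInsertTask]: "
    "l7_usl_sm.c(2094) 114522 %% USL: Cold start on unit 7 failed, error code -4 "
    "<15> Mar 3 18:24:00 MD-SW-ACCESS-1 DRIVER[osapiTimer]: "
    "broad_hpc_stacking.c(608) 114520 %% Destination unit 7 is not ready"
)

if __name__ == "__main__":
    for (severity, comp, src), n in summarize(SAMPLE).most_common():
        print(f"severity={severity} {comp} {src}: {n}")
```

Sorting by count puts the repeating "not ready" entries first, which makes the less frequent card-insert and cold-start failures easier to pick out.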

 

Stack Port Diagnostics for the ports used to stack the switch in question:

1/0/3 RBYT:2720 RPKT:34 TBYT:2792 TPKT:35 RFCS:0 RFRG:0 RJBR:0 RUND:1 RUNT:1 TFCS:0 TERR:0

2/0/3 RBYT:2720 RPKT:34 TBYT:2792 TPKT:35 RFCS:0 RFRG:0 RJBR:0 RUND:1 RUNT:1 TFCS:0 TERR:0

7/0/51 RBYT:113018091 RPKT:232461 TBYT:90651015 TPKT:126899 RFCS:0 RFRG:0 RJBR:0 RUND:0 RUNT:0 TFCS:0 TERR:0

7/0/52 RBYT:82228630 RPKT:220577 TBYT:57840722 TPKT:197901 RFCS:0 RFRG:0 RJBR:0 RUND:0 RUNT:0 TFCS:0 TERR:0
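To compare the counters above without reading them by eye, a small Python sketch like the following can parse each line and list the ports with a non-zero error-type counter. Treating RFCS, RFRG, RJBR, RUND, RUNT, TFCS, and TERR as the "error" counters is my assumption based on their names, and `parse_counters`, `nonzero_errors`, and `ERROR_KEYS` are illustrative names only.

```python
# Assumption: these counters indicate receive/transmit errors; the rest
# (RBYT, RPKT, TBYT, TPKT) are plain byte and packet totals.
ERROR_KEYS = ("RFCS", "RFRG", "RJBR", "RUND", "RUNT", "TFCS", "TERR")

def parse_counters(line):
    """Split 'port KEY:value KEY:value ...' into (port, {KEY: int})."""
    port, *fields = line.split()
    return port, {key: int(value) for key, value in (f.split(":") for f in fields)}

def nonzero_errors(lines):
    """Return {port: {counter: value}} for ports with any non-zero error counter."""
    result = {}
    for line in lines:
        if not line.strip():
            continue  # skip blank lines between entries
        port, counters = parse_counters(line)
        bad = {k: counters[k] for k in ERROR_KEYS if counters.get(k, 0)}
        if bad:
            result[port] = bad
    return result

# The four diagnostic lines from this post.
DIAG = """
1/0/3 RBYT:2720 RPKT:34 TBYT:2792 TPKT:35 RFCS:0 RFRG:0 RJBR:0 RUND:1 RUNT:1 TFCS:0 TERR:0
2/0/3 RBYT:2720 RPKT:34 TBYT:2792 TPKT:35 RFCS:0 RFRG:0 RJBR:0 RUND:1 RUNT:1 TFCS:0 TERR:0
7/0/51 RBYT:113018091 RPKT:232461 TBYT:90651015 TPKT:126899 RFCS:0 RFRG:0 RJBR:0 RUND:0 RUNT:0 TFCS:0 TERR:0
7/0/52 RBYT:82228630 RPKT:220577 TBYT:57840722 TPKT:197901 RFCS:0 RFRG:0 RJBR:0 RUND:0 RUNT:0 TFCS:0 TERR:0
""".splitlines()

if __name__ == "__main__":
    print(nonzero_errors(DIAG))
```

On the lines above this flags only 1/0/3 and 2/0/3, which also show far lower packet totals than 7/0/51 and 7/0/52.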

 

Note: I am unable to add either model number of these switches to the "Model" field of this post. Mods, please update your tables.

7 Replies

  • Hi Greg,

     

    Welcome to the NETGEAR community!

     

     

    Could you share a simple topology diagram of the spine and leaf stack (showing which stack ports connect the spines and leaves)?

    Which SFP+ modules did you use for stacking? (Or DAC cables?)

    Did you set a high stack priority on both spine switches? (That keeps the two M4300-12X12F units as master and standby at all times.)

     

    Please also send us the diagnostic information as described at the following link:

    http://kb.netgear.com/app/answers/detail/a_id/31439

     

     

    • Greg_11
      Aspirant

      Hi again Dainel,

       

       The two 12x12f switches are set to Management and OprStandby with priority 15 and 14 respectively. All of the PoE switches have undefined priority and are Stack Members. The switches are connected with either DAC, CAT6 (to stack ports), SMF, or MMF. The DACs and all transceivers are NETGEAR-compatible parts from FiberStore. Diagram below. The switches that aren't stacking properly are Access-7 and Access-8. I'll get the diagnostic files uploaded shortly. Access-8 is not currently connected to the stack because I need the devices connected to it uplinked to our network. During the transition, the access stack shown is uplinked to our existing core switch (not shown).

      Diagram.png

      • Greg_11
        Aspirant

        I just noticed: on the diagram, the uplinks from the CORE stack to the firewall are only 1G, not 10G, even though the legend says everything is 10G.
