Investigating unbalanced MCIP ISLs (network utilization on one switch pair's ISL is much higher than the other). A tip to go on:
Keep a close look at the network port utilization of the back-end (MCIP Cisco in this case) switches and the QoS statistics latency show for the cluster domain. Perhaps the unbalance is down to the iSCSI paths to the remote disks. Maybe we can tweak those paths.
In the documentation we see:
Considerations for ISLs (netapp.com)
The maximum theoretical throughput of shared ISLs (for example, 240 Gbps with six 40 Gbps ISLs) is a best-case scenario. When using multiple ISLs, statistical load balancing can impact the maximum throughput. Uneven balancing can occur and reduce throughput to that of a single ISL.
So uneven balancing is kind-of to be expected (the statement above is more for when a pair of switches has multiple ISLs, not one per switch pair, but I think it is still relatable.)
statistics start -object ?
There are so many objects we can get metrics from in ONTAP but I don't see any that will help with our unbalanced ISLs.
statistics catalog object show
When I ran this on ONTAP 9.11.1 there were 692 objects. There was nothing specific to ISL (searched and found nothing.) There are various MCC objects:
- mcc_conifg
- mcc_drc
- mcc_hm_storage_bridge_fc_port
- mcc_hm_storage_bridge_sas_port
- mcc_hm_storage_switch
- mcc_perf_cluster
- mcc_perf_node
- mcc_perf_vserver
- mcc_storage
- mcc_subsystem
- mcc_vserver
- mcculp
The one that looked interesting was mcculp:
- mcculp : These counters track IO latency statistics pertaining to MCC interconnect DR node collected at MCC ULP layer.
But nothing useful in there pertaining to the ISLs.
QoS Satistics Latency Show
A useful check but again, nothing pertaining to the ISLs.
ActiveIQ
On the AFF A800, we have e0b and e1b as the HA and MetroCluster interfaces (HA traffic and disk traffic travels through these ports).
- e0b goes to MCIP switch 1
- e1b goes to MCIP switch 2
- Cluster 1 Node 1 e0b/e1b = 10.1.1.1 / 10.1.2.1
- Cluster 1 Node 2 e0b/e1b = 10.1.1.2 / 10.1.2.2
- Cluster 2 Node 1 e0b/e1b = 10.1.1.3 / 10.1.2.3
- Cluster 2 Node 2 e0b/e1b = 10.1.1.4 / 10.1.2.4
Comments
Post a Comment