InfiniBand Port Counters

Version 4

    The following are definitions for the various port counters reported by the InfiniBand diagnostic tools.

     

    SymbolErrors: The total number of minor link errors detected on one or more physical lanes. This includes 8B/10B coding violations and is typically an indication of a bit error on the line.

     

    LinkRecovers: The total number of times the Port Training state machine has successfully completed the link error recovery process.

     

    LinkDowned: The total number of times the Port Training state machine has failed the link error recovery process and downed the link.

     

    RcvErrors: The total number of packets containing an error that were received on the port.

    These errors include:

    • Local physical errors (ICRC, VCRC, FCCRC, and all physical errors that cause entry into the BAD PACKET or BAD PACKET DISCARD states of the packet receiver state machine)
    • Malformed data packet errors (LVer, length, VL)
    • Malformed link packet errors (operand, length, VL)
    • Packets discarded due to buffer overrun

     

    RcvRemotePhysErrors: The total number of packets marked with the EBP (End of Bad Packet) delimiter received on the port. This is typically due to a physical error that was detected and marked by an upstream port.

     

    RcvSwRelayErrors: The total number of packets received on the port that were discarded because they could not be forwarded by the switch relay.

     

    XmtDiscards: The total number of outbound packets discarded by the port because the port is down or congested.

     

    XmtConstraintErrors: The total number of packets not transmitted from the switch physical port for the following reasons:

     

    • FilterRawOutbound is true and the packet is raw.
    • PartitionEnforcementOutbound is true and the packet fails the partition key check or the IP version check.

     

    RcvConstraintErrors: The total number of packets received on the switch physical port that were discarded for the following reasons:

     

    • FilterRawInbound is true and the packet is raw.
    • PartitionEnforcementInbound is true and the packet fails the partition key check or the IP version check.
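
    The two conditions listed for XmtConstraintErrors and RcvConstraintErrors have the same shape in both directions, so they can be illustrated with one small predicate. This is only a sketch of the rule as worded above, not switch firmware logic; the function name and its parameters are hypothetical, and the two check results (pkey_ok, ip_version_ok) are assumed to be computed elsewhere.

```python
def constraint_error(filter_raw, partition_enforcement,
                     is_raw, pkey_ok, ip_version_ok):
    """Return True if a packet would be counted as a constraint error.

    filter_raw            -- FilterRawOutbound or FilterRawInbound
    partition_enforcement -- PartitionEnforcementOutbound or ...Inbound
    is_raw                -- True if the packet is a raw packet
    pkey_ok               -- True if the partition key check passed
    ip_version_ok         -- True if the IP version check passed
    (All names are illustrative assumptions, not a real API.)
    """
    # First listed reason: raw filtering is enabled and the packet is raw.
    if filter_raw and is_raw:
        return True
    # Second listed reason: partition enforcement is enabled and the
    # packet fails either the partition key check or the IP version check.
    if partition_enforcement and not (pkey_ok and ip_version_ok):
        return True
    return False


# Partition enforcement on, partition key check failed: counted.
print(constraint_error(False, True, is_raw=False,
                       pkey_ok=False, ip_version_ok=True))
# Neither filter enabled: never counted, whatever the packet looks like.
print(constraint_error(False, False, is_raw=True,
                       pkey_ok=False, ip_version_ok=False))
```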

     

    LinkIntegrityErrors: The number of times that the count of local physical errors exceeded the threshold specified by LocalPhyErrors.

     

    ExcBufOverrunErrors: The number of times that a run of consecutive flow control update periods, each having at least one overrun error, reached the length specified by OverrunErrors.
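
    The ExcBufOverrunErrors definition is easier to follow with a small simulation. The sketch below takes one boolean per flow control update period (True if that period had at least one overrun error) and counts how often a run of such periods reaches the OverrunErrors threshold. The function name is hypothetical, and restarting the run after each increment is an assumption made for illustration; the definition above does not say how the hardware handles runs longer than the threshold.

```python
def count_excessive_overruns(periods, overrun_errors):
    """Count runs of `overrun_errors` consecutive periods with an overrun.

    periods        -- iterable of bools, one per flow control update period
    overrun_errors -- the OverrunErrors threshold (run length)
    """
    count = 0
    run = 0  # length of the current run of consecutive bad periods
    for had_overrun in periods:
        if had_overrun:
            run += 1
            if run == overrun_errors:
                count += 1
                run = 0  # ASSUMPTION: the run restarts after being counted
        else:
            run = 0      # a clean period breaks the run
    return count


# Two bad periods, a clean one, then two more: no run of 3 is reached.
print(count_excessive_overruns([True, True, False, True, True], 3))
# Three bad periods in a row: the threshold of 3 is reached once.
print(count_excessive_overruns([True, True, True, False], 3))
```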

     

    VL15Dropped: The number of incoming VL15 packets dropped due to resource limitations (for example, lack of buffers) in the port.

     

    XmtData: The total number of data octets, divided by 4, transmitted on all VLs from the port. Multiply the counter value by 4 to obtain octets.

     

    RcvData: The total number of data octets, divided by 4, received on all VLs at the port. Multiply the counter value by 4 to obtain octets.
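
    Because XmtData and RcvData are reported as octets divided by 4, a raw counter value has to be scaled before it can be read as bytes. A minimal sketch of that conversion follows; the function name and the sample value are hypothetical.

```python
def data_counter_to_bytes(counter_value):
    """Convert an XmtData or RcvData counter value to octets (bytes)."""
    # The counters hold the octet count divided by 4, so scale back up.
    return counter_value * 4


xmt_data = 250_000_000  # hypothetical XmtData reading
print(data_counter_to_bytes(xmt_data))  # 1000000000
```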

     

    XmtPkts: The total number of packets transmitted on all VLs from the port.

     

    RcvPkts: The total number of packets received on all VLs at the port.
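
    The data and packet counters are cumulative totals, so throughput is usually derived from two readings taken some time apart. The sketch below shows one way to do that, assuming each reading is available as a dictionary keyed by the counter names used above and that the counters were neither reset nor at their maximum value between readings. The function name, the dictionary layout, and the sample values are all hypothetical.

```python
def port_rates(first, second, interval_seconds):
    """Derive byte rates and average packet sizes from two counter readings.

    first, second    -- dicts with XmtData, RcvData, XmtPkts, RcvPkts
    interval_seconds -- time elapsed between the two readings
    """
    # Data counters are octets divided by 4, so multiply the delta by 4.
    xmt_bytes = (second["XmtData"] - first["XmtData"]) * 4
    rcv_bytes = (second["RcvData"] - first["RcvData"]) * 4
    xmt_pkts = second["XmtPkts"] - first["XmtPkts"]
    rcv_pkts = second["RcvPkts"] - first["RcvPkts"]
    return {
        "xmt_bytes_per_sec": xmt_bytes / interval_seconds,
        "rcv_bytes_per_sec": rcv_bytes / interval_seconds,
        # Guard against division by zero on an idle port.
        "avg_xmt_packet_bytes": xmt_bytes / xmt_pkts if xmt_pkts else 0.0,
        "avg_rcv_packet_bytes": rcv_bytes / rcv_pkts if rcv_pkts else 0.0,
    }


# Hypothetical readings taken 2 seconds apart.
before = {"XmtData": 0, "RcvData": 0, "XmtPkts": 0, "RcvPkts": 0}
after = {"XmtData": 1000, "RcvData": 2000, "XmtPkts": 10, "RcvPkts": 20}
rates = port_rates(before, after, 2)
print(rates["xmt_bytes_per_sec"])     # 2000.0
print(rates["avg_rcv_packet_bytes"])  # 400.0
```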