Friday, January 10, 2014

Diagnosing Mellanox Fabrics and RDMA Aware Networks Programming User Manual

If you wish to diagnose the Mellanox/Voltaire Fabric, do the following

1. Reset all the fabric PM counters
# ibdiagnet -pc

2. If any of the provided PM is greater then its provided value than print it
# ibdiagnet -P all=1

You will find logs of the findings from ibdiagnet such as
# ls -l /var/tmp/ibdiagnet2/
ibdiagnet2.aguid
ibdiagnet2.db_csv
ibdiagnet2.debug
ibdiagnet2.log
ibdiagnet2.lst
ibdiagnet2.nodes_info
ibdiagnet2.pkey
ibdiagnet2.pm
ibdiagnet2.sm

3. To check out the errors in your Fabric Environment, check the generated ibdiagnet2.log file for more information.

For references on
  1. RDMA Aware Networks Programming User Manual (pdf)
  2. Mellanox OFED for Linux User Manual (pdf)

No comments: