Post Reply 
 
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Infiniband SRP boot
2012-12-06, 01:36
Post: #4
RE: Infiniband SRP boot
I am sorry you are right, it was mt25218 (my typo)

Intially when getting original message in first post SM log is pretty sparse
cat /var/log/opensm.0x0002c90200210dad.log | grep "Dec 04 07:"
0x01 -> osm_prtn_make_partitions: Partition configuration /etc/opensm/partitions.conf is not accessible (No such file or directory)
0x02 -> SUBNET UP
0x02 -> log_notice: Reporting Generic Notice type:3 num:67 (Mcast group deleted) from LID:1 GID:ff12:1405:ffff::3333:1:2
0x02 -> log_notice: Reporting Generic Notice type:3 num:67 (Mcast group deleted) from LID:1 GID:ff12:401b:ffff::fc0:988f
0x02 -> log_notice: Reporting Generic Notice type:3 num:67 (Mcast group deleted) from LID:1 GID:ff12:401b:ffff::fb
0x01 -> osm_prtn_make_partitions: Partition configuration /etc/opensm/partitions.conf is not accessible (No such file or directory)
0x02 -> SUBNET UP
0x01 -> osm_prtn_make_partitions: Partition configuration /etc/opensm/partitions.conf is not accessible (No such file or directory)
0x02 -> SUBNET UP
0x01 -> osm_prtn_make_partitions: Partition configuration /etc/opensm/partitions.conf is not accessible (No such file or directory)

I am running the subnet manager without a config just specifying the port that has a cable.

I was fed up with all the partitions.conf missing spam every 10secs, so read man page and created the following as /etc/opensm/partitions.conf
Default=0x7fff,ipoib:ALL=full

rebuilt ipxe.kpxe with debug=infiniband & many, many messages resulted in them scrolling off console to fast to read.
How should I capture them ? Will remote syslog be ok or will I need to source a serial cable?

cat /var/log/opensm.0x0002c90200210dad.log | grep "Dec 05 23:"
0x80 -> SM port is up
0x01 -> log_send_error: ERR 5411: DR SMP Send completed with error (IB_TIMEOUT) -- dropping
0x01 -> Received SMP on a 1 hop path: Initial path = 0,1, Return path = 0,0Dec 05 23:00:02 953051 [53662700] 0x01 -> sm_mad_ctrl_send_err_cb: ERR 3113: MAD completed in error (IB_TIMEOUT): SubnGet(NodeInfo), attr_mod 0x0, TID 0x2f9e7
0x01 -> sm_mad_ctrl_send_err_cb: ERR 3120 Timeout while getting attribute 0x11 (NodeInfo); Possible mis-set mkey?
0x80 -> Entering MASTER state
0x80 -> SUBNET UP
<Repeats lots of times>

I will try again tonight with DEBUG=arbel and remove /etc/opensm/partitions.conf to see if that is a red herring
Find all posts by this user
Quote this message in a reply
Post Reply 


Messages In This Thread
Infiniband SRP boot - johnp12345 - 2012-12-05, 01:23
RE: Infiniband SRP boot - thomil - 2012-12-05, 12:12
RE: Infiniband SRP boot - johnp12345 - 2012-12-06 01:36
RE: Infiniband SRP boot - thomil - 2012-12-06, 13:06
RE: Infiniband SRP boot - johnp12345 - 2012-12-07, 03:09
RE: Infiniband SRP boot - johnp12345 - 2012-12-07, 04:22
RE: Infiniband SRP boot - mcb30 - 2013-07-05, 14:22
RE: Infiniband SRP boot - CGen - 2013-07-03, 10:26
RE: Infiniband SRP boot - CGen - 2013-07-03, 19:09
RE: Infiniband SRP boot - mcb30 - 2013-07-05, 14:24



User(s) browsing this thread: 1 Guest(s)