Infiniband SRP boot
|
2012-12-06, 01:36
Post: #4
|
|||
|
|||
RE: Infiniband SRP boot
I am sorry you are right, it was mt25218 (my typo)
Intially when getting original message in first post SM log is pretty sparse cat /var/log/opensm.0x0002c90200210dad.log | grep "Dec 04 07:" 0x01 -> osm_prtn_make_partitions: Partition configuration /etc/opensm/partitions.conf is not accessible (No such file or directory) 0x02 -> SUBNET UP 0x02 -> log_notice: Reporting Generic Notice type:3 num:67 (Mcast group deleted) from LID:1 GID:ff12:1405:ffff::3333:1:2 0x02 -> log_notice: Reporting Generic Notice type:3 num:67 (Mcast group deleted) from LID:1 GID:ff12:401b:ffff::fc0:988f 0x02 -> log_notice: Reporting Generic Notice type:3 num:67 (Mcast group deleted) from LID:1 GID:ff12:401b:ffff::fb 0x01 -> osm_prtn_make_partitions: Partition configuration /etc/opensm/partitions.conf is not accessible (No such file or directory) 0x02 -> SUBNET UP 0x01 -> osm_prtn_make_partitions: Partition configuration /etc/opensm/partitions.conf is not accessible (No such file or directory) 0x02 -> SUBNET UP 0x01 -> osm_prtn_make_partitions: Partition configuration /etc/opensm/partitions.conf is not accessible (No such file or directory) I am running the subnet manager without a config just specifying the port that has a cable. I was fed up with all the partitions.conf missing spam every 10secs, so read man page and created the following as /etc/opensm/partitions.conf Default=0x7fff,ipoib:ALL=full rebuilt ipxe.kpxe with debug=infiniband & many, many messages resulted in them scrolling off console to fast to read. How should I capture them ? Will remote syslog be ok or will I need to source a serial cable? cat /var/log/opensm.0x0002c90200210dad.log | grep "Dec 05 23:" 0x80 -> SM port is up 0x01 -> log_send_error: ERR 5411: DR SMP Send completed with error (IB_TIMEOUT) -- dropping 0x01 -> Received SMP on a 1 hop path: Initial path = 0,1, Return path = 0,0Dec 05 23:00:02 953051 [53662700] 0x01 -> sm_mad_ctrl_send_err_cb: ERR 3113: MAD completed in error (IB_TIMEOUT): SubnGet(NodeInfo), attr_mod 0x0, TID 0x2f9e7 0x01 -> sm_mad_ctrl_send_err_cb: ERR 3120 Timeout while getting attribute 0x11 (NodeInfo); Possible mis-set mkey? 0x80 -> Entering MASTER state 0x80 -> SUBNET UP <Repeats lots of times> I will try again tonight with DEBUG=arbel and remove /etc/opensm/partitions.conf to see if that is a red herring |
|||
« Next Oldest | Next Newest »
|
Messages In This Thread |
Infiniband SRP boot - johnp12345 - 2012-12-05, 01:23
RE: Infiniband SRP boot - robinsmidsrod - 2012-12-05, 09:50
RE: Infiniband SRP boot - thomil - 2012-12-05, 12:12
RE: Infiniband SRP boot - johnp12345 - 2012-12-06 01:36
RE: Infiniband SRP boot - thomil - 2012-12-06, 13:06
RE: Infiniband SRP boot - johnp12345 - 2012-12-07, 03:09
RE: Infiniband SRP boot - johnp12345 - 2012-12-07, 04:22
RE: Infiniband SRP boot - mcb30 - 2013-07-05, 14:22
RE: Infiniband SRP boot - CGen - 2013-07-03, 10:26
RE: Infiniband SRP boot - CGen - 2013-07-03, 19:09
RE: Infiniband SRP boot - mcb30 - 2013-07-05, 14:24
|
User(s) browsing this thread: 1 Guest(s)