15:02:34 #startmeeting neutron_northbound 15:02:34 Meeting started Mon Jul 17 15:02:34 2017 UTC. The chair is yamahata. Information about MeetBot at http://ci.openstack.org/meetbot.html. 15:02:34 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 15:02:34 The meeting name has been set to 'neutron_northbound' 15:02:40 #chair mkolesni rajivk 15:02:40 Current chairs: mkolesni rajivk yamahata 15:02:48 #topic agenda bashing and roll cal 15:02:51 #info mkolesni 15:02:52 #info yamahata 15:02:59 #info rajivk 15:03:01 #info mbuil 15:03:16 #link https://wiki.opendaylight.org/view/NeutronNorthbound:Meetings 15:03:38 any topics to discuss in addition to patches/buts? 15:03:51 we can talk about the gate 15:04:00 espeically pike-3 is almost near, so we need to finally make decision on patches. 15:04:12 Yeah, gate is very unstable. 15:04:14 also yamahata when will you know if you make it to ptg? 15:04:28 Not yet, hopefully in this week or next week. 15:04:48 ok please let us know when you know concretely 15:04:52 sure. 15:05:01 cool 15:05:25 anything else? 15:06:07 ok, move on 15:06:08 #topic Announcements 15:06:20 pike-3 is next week. 15:06:39 openstack cfp was closed. 15:07:15 ODL nitrogen karaf distribution is still unstable. 15:07:36 also ODL neutron northbound build is also unstable due to integration-test failure. 15:07:43 anything else to announce? 15:09:09 seems nothing else. move on 15:09:15 #topic Pike/Nitrogen planning 15:09:21 For pike, pike-3 is feature freeze. 15:09:26 as pike-3 is next week id like to request that we progress with https://review.openstack.org/#/q/status:open+project:openstack/networking-odl+branch:master+topic:bp/dep-validations-on-create 15:09:29 We have three major issue. 15:09:58 Yeah, we're running out of time, we need to make decision. 15:10:14 For https://review.openstack.org/#/c/453581/ I don't see how we can come to consensus. 15:10:19 But we need to make progress. 15:10:28 as we discussed you said you will give it another look 15:10:53 we can progress and resolve bugs later 15:11:00 If we merge the patch, mkolesni at lest I'd like you to commit to stabilize CI, tempest with v2driver. 15:11:18 Is it okay? 15:11:19 im already working on stabilizing CI 15:11:37 Cool. then can you please rebase to resolve merge conflict? 15:11:40 im investigating it further to understand whats going on 15:11:46 then we can make progress. 15:11:46 yes ill rebase it 15:12:18 also im reviewing the patches by rajivk_ 15:12:20 Regarding to https://review.openstack.org/#/c/444648/ , have you looked at https://review.openstack.org/#/q/topic:bug/1683797 ? 15:12:33 mkolesni, if some help is needed, please let me know. 15:12:51 then we can make the number of threads configurable and single timer in main process. 15:12:56 Does that work for you? 15:13:11 The series includes your patch, 444648. 15:13:33 i didnt yet because i started investigating CI 15:13:41 Okay. 15:13:43 but as this is bug fix it is not constrained by FF 15:13:48 0so not critical 15:14:11 cool. 15:14:57 Do you mean https://review.openstack.org/#/c/474851/ ? 15:15:49 no this one has to get in by next week 15:16:39 I see. 15:16:57 another big patch is, https://review.openstack.org/#/c/465735/ dhcp port creation. 15:17:27 my understanding is, it can be merged (after several review if necessary.) 15:18:22 rajivk: has several bug fix patches, but they can be after pike-3. 15:18:34 any other patches for Pike-3? 15:19:43 seems nothing. 15:19:54 For Pike, another requirement is documentation migration. 15:20:13 https://review.openstack.org/#/c/483607/ is one of the patch 15:20:25 And I think it's good occasion to improve doc. 15:20:47 #link https://specs.openstack.org/openstack/docs-specs/specs/pike/os-manuals-migration.html 15:21:06 #topic patches/bugs 15:21:26 we can discuss CI stuff. 15:21:30 i rebased https://review.openstack.org/#/q/status:open+project:openstack/networking-odl+branch:master+topic:bp/dep-validations-on-create 15:21:35 please review it 15:21:44 oh, how fast it is. 15:21:57 #action yamahata review https://review.openstack.org/#/q/status:open+project:openstack/networking-odl+branch:master+topic:bp/dep-validations-on-create 15:21:59 ci stuff, so investigation revelased the problem with port status update which jhershbe fixed 15:22:10 revealed 15:22:27 mkolesni: please go ahead. 15:22:40 the problem is that ci jobs time out generally and then logs dont get collected 15:22:48 so its hard to know what happened 15:23:08 when timeout occurs, logs aren't collected unfortunately. 15:23:20 i have a "WIP" patch that reduces the timeouts in order to capture the logs but for some reason even that doesnt help sometimes 15:23:36 a bit before I thought it's collected. I suspect something changed with jenkins. 15:23:51 maybe its something we can check on project-infra 15:24:11 but im not that knowledgable about it so i dont know 15:24:43 what i noticed now in some jobs theres a problem with the dhcp but i didnt manage to pin point it 15:24:51 Another possibility is, the ssh failures occur with specific tests. we can disable half of them. 15:24:52 so im continuing investigation 15:25:11 i noticed one thing. 15:25:13 I noticed that port binding of dhcp port sometimes fails at early tempest tests. 15:25:23 i dont think its a good idea, the amount of tests failing is very high its not a few 15:25:33 yes i saw that as well 15:25:36 It's because the first hostconfig update isn't done when subnet is created. 15:26:01 i suspect its because of change of binding from topology to pseudo 15:26:20 network topology is also similar characteristic. 15:26:30 Anyway I'm thinking of BEFORE_CREATE/UPDATE of subnet. 15:26:47 when binding was topology i think we had much less sporadic failures 15:26:49 if hostconfig isn't there, we can try to update hostconfig. 15:27:32 what do u mean? 15:27:41 s/if hostconfig isn't there/if agentdb entry doesn't exist/ 15:27:55 but its part of provisioning 15:28:14 maybe it needs to be done at some earlier stage in the provisioning 15:28:52 networking-odl retrieaves it into agentdb. If it didn't try to retrieve it, enforce networking-odl to retrieve it when port binding. 15:29:30 thats a heavy solution 15:29:56 im not sure its a good idea to have port binding stuck retrieving it 15:30:07 when port binding occurs, we know hostname, we can retrieve only single hostconfig. 15:30:15 Not entire list of hostconfig. 15:30:29 i think in the set up of the ci the hostconfig needs to happen before the tempest run 15:30:50 And it's only before first full retrieval. 15:31:07 i think in a real env this problem wont exist 15:31:21 In tempest case, we can fix it. But we can't enforce cloud admin to do that. 15:31:47 if he uses pseudo agent he will run hostconfig before bringing up neutron 15:32:09 then whether neutron or odl comes up first it should sync up when both are up 15:32:23 not sure whats the current situation but thats the desired one IMHO 15:32:36 Hmm, are you saying we should introduce a utility to update agentdb based on hostconfig before running neutorn server? 15:33:11 no im not familiar with it enough to know the solution 15:33:37 wouldnt that be going back to the old way of preconfiguring everything in neutron? 15:33:54 hostconfig in ovs has to be populated. 15:33:55 No. 15:33:59 I don't get your question. 15:34:08 hostconfig in ovsdb needs to be populated. 15:34:12 That's right. 15:34:31 im not really proficient in this area of the driver im afraid.. :/ 15:34:42 The issue is that, networking-odl has to retrieve those value via ODL and then update agentdb 15:35:11 anyway I'll try to cook experimental patch and we'll see the outcome. 15:35:16 ok 15:35:39 ill keep digging to find the problems 15:35:57 the amount of rechecks right now is too much 15:36:08 Yeah, the issue may reside in ODL netvirt. In that case, we can ask netvirt folks. 15:36:26 no i see also failures in old netvirt 15:36:38 which of course has bugs but these failures seem consistent 15:37:02 ohhh, it sounds like issues in networking-odl (or neutorn) 15:37:18 another issue is ODL L3 plug is broken for long time. 15:37:39 i havent looked at that, do you have an idea whats wrong there? 15:38:06 yeah 15:38:25 It uses wrong db session 15:38:39 you mean the deprecated one? 15:38:41 In the past, there was an attempt to fix it, https://review.openstack.org/#/c/356839/ 15:39:14 After that, no one paid serious attention to it. 15:39:52 yamahata: does that mean that we should not use odl-router_v2 in the neutron service plugins? 15:39:56 ill take a look at it i dont remember the details 15:40:28 mbuil: we should fix it at least. 15:41:08 The direction is to fix odl l3 or to switch l3 flavor(service driver) 15:41:38 and L3 plugin has been modified heavily. 15:41:54 yamahata: I thought after ODL Carbon release, that was the only way to do L3. I was told that the service_plugin=router (the one coming from OPenStack) was not working with ODL Carbon. 15:42:58 mbuil: Maybe that's right. 15:43:14 At least we know the current odl l3 v2 driver is broken. and it should be fixed for Pike release. 15:43:24 i dont see any use of GUARD_TRANSACTION in neutron, please check with kevinbenton when will they remove it cause they might do it before pike release 15:43:59 Yeah, the l3 plugin code has been heavily modified by kevin. 15:44:26 midonet tried l3 flavor as experiment. he found that there are many thing to do. 15:44:39 if they throw it out before pike is released well be in real sh*t 15:44:43 That's the right way in long term. 15:44:59 can you check with kevin on that? 15:45:16 For Pike, we need to decide the direction. 15:45:46 that's obvious from git log. 15:45:57 Kevin has been worked on L3 issue since Ocata. 15:46:20 In Pike he reached the point that experimental midonet l3 flavor works somehow. 15:46:27 not sure what you mean "that's obvious from git log" 15:46:53 its obvious that " # FIXME(kevinbenton): get rid of all uses of this flag 15:46:53 " 15:47:00 #link http://lists.openstack.org/pipermail/openstack-dev/2017-July/119657.html 15:47:47 seems he had no reply 15:47:52 Not yet. 15:48:23 Anyway ODL or midonet (or other out-of-tree) needs l3 flavor,. 15:48:34 So it is us (and yamamoto) to drive the discussion. 15:49:52 agent based L3 doesn't need it. 15:50:18 maybe you should reply on the email so they know l3 is broken for us 15:50:53 Yeah, I wanted to see Kevin's reply first. 15:51:49 anything else to discuss? 15:51:54 mbuil: do you have any? 15:52:58 yamahata: well, just a question, I am trying to learn about odl-router_v2 and how to configure it when using port_binding_controller = pseudo-agentdb-binding 15:53:20 yamahata: I am not sure if there is a guide about that, the only thing I found is http://git.openstack.org/cgit/openstack/networking-odl/tree/doc/source/devref/hostconfig.rst 15:53:46 yamahata: apparently, it is important to add “bridge_mappings”: {“physnet1":"br-ex”} to the OVS config, but I am not sure if that is the only thing 15:53:50 For L3? just specify odl-router_v2 for l3 plugin 15:54:11 Nothing special in hostconfig for L3. 15:54:56 ODL L3 is mentioned because of future plan. 15:55:15 Right now you don't have to specify anything for L3 in hostconfig of ovsdb. 15:55:40 #topic open mike 15:56:01 yamahata: ok. and something I asked before and we were not sure. If I have two computes with OVS, should I have two ODL L2 agents when listing the network agents? 15:56:05 Probably we should improve the doc to avoid confusion. 15:56:17 Right. 15:57:07 ok, thanks about both questions. I will try to deploy l3 with ODL in the next days. In case I have Netvirt issues, do you know somebody who could help? 15:57:29 It's netvirt. #opendaylight-netvirt 15:57:35 you can find many there. 15:58:18 also there is mailing list. 15:58:22 https://lists.opendaylight.org/mailman/listinfo/netvirt-dev 15:58:30 you'd like to cc to odl-neutorn-dev too 15:58:40 yamahata: ok, thanks. I will update you guys with the progress 15:58:42 https://lists.opendaylight.org/mailman/listinfo/neutron-dev 15:58:53 yamahata: I am already in that list :) 15:59:09 Cool. 15:59:13 anything else? 15:59:53 thank you everyone. 16:00:02 #topic cookies 16:00:07 #endmeeting