14:08:09 #startmeeting oVirt Infra 14:08:09 Meeting started Mon Jul 7 14:08:09 2014 UTC. The chair is dcaro. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:08:09 Useful Commands: #action #agreed #help #info #idea #link #topic. 14:08:11 eedri, ! 14:08:22 #chair eedri obasan knesenko ewoud 14:08:22 Current chairs: dcaro eedri ewoud knesenko obasan 14:08:29 eedri, bkp is also about I think 14:08:42 #chair dneary 14:08:42 Current chairs: dcaro dneary eedri ewoud knesenko obasan 14:08:55 * dneary will be lurking 14:09:01 * rbarry also lurking 14:09:12 YamakasY, maybe also want to listen ;) 14:09:14 dcaro, ok.. 14:09:43 #chair rbarry 14:09:43 Current chairs: dcaro dneary eedri ewoud knesenko obasan rbarry 14:09:55 #topic Hosting 14:10:25 I have good news there, finally the storage servers are functional 14:10:38 I'll install one of the hosts and the engine this week to start the tests 14:10:55 #info storage server at PHX functional 14:11:28 #action install one host with hosted engine and test thephx lab setup 14:11:36 dcaro, +1.. great 14:11:45 dcaro, so we're expecting something like 13TB mirrored right? 14:11:56 eedri: yep 14:12:12 dcaro, did we manage to get them to bond the network between storage servers? 14:12:53 eedri: they have a floating ip that pacemaker+cman/crm manage if that's what you mean 14:13:27 eedri: but right now they are using only one interface, so no bonding can be done yet (lacking the physical cables) 14:13:51 dcaro, do we have some sort of tracking/tickets we can use? 14:14:02 dcaro, for stuff we need from therm 14:14:23 eedri: just the email thread and the internal docs 14:14:38 dcaro, ok, worth checking if we can open tickets somehow, will be easier to follow 14:14:47 eedri: ok 14:15:15 eedri: misc you upgraded the alterway02 host, ming saying a few words about hte issues/solutions? 14:15:28 well, yes 14:15:42 first, the upgrade didn't exactly wrok fine from a rpm point of view 14:15:48 I had to use yum-shell 14:15:48 misc, might be one of the only one who ever upgraded all in one i think from 3.2 to 3.4 :) 14:16:06 misc, but maybe i'm wrong.. 14:16:17 didn't took note on the problem, and since ewoud told we were using non official RPM, and likely non supported jump, I didn't looked more 14:16:21 ( and just fixed ) 14:16:36 then ewoud made the upgrade, using engine-setup 14:16:53 and now, I am trying to figure why the iso domain is down ( or rather, how to make it up ) 14:16:58 misc, yea, that's why we're aiming at more formal release and installation for phx2 - hosted-engine rather than all in one 14:17:26 eedri: I tought we were using hosted-engine already ? 14:17:33 I guess I need to RTFM a bit more :) 14:18:04 anyway, so it seems to work, besides that issue with storage domain, and I would appreciate that someone who have more than 3 days of experience with ovirt could take a look :) 14:19:15 misc: no, currently it's all-in-one installs 14:19:22 #info upgraded alterway02, but failing ot get iso domain up 14:19:23 misc, no, for alterway and rackspace it's all-in-one 14:19:33 misc, hosted-engine was only introduced in 3.4 formally 14:19:36 #action brin up the iso domain for alterway02 14:20:03 I must admit fixing it was also a bit hacky by manually changing the database settings file 14:20:03 misc: everything else is working fine?¿ 14:20:12 dcaro: servers are running 14:20:23 misc: that's not what I asked ;) 14:20:25 we still need to schedule a reboot 14:20:27 ojorge: Hi, I was disconnected for a few minutes, how is the upgrade going? 14:20:28 ewoud, we should take notice of that for next upgrade 14:20:40 it is possible that after a reboot servers won't come up 14:20:51 ewoud, plan is to migrate all alterway eventually to a production DC on phx2 14:20:57 ewoud, and use alterways as hypervisors 14:21:03 dcaro: well, I think the regular domain is also down, whcih is weird, i think stuff are in a incosnistant state, but I do not know how much 14:21:18 IMHO it's best practice to reboot after a big upgrade and it's needed for a new kernel anyway 14:21:23 yeah 14:21:32 but I would first make sure it work 14:21:40 agreed 14:21:46 eedri: wazzup ? 14:21:53 YamakasY, hey, we're in a meeting 14:21:53 how scared are we that we can't reach it at all after a reboot? 14:21:54 misc: ok, will have to take a deep look to see if we can avoid losses at reboot 14:22:05 i.e. do we need to have someone from alterway on standby? 14:22:06 eedri: mhh... I'm not :D 14:22:07 :P 14:22:11 ewoud, best to sync it with alteray contact (kevin?) 14:22:22 ewoud, so we'll have local people there to help 14:22:35 dneary: ^ 14:22:40 ewoud, i don't think we have console access there 14:22:49 eedri: I don't think so either 14:23:15 ewoud, eedri, misc: I think Kévin left AlterWay 14:23:28 dneary, who is our contact there now? 14:23:38 ewoud, eedri, misc: Hervé Leclerc is the best contact person there now - he will tell us who else can help us if needed 14:23:52 dneary, you have his email? 14:24:04 dneary, best to update our file with it 14:24:09 eedri, Yes - I believe he's also on infra@ 14:24:49 eedri, sent directly just to avoid it being in the public log 14:26:41 dneary, thanks 14:28:13 dneary: thanks! 14:28:26 np! 14:28:56 ok, so the action course should be -> Take a deep look -> contact hlecrerc -> reboot and cross fingers 14:28:58 ? 14:29:48 dcaro, yea.. 14:30:07 dcaro, misc ewoud i updated the file with hlecrerc email 14:30:22 misc, so if you're planning a reboot, might worth shooting him an email 14:30:33 #action After checking alterway02, program reboot with alterway assistance 14:31:50 ok, anythin else about hosting? 14:33:31 moving then 14:33:35 #topic Jenkins 14:33:55 #info We have ppc slaves available! 14:34:10 Still need to set them up and configure jobs to use them though 14:34:26 (set them up meaning install puppet if able and all the slave deps) 14:34:49 knesenko: want to talk a bit about the copr jobs? 14:35:46 dcaro, +1 14:36:01 ewoud, you know if we should expect any issues adding puppet + foreman to it? 14:36:08 ewoud, i.e power pc 64 slave 14:36:43 eedri: as long as packages are available with the same name, I wouldn't expect any 14:37:24 dcaro: no prblem 14:37:36 dcaro: just let me know when 14:37:38 :) 14:37:49 anyone familiar with this error: ***L:ERROR Internal error: type object 'Stages' has no attribute 'DB_CONNECTION_SETUP' 14:37:59 and how to work around it? 14:38:23 kobi, we're in the middle of infra meeting 14:39:06 dcaro, also, work has started on 3.5 rpm jobs, via yaml 14:39:31 yep, will try to get generic easy to use templates so it0s easy to add new jobs 14:39:48 and create a howto for it 14:43:11 dcaro, +1 14:44:06 dcaro, anything else on jenkins? 14:44:16 dcaro, i think there is also el7 slave right? 14:44:24 true! 14:44:56 #info new el7 physical host jenkins-slave-host06.ovirt.org 14:45:12 we can start migrating and testing jobs there 14:45:42 dcaro, yea 14:46:03 knesenko, want to talk about copr ? 14:46:22 eedri: sure 14:46:35 eedri: so we have a new jobs for building pkgs on copr 14:46:41 what is copr ? 14:46:58 copr is open source build system related to Fedora folks 14:47:21 here we have a list of projects that we can build on copr build system 14:47:22 http://copr.fedoraproject.org/coprs/ovirt/ 14:47:51 here we have jobs to trigger the build on copr build system - http://jenkins.ovirt.org/view/copr/ 14:48:11 this will reduce the load on our slaves when we are planning to build for official releases ... 14:49:08 next step is to add more jobs and implement a script to collect all builds from all projects and compose a single repo 14:49:16 eedri: dcaro that's all from my side I think 14:49:23 they can also serve as mirror AFAIK 14:49:32 do we also plan to use them instead of resources.ovirt.org? 14:49:40 ewoud, i don't think we can.. 14:49:42 ewoud: I am not sure we can 14:49:48 ewoud, they have a retention policy 14:49:54 ewoud, deleting old builds for starts 14:50:05 ewoud, and they don't support non centos/fedora builds 14:50:11 ewoud: they provide a repo per project ... 14:50:17 means - vdsm:repo, engine:repo 14:50:20 and so on 14:51:26 #info new jobs were added to support official ovirt builds via copr system 14:51:29 ok 14:51:36 #link http://jenkins.ovirt.org/view/copr/ 14:51:58 dcaro, lets move to review some critical tickets? 14:52:03 eedri: sure 14:52:08 so they're not meant to be used by end users 14:52:13 #topic Tickets 14:52:23 dcaro, btw, i prepared a query for tickets solved in june 14:52:24 dcaro, http://goo.gl/ddyIvl 14:52:48 nice :) 14:53:11 #link https://fedorahosted.org/ovirt/query?status=closed&changetime=4+weeks..1+day&order=priority&report=9&col=id&col=summary&col=changetime&col=status&col=type&col=priority&col=milestone&col=component 14:53:19 wow... xd 14:53:31 hehe 14:53:54 https://fedorahosted.org/ovirt/ticket/188 14:54:00 ovirt-appliance can not be build because of unreachable mirror 14:54:25 fabiand, rbarry here? 14:54:36 dcaro, seems like a new dep is needed on the slaves? 14:54:38 here 14:54:46 The unreachable mirror problem is fixed, thanks to dcaro! 14:54:52 \0/ 14:55:06 fabiand: you sent a patch for the deps right? Was it merged? 14:55:16 http://gerrit.ovirt.org/#/c/29584/ 14:55:19 I am not sure yet .. 14:55:19 no it's not :) 14:55:25 dcaro, but that#s just a "drop on the hot stsone". 14:55:32 I'll review 14:55:32 There are other issues which prevent building the appliance .. 14:55:56 fabiand: so only f19 slaves will be used for that job? 14:56:03 dcaro, yes 14:56:12 fabiand: okok 14:56:17 dcaro, I also just pinned it to one specific host, to make it more reproducable. 14:56:53 fabiand: take into account that it will get blocked when loaded and will fail when that slave is not there anymore, I recommend pinning it only for the tests or when debugging 14:57:05 eedri: Here 14:57:14 dcaro, I fully agree! Once it's stable I'm happy to pin it to the f19 tag 14:57:20 okok :) 14:59:19 dcaro, fabiand next one is also from you :) 14:59:25 * fabiand hides :) 14:59:34 fabiand, https://fedorahosted.org/ovirt/ticket/177 14:59:43 update ovirt-node package in jenkins servers 14:59:49 Oh that one. 14:59:54 not sure if there is still action item there 14:59:56 I honestly did not understand that one .. 15:00:20 hehehe 15:00:22 dougsland, ? 15:00:28 dougsland, can you elaborate on that ticket? 15:00:40 dougsland, is it still needed 15:02:15 eedri, closed 15:02:20 dougsland, great 15:02:31 fabiand, 3rd one is the charm! (or icecream) 15:02:40 https://fedorahosted.org/ovirt/ticket/136 15:02:47 you're keeping the infra team busy :) 15:04:47 I think we are out of time 15:04:56 dougsland, can you help with https://fedorahosted.org/ovirt/ticket/123? 15:05:01 any issue we must discuss before losing? 15:05:15 dcaro, let's have a quick look on the urgent tickets 15:05:16 hahaha, s/losing/closing/ 15:05:31 dcaro, hehe 15:05:38 dcaro, we always win! never lose 15:05:56 eedri, I think so. 15:06:04 dcaro, unless there is some needinfo on those critical tickets, i guess we can close and tackle it offline 15:06:09 dougsland, so you can provide el7 builds? 15:06:14 dougsland, for vdsm deps.. 15:06:23 eedri, is it downstream or upstream ? 15:06:27 dougsland, ovirt 15:06:34 dougsland, using copr or koji i guess 15:06:51 ok, so closing the meeting 15:06:56 dcaro, +1 15:07:08 #endmeeting