Wikimedia Release Engineering Team/Checkin archive/20180319

= 2018-03-19 =

Vacations/Important dates

 * https://office.wikimedia.org/wiki/HR_Corner/Holiday_List
 * How to do it


 * March 20th dentist 8am thcipriani (should be back before it matters, I hope)
 * Mar 26-29 (full week, since Fri is a WMF holiday): thcipriani vacation
 * Mar 30 (Fri): WMF Holiday
 * April 2: Željko (Holidays in Croatia - Easter Monday)
 * Apr 3-13: Greg vacation
 * April 16 (Mon): WMF Holiday
 * May 1: Željko (Holidays in Croatia - Labor Day / May Day)
 * May 15?/16/17: Team offsite in Barcelona
 * May 18-20: Wikimedia Hackathon in Barcelona
 * May 21 (Mon): Tech-Mgt F2F
 * May 31: Željko (Holidays in Croatia - Corpus Christi)

Train

 * Maniphest query for deployment blocker tasks: https://phabricator.wikimedia.org/maniphest/?project=PHID-PROJ-fmcvjrkfvvzz3gxavs3a&statuses=open%28%29&group=none&order=newest#R


 * Feb 19 - wmf.22 - Mukunda
 * Feb 26 - wmf.23 - Tyler
 * Mar 05 - wmf.24 - Tyler
 * Mar 12 - wmf.25 - Chad
 * Mar 19 - wmf.26 - Chad <
 * Mar 26 - wmf.27 - Mukunda
 * Apr 02 - wmf.28 - Mukunda
 * Apr 09 - wmf.29 - Tyler
 * Apr 16 - wmf.30 - Tyler

SoS

 * Feb 19 - Chad
 * Feb 26 - Mukunda
 * Mar 05 - Mukunda
 * Mar 12 - Tyler
 * Mar 19 - Tyler <
 * Mar 26 - Chad
 * Apr 02 - Chad
 * Apr 09 - Mukunda
 * Apr 16 - Mukunda

Scrum of Scrums

 * Greg to copy to etherpad after meeting: https://etherpad.wikimedia.org/p/Scrum-of-Scrums

Release Engineering

 * Blocking
 * Blocked
 * Updates
 * Minor Gerrit upgrade planned for this week (2.14.6 -> 2.14.7)
 * Analysis of the last year’s worth of incident reports started last week
 * Scap 3.7.7 should be rolled out to production this week

Release Engineering

 * Blocking
 * Scoring platform release of scap 3.8 (I think mukunda tagged that, but I'll double check)
 * Blocked
 * SRE: Minikube packaging stuff https://phabricator.wikimedia.org/T184457
 * Updates
 * 1.31.0-wmf.25 going out this week, if you see blockers: https://phabricator.wikimedia.org/T183964
 * greg is out this week, FYI
 * Quarterly goal dependency update:
 * Continue improving the ways that users can download articles of interest for later consumption
 * Reading Web: Tech Ops/RelEng
 * Update:

Train status and happenings

 * https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Roles#Train_Conductor


 * mostly boring

Past week status updates

 * All of it in table form: https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Goals/201718Q3

Program 1: Outcome 5: Milestone 1: Develop and migrate to a JavaScript-based browser testing stack

 * Due: End of this quarter
 * What: Specific improvements to the now-canonical framework, see task T182421, notably:
 * Upgrade WebdriverIO to version 4.9
 * Investigate replacing nodemw with mwbot
 * Video recording for Selenium tests in Node.js
 * Task:


 * T180144 Upgrade WebdriverIO to 4.12.0 - done, waiting for reviews
 * T179188 Video recording for Selenium tests in Node.js - initial research done
 * T182691 Selenium tests should be easier to run - had an idea but got stuck on upstream bug https://github.com/webdriverio/webdriverio/issues/2657
 * T187859 Move one Selenium test from mediawiki/core to mediawiki/skins/Vector - stuck on understanding the problem, asked for help

Program 1: Outcome 5: Objective 1: Maintain existing shared Continuous Integration infrastructure

 * Goals
 * Draft requirements for a Kubernetes based solution for CI -
 * Migrate MediaWiki PHPUnit tests to Shipyard (docker-based CI) (~40% of Nodepool usage) -
 * Will be worked on after the long tail
 * Unify production and CI docker image build process -
 * ✅ 01/15

Program 3: Outcome 1: Objective 2: Identify and find stewards for high-priority/high use code segment orphans

 * Due: End of quarter

Program 3: Outcome 2: Objective 2: Define and implement a process to regularly address technical debt across the Foundation

 * Due: End of quarter

Program 3: Outcome 2: Objective 3: Promote and surface important technical debt topics at large gatherings of Wikimedia developers (e.g., DevSummit and Hackathon(s))

 * Due: End of next quarter

Program 6: Outcome 2: Objective 2: Set up a continuous integration and deployment pipeline

 * Due: End of this quarter
 * Keyword: SSD
 * phab project: https://phabricator.wikimedia.org/project/view/2453/
 * Goal:
 * Verify basic functionality of 'production' deployment and image (initially targeting mathoid):
 * Functional PoC within integration in the deployment-pipeline
 * Deploy to isolated k8s


 * meeting last week went well
 * helm is packaged
 * still need minikube packaging
 * still need puppet work
 * still need ci work
 * this is a lot :(

Program 1: Outcome 1: Objective 1: Scap (Tech Debt Sprint FY201718-Q2)

 * workboard


 * Working with awight on scap + git-lfs testing on beta
 * https://phabricator.wikimedia.org/T180627
 * Still waiting for 3.7.7 deployment

Program 1: Outcome 6: Milestone 2: Maintain Phabricator

 * Streamline logspam workflows by adding some integration with phabricator
 * Store git-lfs (and other phab uploads) in swift:


 * Still working on swift backend

Antoine

 * What I plan to do this week
 * reviews, CI tasks as usual
 * Finish longtail of jobs to Docker https://phabricator.wikimedia.org/T187797
 * Really only one left as of Monday night: the job that builds wikimedia/portals/deploy https://phabricator.wikimedia.org/T190073
 * PoC / notes for MediaWiki testing (was: quibble)
 * What I'm blocked on
 * Figuring out a good strategy to run MediaWiki tests in Docker
 * Other?
 * Upstreamed a few bits of CI to mediawiki/core, fixed tests

Chad

 * What I plan to do this week
 * Train
 * Gerrit minor point upgrade (2.14.6 -> 2.14.7)
 * What I'm blocked on
 * Other?

Dan

 * What I plan to do this week
 * Continue to help with incident analysis
 * Get Blubber back on Jenkins agents https://phabricator.wikimedia.org/T186548
 * Commit scap blubber.yaml (blocked on following reviews)
 * What I'm blocked on
 * Needs Review: http://phabricator.wikimedia.org/D999 Allow for configuration policies
 * Needs Review: http://phabricator.wikimedia.org/D1002 Fix ownership on artifact copies
 * Needs Review: http://phabricator.wikimedia.org/D1003 Provide a `runs.insecurely` to be used with test variants
 * Other?

Greg

 * What I plan to do this week
 * Incident analysis+preso
 * sync-up with Victoria and Mark on how to move this forward
 * Travel budget verification
 * What I'm blocked on
 * Other?

Jean-Rene

 * What I plan to do this week
 * What I'm blocked on
 * Other?

Mukunda

 * What I plan to do this week
 * Scap 3.7.7 should be deployed today.
 * More work on swift for phabricator storage.
 * What I'm blocked on
 * Other?

Tyler

 * What I plan to do this week
 * As much puppet as I'm able for integration/minikube stuff
 * code review
 * finish slides
 * What I'm blocked on
 * minikube still
 * Other?

Zeljko

 * What I plan to do this week
 * T185011 Create selenium-MediaWiki-jessie daily Jenkins job
 * T188740 Post mortem for T139740 Port Selenium tests from Ruby to Node.js
 * What I'm blocked on
 * T185094 Update page object pattern in Selenium tests - done in core, waiting for reviews
 * T181284 Replace nodemw with mwbot - done in core, waiting for reviews
 * T180144 Upgrade WebdriverIO to 4.12.0 - done, waiting for reviews
 * T187859 Move one Selenium test from mediawiki/core to mediawiki/skins/Vector - stuck on understanding the problem, asked for help
 * T182691 Selenium tests should be easier to run - had an idea but got stuck on upstream bug https://github.com/webdriverio/webdriverio/issues/2657
 * Other?
 * right hand hurts :'( - typing slowly using left hand
 * T190046 Write Selenium tests in JavaScript/Node.js - session for wikimedia-hackathon-2018
 * T189122 Migrate language-screenshots-VisualEditor off of Nodepool to Docker containers - done - LanguageScreenshotBot uploaded 2k+ screenshots in 90 languages - Antoine did all of the CI work and half of the bot work

Team Kanban Board Review and Triage

 * Closed and touched in the last 7 days
 * No update for 4 weeks
 * No update for 3 weeks
 * No update for 2 weeks
 * No update for 1 week
 * All Open
 * Review To Triage column of #releng
 * Assigned
 * Unassigned

Once / month-ish review of backlog(s)

 * releng Review To Triage column of #releng
 * releng-kanban Review unassigned in kanban
 * releng-kanban Review 'backlog' column of -kanban
 * releng-next - Review for things we need to put on our kanban backlog
 * releng-backlog - oh my, the huge backlog of things...

Kanban stats

 * Burnup chart