Wikimedia Release Engineering Team/Checkin archive/20180212

= 2018-02-12 =

Vacations/Important dates

 * https://office.wikimedia.org/wiki/HR_Corner/Holiday_List
 * How to do it


 * Feb 2nd: Mukunda vacation day
 * Feb 19 (Mon): President's Day (All US Staff)
 * Mar 30 (Fri): WMF Holiday
 * April 2: Željko (Holidays in Croatia - Easter Monday)
 * April 14 (Fri): WMF Holiday
 * May 1: Željko (Holidays in Croatia - Labor Day / May Day)
 * May 15?/16/17: Team offsite in Barcelona
 * May 18-20: Wikimedia Hackathon in Barcelona
 * May 21 (Mon): Tech-Mgt F2F
 * May 31: Željko (Holidays in Croatia - Corpus Christi)

Train

 * Maniphest query for deployment blocker tasks: https://phabricator.wikimedia.org/maniphest/?project=PHID-PROJ-fmcvjrkfvvzz3gxavs3a&statuses=open%28%29&group=none&order=newest#R


 * Feb 05 - wmf.20 - Chad
 * Feb 12 - wmf.21 - Mukunda
 * Feb 19 - wmf.22 - Mukunda
 * Feb 26 - wmf.23 - Tyler
 * Mar 05 - wmf.24 - Tyler
 * Mar 12 - wmf.25 - Chad
 * Mar 19 - wmf.26 - Chad
 * Mar 26 - wmf.27 - Mukunda
 * Apr 02 - wmf.28 - Mukunda
 * Apr 09 - wmf.29 - Tyler
 * Apr 16 - wmf.30 - Tyler

SoS

 * Feb 05 - Tyler
 * Feb 12 - Chad
 * Feb 19 - Chad
 * Feb 26 - Mukunda
 * Mar 05 - Mukunda
 * Mar 12 - Tyler
 * Mar 19 - Tyler
 * Mar 26 - Chad
 * Apr 02 - Chad
 * Apr 09 - Mukunda
 * Apr 16 - Mukunda

Annual Plan

 * https://office.wikimedia.org/wiki/Release_Engineering/FY1819-Planning
 * https://office.wikimedia.org/wiki/Release_Engineering/FY1819-Planning/Continuous_Deployment_pipeline
 * https://office.wikimedia.org/wiki/Release_Engineering/FY1819-Planning/Code_Health
 * https://office.wikimedia.org/wiki/Release_Engineering/FY1819-Planning/Developer_productivity

Mid-year check-ins
A basic check-in: go over what we hoped we'd do when we talked back at the annual review.

Then, the outline from T&C:
 * Celebrate accomplishments (highs):
 * What can we learn from (lows):
 * What goals or projects have been completed? What went well, what could have been better?
 * What goals/responsibilities are still ahead? Is any support needed to help get them done? Does any work need to be adjusted (due to departmental/organizational change)?
 * Any peer feedback wanted or given? (Feel free to do this casually via email or conversation.)
 * [Greg: Let me know if anyone wants to do some kind of "ask your peers for feedback", I can set something up.]

Please come prepared to talk about those items; I'll be filling out a document we can look at during the meeting, just for our records. These documents are not shared with T&C (T&C will only check in with managers to verify that the check-ins happened, not what was discussed).

Scrum of Scrums

 * Greg to copy to etherpad after meeting: https://etherpad.wikimedia.org/p/Scrum-of-Scrums

Release Engineering

 * Blocking
 * None?
 * Blocked
 * Updates

Last week

 * Blocking
 * Release Engineering - Need some input on https://phabricator.wikimedia.org/T179552 relating to Docker browser test image
 * Blocked
 * None
 * Updates:
 * Whole team back from offsite, slowly recovering from jet lag
 * wmf.20 rolling out this week as normal, deployments back to normal

Train status and happenings

 * https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Roles#Train_Conductor


 * Last week was **bad** (lots of failures, mostly fixed)


 * Two weeks off is a bummer
 * replag and operational issues
 * scary stuff is fixed, but there are stragglers
 * wmf.20 task still open for the stragglers
 * None of these are blockers for wmf.21

Past week status updates

 * All of it in table form: https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Goals/201718Q3

Program 1: Outcome 5: Milestone 1: Develop and migrate to a JavaScript-based browser testing stack

 * Due: End of this quarter
 * What: Specific improvements to the now canonical framework, see: task T182421, notably:
 * Upgrade WebdriverIO to version 4.9
 * Investigate replacing nodemw with mwbot
 * Video recording for Selenium tests in Node.js
 * Task:


 * no progress, catching up (including on sleep) all week :(


 * ORES and Phabricator now have Selenium tests
 * https://phabricator.wikimedia.org/diffusion/EORS/browse/master/tests/selenium/
 * https://phabricator.wikimedia.org/source/phab-deployment/browse/wmf%252Fstable/tests/selenium/

Program 1: Outcome 5: Objective 1: Maintain existing shared Continuous Integration infrastructure

 * Goals
 * Draft requirements for a Kubernetes based solution for CI -
 * Migrate MediaWiki PHPUnit tests to Shipyard (docker-based CI) (~40% of Nodepool usage) -
 * Unify production and CI docker image build process -
 * ✅ 01/15

Program 3: Outcome 1: Objective 2: Identify and find stewards for high-priority/high use code segment orphans

 * Due: End of quarter


 * Next steps: apply feedback from discussion with Toby/Victoria to stewardship collateral. Start broad communication. Start filling gaps (fix inaccuracies first, then for those that truly have no stewards, pull them into the stewardship review process).

Program 3: Outcome 2: Objective 2: Define and implement a process to regularly address technical debt across the Foundation

 * Due: End of quarter

 * Next steps: finish blog series on the topic, set up regular SIG discussions on Tech Debt, reach out to teams to get in sync with how they are addressing existing tech debt.
 * no progress.

Program 3: Outcome 2: Objective 3: Promote and surface important technical debt topics at large gatherings of Wikimedia developers (e.g., DevSummit and Hackathon(s))

 * Due: End of next quarter


 * planning session for the May Hackathon

Program 6: Outcome 2: Objective 2: Set up a continuous integration and deployment pipeline

 * Due: End of this quarter
 * Keyword: SSD
 * phab project: https://phabricator.wikimedia.org/project/view/2453/
 * Goal:
 * Verify basic functionality of 'production' deployment and image (initially targeting mathoid):
 * Functional PoC within integration in the deployment-pipeline
 * Deploy to isolated k8s


 * thcipriani: still packaging minikube, no progress last week
 * Mathoid pipeline broken due to permissions, will discuss at next meeting https://phabricator.wikimedia.org/T186790
 * service-checker image merged! https://phabricator.wikimedia.org/T184220
 * need to get blubber back on CI labs instances
 * will work on integrating helm and helm test into pipeline script
 * need to package helm I guess :/

Program 1: Outcome 1: Objective 1: Scap (Tech Debt Sprint FY201718-Q2)

 * workboard

Program 1: Outcome 6: Milestone 2: Maintain Phabricator

 * Streamline logspam workflows by adding some integration with phabricator
 * Store git-lfs (and other phab uploads) in swift:

Antoine

 * What I plan to do
 * Migrate long tail of Nodepool based jobs to Docker containers
 * What I'm blocked on
 * Running MediaWiki tests in a Docker container. But I haven't tried hard enough yet :D
 * Other?
 * Barely looking at Phabricator/IRC/email. Trying to focus.

Chad

 * What I plan to do
 * Wrapping up some gerrit stuff
 * Playing with mw-in-docker (I had a fun Saturday reading about helm)
 * Kill extension-list-labs: https://gerrit.wikimedia.org/r/c/409750/
 * What I'm blocked on
 * Nothing really
 * Other?

Dan

 * What I plan to do
 * Continuing to integrate `helm test` into pipeline script
 * Service checker image was merged!
 * Going to work on getting blubber back on CI labs instances
 * Maybe I can start working on helm packaging
 * What I'm blocked on
 * Not technically blocked but need to work through blubber/python implementation decisions around image layer caching
 * Other?

Greg

 * What I plan to do
 * Get our Annual Plan segments into shape
 * meetings with others in Technology
 * What I'm blocked on
 * SRE on some annual plan program drafting
 * Other?
 * I've been having Annual Plan related dreams/nightmares, and clenching my jaw, way too much lately

Jean-Rene

 * What I plan to do
 * Summarize Stewardship Review discussion and set up meeting with Victoria and Toby
 * Plan/Prepare for Legacy encoding post mortem
 * Set up post mortem for MediaWiki incident
 * Update stewardship content with feedback from Victoria/Toby
 * Logspam education
 * tech debt blog posts
 * What I'm blocked on
 * Other?

Mukunda

 * What I plan to do
 * train: 1.31.0-wmf.21 - https://phabricator.wikimedia.org/T183960
 * Phabricator deployment on Wednesday.
 * This one might be a pain because there are a lot of database migrations.
 * Hopefully start working on swift for phabricator file storage which will enable us to use git-lfs in phab.
 * What I'm blocked on
 * No blockers
 * Other?

Tyler

 * What I plan to do
 * Minikube packaging work
 * scap loose-ends
 * What I'm blocked on
 * https://phabricator.wikimedia.org/D970 review
 * Creating project in gerrit ✅
 * Other?
 * Mathoid: see the pipeline permissions issue noted under Program 6 above (https://phabricator.wikimedia.org/T186790)

Željko

 * What I plan to do
 * T183456 Continuous integration for mediawiki-vagrant
 * T181284 Investigate replacing nodemw with mwbot
 * T182692 Document differences between Ruby and Node.js Selenium frameworks
 * T185094 Update page object pattern in Selenium tests
 * T185315 selenium-EXTENSION-jessie Jenkins job should have e-mail notification
 * What I'm blocked on
 * Chad/Mukunda/Tyler: how to run smoke tests during deployment?
 * Other?

Team Kanban Board Review and Triage

 * Closed and touched in the last 7 days
 * No update for 4 weeks
 * No update for 3 weeks
 * No update for 2 weeks
 * No update for 1 week
 * All Open
 * Review To Triage column of #releng
 * Assigned
 * Unassigned

Once / month-ish review of backlog(s)

 * releng Review To Triage column of #releng
 * releng-kanban Review unassigned in kanban
 * releng-kanban Review 'backlog' column of -kanban
 * releng-next - Review for things we need to put on our kanban backlog
 * releng-backlog - oh my, the huge backlog of things...

Kanban stats

 * Burnup chart