Wikimedia Release Engineering Team/Checkin archive/20170906

= 2017-09-06 =

Vacations/Important dates

 * https://office.wikimedia.org/wiki/HR_Corner/Holiday_List
 * How to do it


 * Sept 18-29: WMF "Remote and Relaxation" weeks
 * October 6 (Friday): Željko on a conference
 * Oct 9 - Indigenous People's Day
 * November 1 (Wednesday): Željko local holiday (All Saints' Day)
 * Nov 10 (Fri) - Veteran's Day
 * Nov 23+24 - Thanksgiving
 * Dec 25-Jan 1 - End of year/new year holidays

Rotating positions and absences
Maniphest query for deployment blocker tasks: https://phabricator.wikimedia.org/maniphest/?project=PHID-PROJ-fmcvjrkfvvzz3gxavs3a&statuses=open%28%29&group=none&order=newest#R

Aug 28 and Sept 4

 * Train: Chad
 * wmf.16
 * wmf.17
 * SoS: Tyler
 * Out
 * September 1 - Željko on a conference
 * Sept 1 - Antoine
 * Sept 4 - Labor Day

Sep 11 and Sept 18

 * Train: Chad
 * wmf.18
 * wmf.19
 * SoS: Mukunda
 * Out

Sep 25 and Oct 2

 * Train: Chad
 * wmf.20
 * wmf.21
 * SoS: Tyler
 * Out

Actions from last meeting

 * TODO: incident report for last week's crappy "can't roll forward, can't rollback" situation
 * Tyler: outline of it/start it, hand off to jaime/aaron/ladsgroup ✅
 * https://wikitech.wikimedia.org/wiki/Incident_documentation/20170814-Train

This week

 * Blocking
 * Blocked
 * discovery/analytics -> scap3?
 * Updates
 * Updates

Last week

 * Blocking
 * Blocked
 * Updates
 * Selenium Ruby framework deprecation announce: https://lists.wikimedia.org/pipermail/wikitech-l/2017-August/088653.html
 * End of October deprecation date.
 * Selenium Ruby framework deprecation announce: https://lists.wikimedia.org/pipermail/wikitech-l/2017-August/088653.html
 * End of October deprecation date.

Logspam \ Last week's train updates

 * https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Roles#Train_Conductor


 * Yes, some logspam for wmf.17, see task

Other Team Business

 * Q2 Planning:
 * https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Goals/201718Q2
 * P1O1O1: Prioritize git-fat work, push back MW+scap3 until after that, Chad and Mukunda to talk and fill out plan
 * P1O5O1: Greg to email Mark re CI cluster, Antoine to write what the first part is and fill in for our "work" plan
 * P1O5M1: good here
 * P1O6M1: on-going maintenance and upgrades
 * P1O6M2: on-going work to move to modern hardware and cross-datacenter resiliency
 * P3: Greg Add Outcome 2 ojectives 1&2 - ✅
 * P6: automated deployments, premature? Determine how developer feedback looks... create build stage of the pipeline with developer feedback
 * build test image, and running the entry point for the associated tests in CI (where?)
 * see: https://etherpad.wikimedia.org/p/blubberlist

Program 6: Streamlined service delivery

 * Define functional tests for Mathoid running on the staging Kubernetes cluster for use in future gating decisions -
 * Define method for monitoring and reacting to the above functional tests -
 * Workboard: https://phabricator.wikimedia.org/project/view/2453/


 * Cleaned up workboards and created more tasks for tracking build-stage work
 * Blubber patches up for remaining prereqs (`arc list`):
 * Needs Review http://phabricator.wikimedia.org/D759: Define `NODE_ENV` and always define `NODE_PATH`
 * Needs Review http://phabricator.wikimedia.org/D768: Support `copies` config entry for multi-stage builds
 * Needs Review http://phabricator.wikimedia.org/D769: Smarter copies/sharedvolume/default behavior

Deprecate use of Trebuchet across production -

 * https://phabricator.wikimedia.org/T129290

iegreview *may* go this week scholarships Soon™ jobrunner now has a patch so that should be ready whenever Krinkle or someone has time to confirm that a deploy works

Migrate majority of developers to JavaScript based browser test framework (webdriver.io) -

 * Things are moving, not sure if it's e-mail announcements or SoS or something else or all of the above.
 * T174595 ContentTranslation: Migrate browser tests from Ruby to node.js
 * No need to migrate tests, just deleted Ruby tests.
 * T171852 WebdriverIO tech talk
 * stalled, Rachel was on vacation

Quality improvements

 * Technical Debt
 * TD is officially a program
 * Continued developing series of blog posts
 * minor change - first blog post will be include CH/CHG to set the stage
 * Continued review of TD phab
 * Provided overview of Tech Debt and "top 5" approach to Tech Mgmt
 * Post Mortems
 * Had Wikidata Post Mortem
 * Biggest outcome - fix wikidata build fixed/redesigned.
 * Scheduled for next week - Post Mortem - T173462: Cannot flush pre-lock snapshot because writes are pending

Browser tests (non quaterly goal stuff)

 * T164479 For MediaWiki extensions, merge rubocop inside mwext-mw-selenium-jessie
 * Done on CI side. Deployed for Echo. Will cleanup other repos in the next few weeks.

Docker for CI
> But overall the image is 4.2GBytes which is too large. We need to split.
 * diskimage-builder produced a CI image for docker
 * potentially "docker import" can import a qcow2 image

Misc CI

 * php5.5 jobs moving from Trusty to Jessie

Team Kanban Board Review and Triage

 * closed and touched in the 7 days
 * No update for 4 weeks
 * No update for 3 weeks
 * No update for 2 weeks
 * No update for 1 week
 * All Open
 * Review To Triage column of #releng


 * Assigned
 * Unassigned

Once / month-ish review of backlog(s)

 * releng Review To Triage column of #releng
 * releng-kanban Review unassigned in kanban
 * releng-kanban Review 'backlog' colum of -kanban
 * releng-next - Review for things we need to put on our kanban backlog
 * releng-backlog - oh my, the huge backlog of things...

Kanban stats

 * Burnup chart