Wikimedia Release Engineering Team/Checkin archive/20180903

= 2018-09-03 =

Vacations/Important dates

 * https://office.wikimedia.org/wiki/HR_Corner/Holiday_List
 * How to do it


 * August 31, September 4-5: Dan on vacation
 * September 3 (Monday): US Holiday (Labor Day)
 * September 6 (Thursday, half day) - 7 (Friday) Željko on a conference
 * Mid september - Mid october, Antoine to take off some weeks/days/part time

Train

 * Maniphest query for deployment blocker tasks: https://phabricator.wikimedia.org/maniphest/?project=PHID-PROJ-fmcvjrkfvvzz3gxavs3a&statuses=open%28%29&group=none&order=newest#R


 * July 02 - wmf.11 - Zeljko - no train, Fourth of July
 * July 09 - wmf.12 - Zeljko
 * July 16 - wmf.13 - Zeljko
 * July 23 - wmf.14 - Zeljko
 * July 30 - wmf.15 - Mukunda
 * Aug 06 - wmf.16 - Mukunda
 * Aug 13 - wmf.17 - Mukunda (No train - Wednesday is a holiday)
 * Aug 20 - wmf.18 - Tyler
 * Aug 27 - wmf.19 - Dan && Antoine lurking over the shoulders
 * Sep 03 - wmf.20 - Antoine
 * Sep 10 - wmf.21 - Antoine -- No train due to DC switchover
 * Sep 17 - wmf.22 - Antoine
 * Sep 24 - wmf.23 - Zeljko (only one week for me?)
 * Oct 01 - wmf.24 - Dan
 * Oct 08 - wmf.25 - Dan -- No train due to DC switchover
 * Oct 15 - wmf.26 - Mukunda (last 1.32 wmf.XX release, 1.33 starts the next week)
 * Oct 22 - wmf.1 - Mukunda

SoS

 * July 04 - Dan
 * July 11 - Antoine
 * July 18 - Antoine
 * July 25 - Tyler
 * Aug 01 - Tyler
 * Aug 08 - Zeljko
 * Aug 15 - Dan (No SoS this week)
 * Aug 22 - Zeljko
 * Aug 29 - Zeljko
 * Sep 05 - Tyler / Željko
 * Sep 12 - Tyler
 * Sep 19 - Dan
 * Sep 26 - Dan
 * Oct 03 - Zeljko
 * Oct 10 - Zeljko
 * Oct 17 - Antoine
 * Oct 24 - Antoine
 * Oct 31 - Mukunda

Hiring

 * Accepted!

First Offsite
Details:
 * Week of December 3rd
 * At the Queen Mary hotel in Long Beach
 * Deb T will be facilitating

Needs attention

 * Re-evaluate use of "Dependent Pipeline" in Zuul for gate-and-submit - https://phabricator.wikimedia.org/T94322
 * 2018-08-24 TODO: antoine and dan had followups IIRC
 * https://docs.google.com/document/d/1Sdqe5vCWu5UOVWIFyL_9vhzwMfwZzsRe2WP0uHn6pms/edit#

Scrum of Scrums

 * Greg to copy to etherpad after meeting: https://etherpad.wikimedia.org/p/Scrum-of-Scrums

Release Engineering

 * Blocked by:
 * Blocking:
 * Updates:
 * Train:
 * 1.32.0-wmf.20 at group 1, no problems
 * on European time this week
 * No train next week, DC switchover
 * Log Health:
 * Exception thrown for failure to save settings appears ~ 1000 times/day: https://phabricator.wikimedia.org/T202149
 * labtestweb2001 is sending updates to a read-only db host: db2037: https://phabricator.wikimedia.org/T201082
 * ErrorException from line EducationProgram PHP Notice: Undefined variable: retValue: https://phabricator.wikimedia.org/T203577
 * labtestweb2001 is sending updates to a read-only db host: db2037: https://phabricator.wikimedia.org/T201082
 * ErrorException from line EducationProgram PHP Notice: Undefined variable: retValue: https://phabricator.wikimedia.org/T203577

Release Engineering

 * Blocked by:
 * Noise from https://phabricator.wikimedia.org/T201082 during Train deployment (not really blocked but distracted)
 * Blocking:
 * Updates:
 * Train: no major problems, 1.32.0-wmf.19 at group 0 https://phabricator.wikimedia.org/T191065 https://tools.wmflabs.org/versions/
 * Log spam: Unknown modifier 'R': [/^page\-User\:BeneBot.+/RfD\-open/text$/] in /srv/mediawiki/php-1.32.0-wmf.16/extensions/Translate/stringmangler/StringMatcher.php https://phabricator.wikimedia.org/T202058
 * Log spam: Unknown modifier 'R': [/^page\-User\:BeneBot.+/RfD\-open/text$/] in /srv/mediawiki/php-1.32.0-wmf.16/extensions/Translate/stringmangler/StringMatcher.php https://phabricator.wikimedia.org/T202058

Train status and happenings

 * https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Roles#Train_Conductor

Past week status updates

 * All of it in table form: https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Goals/201718Q4

Pipeline: Move verify stage from Minikube to CI k8s namespace in production context

 * tracking task

Code Health

 * T199253 - Investigate and propose record of origin (ROO) for deployed code (currently Developers/Maintainers page)
 * Perform existing Stewardship review process for Q1 cycle.
 * T199254 - Add test evaluation to post mortem review process.
 * Review existing e2e test coverage.
 * Define prioritization scheme.
 * Prioritize e2e testing gaps.
 * T199257 - make current unit testing coverage more visible by reporting out to Engineering Management.
 * T199259 - Platform and Search Platform teams are using TDM PoC
 * T199262 - Identify key Tech Debt areas
 * T199263 - Put in place Tech Debt management process for PEP
 * T199261 - Define base Code Health metric set.

Developer Productivity

 * Make a hire to create the capacity needed for this program.
 * Write and share a survey to measure developer satisfaction and areas for investment. -

Selenium

 * Q1 goals task: T198389 Q1 Selenium framework improvements
 * T179188 Video recording for Selenium tests in Node.js - code in final review https://gerrit.wikimedia.org/r/c/mediawiki/core/+/422933
 * T185011 Create MediaWiki-npm-browser-run-selenium-test-node-6-docker daily Jenkins job - the job is running but failing because beta cluster is different than other environments, will run just tests that pass for now https://integration.wikimedia.org/ci/job/selenium-daily-beta-MediaWiki/
 * T188742 Run tests daily targeting beta cluster for all repositories with Selenium tests - focusing on moving Nodepool jobs to Docker, then adding more jobs https://gerrit.wikimedia.org/r/c/integration/config/+/443931

Phabricator

 * Elastic 6 support: Mostly finished now - https://phabricator.wikimedia.org/T181393 ( https://phabricator.wikimedia.org/D1109 )
 * I'd still like to make a cleaner way to roll this out without reindexing but elastic 6 compat work is done.

Antoine

 * What I plan to do this week
 * What I'm blocked on
 * Other?
 * Other?
 * Other?

Dan

 * What I plan to do this week
 * What I'm blocked on
 * Other?
 * Other?
 * Other?

Greg

 * What I plan to do this week
 * What I'm blocked on
 * Other?
 * Other?
 * Other?

Jean-Rene

 * What I plan to do this week
 * What I'm blocked on
 * Other?
 * Other?
 * Other?

Mukunda

 * What I plan to do this week
 * Document/Improve `scap swat` https://phabricator.wikimedia.org/T196411
 * Start on the survey for https://phabricator.wikimedia.org/T197635
 * What I'm blocked on
 * Other?
 * Other?

Tyler

 * What I plan to do this week
 * Add Eval.JIT=1 to rebuildLocalisationCache.php in scap
 * Start work on list of services for "staging"
 * tmpfs for /tmp dir in quibble
 * What I'm blocked on
 * Blubber v0.5.0 update, will maybe poke the task
 * Other?

Zeljko

 * What I plan to do this week
 * T179188 Video recording for Selenium tests in Node.js - code in final review https://gerrit.wikimedia.org/r/c/mediawiki/core/+/422933
 * T185011 Create MediaWiki-npm-browser-run-selenium-test-node-6-docker daily Jenkins job - the job is running but failing because beta cluster is different than other environments, will run just tests that pass for now https://integration.wikimedia.org/ci/job/selenium-daily-beta-MediaWiki/
 * T188742 Run tests daily targeting beta cluster for all repositories with Selenium tests - focusing on moving Nodepool jobs to Docker, then adding more jobs https://gerrit.wikimedia.org/r/c/integration/config/+/443931
 * Testival conference http://www.testival.eu/
 * What I'm blocked on
 * Antoine is owner of this, so I can't un-WIP it, I think it's ready for review https://gerrit.wikimedia.org/r/c/integration/config/+/443931
 * Other?

Team Kanban Board Review and Triage

 * closed and touched in the 7 days
 * No update for 4 weeks
 * No update for 3 weeks
 * No update for 2 weeks
 * No update for 1 week
 * All Open
 * Review To Triage column of #releng
 * Assigned
 * Unassigned

Once / month-ish review of backlog(s)

 * releng Review To Triage column of #releng
 * releng-kanban Review unassigned in kanban
 * releng-kanban Review 'backlog' colum of -kanban
 * releng-next - Review for things we need to put on our kanban backlog
 * releng-backlog - oh my, the huge backlog of things...

Kanban stats

 * Burnup chart