Wikimedia Release Engineering Team/Checkin archive/20170807

= 2017-08-07 =

Vacations/Important dates

 * https://office.wikimedia.org/wiki/HR_Corner/Holiday_List
 * How to do it


 * August 3-9: Željko vacation
 * August 7-25: Antoine relocating and vacation
 * August 8-15: Greg @ Wikimania & Tech-mgrs F2F
 * August 9-13: Wikimania
 * Aug 9-13: Dan on vacation
 * Aug 11-14: Chad maybe on vacation
 * Aug 14th: thcipriani Birthday!
 * Aug 15th - WMF Monthly day off (random)
 * Aug 17th: Mukunda - court again
 * Aug 21st - thcipriani eclipse!
 * Sept 4 - Labor Day
 * Oct 9 - Indigenous Peoples' Day

Rotating positions and absences
Maniphest query for deployment blocker tasks: https://phabricator.wikimedia.org/maniphest/?project=PHID-PROJ-fmcvjrkfvvzz3gxavs3a&statuses=open%28%29&group=none&order=newest#R
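The query link above can be rebuilt from its parameters, which may help when pointing the same search at another project. This is only a sketch: the function name and defaults are made up for illustration, and the parameter names are simply the ones visible in the URL, not a documented API.

```python
from urllib.parse import urlencode

# Base of the Maniphest search page, taken from the link in these notes.
MANIPHEST_URL = "https://phabricator.wikimedia.org/maniphest/"


def blocker_query_url(project_phid, statuses="open()", group="none", order="newest"):
    """Build a Maniphest search URL (hypothetical helper).

    The parameter names mirror the query string in the notes; they are
    assumed from that link, not taken from Phabricator documentation.
    """
    params = {
        "project": project_phid,
        "statuses": statuses,
        "group": group,
        "order": order,
    }
    # urlencode percent-encodes the parentheses in "open()".
    return MANIPHEST_URL + "?" + urlencode(params) + "#R"


url = blocker_query_url("PHID-PROJ-fmcvjrkfvvzz3gxavs3a")
print(url)
```

With the project PHID from the notes, this should reproduce the link as written.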

July 31 and Aug 7

 * Train: Mukunda
 * wmf.12
 * wmf.13
 * SoS: Chad
 * Out:
 * August 3-9: Željko vacation
 * August 7-25: Antoine relocating and vacation
 * August 8-15: Greg @ Wikimania & Tech-mgrs F2F
 * August 9-13: Wikimania
 * Aug 10-13: Dan on vacation
 * Aug 11-13: Chad maybe on vacation

Aug 14 and Aug 21

 * Train: Tyler
 * wmf.14
 * Tuesday: Chad doing branch cut/group0
 * wed/thur: Tyler
 * wmf.15
 * SoS: Mukunda
 * Out:
 * August 7-25: Antoine relocating and vacation
 * Aug 14th: thcipriani Birthday!
 * Aug 15th - WMF Monthly day off
 * Aug 17th: Mukunda court :-/
 * Aug 21st - thcipriani eclipse!

Aug 28 and Sept 4

 * Train: Chad
 * wmf.16
 * wmf.17
 * SoS: Tyler
 * Out

Actions from last meeting

 * TODO: Greg email mark/faidon/moritz plan for week after wikimania
 * DONE

This week
REMINDER: We missed last week


 * Blocking
 * Blocked
 * Updates
 * Updates
 * Updates

Last week

 * Blocking
 * Blocked
 * Need feedback on https://phabricator.wikimedia.org/T129148#3482379 from Ops
 * Updates
 * Updates

Logspam / Last week's train updates

 * https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Roles#Train_Conductor


 * Wikidata: https://etherpad.wikimedia.org/p/1.30-wmf.12-postmortem
 * Performance regression: https://phabricator.wikimedia.org/T172447

Other Team Business

 * Security announce lists -> our team list?
 * see also: Jenkins Security advisory pre-announcement
 * If yes, which services?
 * Jenkins - ✅
 * kinda... the whitelisted jenkins-security@googlegroups email didn't send the actual advisory :(
 * Gerrit - no such list, wtf?
 * nodepool/zuul?
 * Phabricator (is there one?)


 * Deployment process improvements:
 * https://wikitech.wikimedia.org/w/index.php?title=Deployments&action=historysubmit&type=revision&diff=1765924&oldid=1765923
 * https://wikitech.wikimedia.org/wiki/Deployments/Holding_the_train
 * Emailed link out with last week's rollback

Program 6: Streamlined service delivery

 * Define functional tests for Mathoid running on the staging Kubernetes cluster for use in future gating decisions -
 * Define method for monitoring and reacting to the above functional tests -


 * we have registry creds; ops (joe) have pwstore access
 * meeting later today...

Deprecate use of Trebuchet across production -

 * https://phabricator.wikimedia.org/T129290


 * keyholder: "Too many authentication failures"
 * plan for jobrunner, methinks
 * ahem waiting on authors :P https://phabricator.wikimedia.org/D734

Quality improvements

 * Tech Debt
 * Created wiki page for Tech Debt Program
 * Need to break out some content and put it in the existing TD wiki page
 * Code Health
 * Created Code Health wiki page
 * Created Code Health Group wiki page
 * scheduled next CHG core meeting

Phabricator

 * Phabricator is migrated to phab1001.eqiad.wmnet
 * Tested phab2001.codfw.wmnet - we are now very close to being ready for migration to Dallas
 * Found and fixed a bunch of stupid stuff in the process, hopefully we won't hit as many problems with codfw.
 * Phab email was down for quite a while. Daniel and I were up past midnight fixing things :-/

Docker for CI

 * operations/puppet now live :)
 * https://grafana.wikimedia.org/dashboard/db/zuul?panelId=13&fullscreen&orgId=1&from=now-7d&to=now-5m
 * https://integration.wikimedia.org/ci/job/operations-puppet-tests-docker/buildTimeTrend
 * Documented: https://www.mediawiki.org/wiki/Continuous_integration/Docker

Team Kanban Board Review and Triage

 * Closed and touched in the last 7 days
 * No update for 4 weeks
 * No update for 3 weeks
 * No update for 2 weeks
 * No update for 1 week
 * All Open
 * Review To Triage column of #releng


 * Assigned
 * Unassigned
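The "No update for N weeks" buckets above could be computed along these lines. This is a rough sketch under stated assumptions: the task IDs and dates are invented, and the way the last-update date would be fetched from Phabricator is left out entirely.

```python
from datetime import date


def weeks_stale(last_updated, today):
    """Whole weeks since the last update (never negative)."""
    return max(0, (today - last_updated).days // 7)


def bucket_tasks(tasks, today, max_weeks=4):
    """Group (task_id, last_updated) pairs by weeks without an update.

    Bucket 0 holds tasks touched in the last 7 days; anything at or
    beyond max_weeks lands in the final bucket.
    """
    buckets = {n: [] for n in range(max_weeks + 1)}
    for task_id, last_updated in tasks:
        n = min(weeks_stale(last_updated, today), max_weeks)
        buckets[n].append(task_id)
    return buckets


# Hypothetical tasks; the IDs and dates are made up for illustration.
tasks = [
    ("T1", date(2017, 8, 3)),
    ("T2", date(2017, 7, 28)),
    ("T3", date(2017, 7, 1)),
]
buckets = bucket_tasks(tasks, today=date(2017, 8, 7))
for weeks, ids in sorted(buckets.items(), reverse=True):
    print(weeks, ids)
```

Walking the buckets from oldest to newest matches the order of the review list.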

Once / month-ish review of backlog(s)

 * releng Review To Triage column of #releng
 * releng-kanban Review unassigned in kanban
 * releng-kanban Review 'backlog' column of -kanban
 * releng-next - Review for things we need to put on our kanban backlog
 * releng-backlog - oh my, the huge backlog of things...

Kanban stats

 * Burnup chart