Wikimedia Cloud Services team/Onboarding Arturo/Sessions

= Arturo and Chase Onboarding Sessions =

Oct 31, 2017

 * 2017-11-01 is a public holiday for Arturo
 * We should get some/all of these for the next few months on the team calendar
 * Arturo trying to understand which servers are physical, which are virtual, and how they link together
 * Wants a map of how things fit together
 * Nick poked Arturo about setting up his User page on metawiki

https://wikitech.wikimedia.org/wiki/Ganeti <--- KVM + DRBD
 * Chase to find the newly formed ongoing topographical docs
 * Everything is physical *except* Cloud VPS tenents and a few things on Ganeti in "production"


 * https://tools.wmflabs.org/openstack-browser/project/
 * https://tools.wmflabs.org/openstack-browser/project/tools <-- all of the VMs in Toolforge
 * names of the vms give a hint to what they do:
 * tools-k8s-* -- kubernetes core services
 * tools-docker-* -- kubernetes related Docker hosts (Docker registry, Docker image builder host)
 * tools-worker-* -- kubernetes exec nodes
 * tools-paws-* -- a second kubernetes cluster that powers PAWS , run by Yuvi
 * tools-exec-* -- Grid Engine execution nodes for "normal" tasks
 * tools-webgrid-* -- Grid Engine execution nodes for "web" tasks


 * https://quarry.wmflabs.org/
 * http://paws.wmflabs.org/

Oct 26, 2017
https://phabricator.wikimedia.org/T179024
 * topics?
 * I've been working on this task today: nfsiostat diamond collector

To test a patch, depool a node and test in a node: https://phabricator.wikimedia.org/P6194

Puppet (how does it work)

 * LDAP is the "same sign-on" solution for all things that are not MediaWiki
 * Unix user accounts outside of Cloud VPS are not connected directly to LDAP
 * Data is managed by Puppet based on modules/admin/data/data.yaml


 * puppetmaster1001.eqiad.wmnet
 * puppet-merge

y/n?

new installs
https://wikitech.wikimedia.org/wiki/Server_Lifecycle#Installation


 * New installs
 * Getting the MAC address for a new server
 * https://wikitech.wikimedia.org/wiki/Platform-specific_documentation
 * https://wikitech.wikimedia.org/wiki/Server_Lifecycle

New server: foo.eqiad.wmnet management network: foo.mgmt.eqiad.wmnet management network: .eqiad.wnet == mgmt

https://wikitech.wikimedia.org/wiki/Platform-specific_documentation/HP_DL3N0 show system1/network1/Integrated_NICs

files/dhcpd/linux-host-entries.ttyS1-115200:host labcontrol1001 { # onboard management Host *.mgmt.*.wmnet StrictHostKeyChecking ask UserKnownHostsFile /Users/cpettet/.ssh/wmf_mgmt_hosts https://gerrit.wikimedia.org/r/#/admin/projects/operations/dns

https://phabricator.wikimedia.org/diffusion/

baham.eqiad.wment authdns-update

From pupetmaster1001: new-install

Bastions

 * Bastions (protected bastion)
 * https://wikitech.wikimedia.org/wiki/Production_shell_access

restricted.bastion.wmflabs.org

toolforge <-- own bastion

---

Cloud VPS project request instructions -- https://phabricator.wikimedia.org/project/view/2875/

Openstack vs Horizon vs Toolsadmin

 * OpenStackManager -- https://www.mediawiki.org/wiki/Extension:OpenStackManager
 * Horizon -- https://docs.openstack.org/horizon/latest/
 * Toolsadmin (codename: Striker) -- https://wikitech.wikimedia.org/wiki/Toolsadmin.wikimedia.org