Data Platform Engineering/Data Products/work focus

Data Products Goals
At a high level, our team is currently focused on two hypotheses
 * 2.5.1: "If we develop a centralized experimentation platform that can define, deploy, and get feedback on experiments, Product teams will develop and instrument their features for experimentation."
 * 3.4.1: "If our trusted datasets are all in the same place following the same conventions in dimension semantics, naming, and granularity considerations; it will be easier to combine and extract the data and serve data that can be easily evaluated in terms of privacy."

Sprint Goals
The goals for current sprint are (23/10/03 - 23/10/23)


 * 1) SDS 2.5.1: Prepare to onboard the rest of the team
 * 2) Traffic to all six services routed to AQS 2. AQS is ready to sunset.
 * 3) Technical strategy for Commons Impact Metrics prototype including implementation draft
 * 4) Dumps 2: Bring to complete or pause with a plan for future.
 * 5) Knowledge gaps: pause until we open work on SDS 3.4

Past Sprints
23/09/12 - 23/10/02
 * 1) At least one client library is refactored to include the new data contract (core schema and scheme fragments) and an existing instrument is prototyped [receiving live data?]
 * 2) Did not yet
 * 3) Almost at two client libraries refactored
 * 4) Merge requests not quite landed
 * 5) [Continue] Generate XML dumps for simplewiki
 * 6) Not yet
 * 7) XML generated with everything but data quality issues form input
 * 8) How we import is remaining work
 * 9) 100% of traffic routed to Media, Pageviews [Edit and Editor Analytics next]
 * 10) Media done 🎉
 * 11) Pageviews is waiting on SRE
 * 12) Knowledge Gaps Index metrics receive production traffic
 * 13) Waiting on SRE
 * 14) Data dumps transition has been clearly communicated across stakeholders
 * 15) Done 🎉

23/08/28 - 23/09/08
Generate XML dumps for a simplewiki

Core interaction schema and schema fragments are prototyped and tested in preparation for updating metrics platform client libraries next sprint

100% of traffic routed to Geo and Media Analytics

Identify and mitigate risks associated with MediaWiki History pipeline