Data Platform Engineering/Data Engineering

The Data Enginering team is responsible for the core capabilities of the data platform, including data storage, batch and streaming infrastructure, and distributed query engines. This platform supports ingestion of Wikimedia project content, web traffic, instrumentation, operational data and other datasets into the Data Lake. The team manages the foundational data pipelines, whereas the data producers manage their respective data pipelines and data products. The team's responsibilities include data quality, observability, and discoverability.

Contact Us
Please see the Intake Process page to make a request or contact one of our Product Managers.