NetConnect Germany is one of two transmission system operators in the German natural gas market. With a high-pressure pipeline network of a total length of about 20,000 km, it connects approximately 500 downstream networks and is responsible for the operational management of the market area cooperation. Naturally, this involves incorporating large amounts of data from all distribution networks. So-called “allocation data points” contain spatial and temporal information on physical gas flows and are required by NCG to balance the energy system. Due to technical or manual reasons, it can happen that some of the data points delivered by the distribution networks contain measurement errors.
This is where you can add substantial value to NCG’s operations: Ensuring a consistently high data quality is of tremendous importance, so that NCG’s own analyses and prediction tools are based on truthful information from the beginning. With the help of historical data, including labelled data points that were erroneous, it is your task to develop an application that automatically a) detects erroneous data points b) imputes reasonable values, so that follow-up processes and analyses are not distorted by outliers.