Skip to content

Data and metadata

Source

Background data (the Norwegian Energy Performance Certificates (EPC) dataset) is a component of the Norwegian Energy Labelling System for Houses and Dwellings. Certification carried out by Enova and managed by the Norwegian Water Resources and Energy Directorate. Currently, the available dataset contains approximately 79 000 EPC records that meet the following criteria:

  • The EPC has been issued before September 2020;
  • The EPC specifies the total (across all energy sources) annual measured/reported energy use of the certified unit;
  • The certified unit belongs to one of 74 municipalities with the total EPC records count larger than 100;
  • Energy use/intensity is in range of sensible values for the units of given type/age/size/administrative affiliation. This means that the dataset has been subject to standard outlier removal procedures. These procedures are intended to mitigate the implications of errors made during certification on the analytical conclusions.

Metadata

The dataset contains variables/features of categorical and numerical types:

  • Categorical
    • City;
    • Building type.
  • Numerical:
    • Construction year [CY];
    • Heated floor area [HFA] (m2);
    • Total annual energy use [EU] (kWh·y-1);
    • Energy intensity [EI] (kWh·y-1·m-2).

Data slicing

Built Stock Explorer supports examining a comprehensive dataset in smaller subsets of interest by controlling the variables. The control of categorical variables is made available through the dropdown menus where the selection of multiple values is supported. Numerical variables are controlled using the range sliders that define the upper and the lower limits per variable. These sliders and menus are nested under the "Dataset" tab (Fig. 1). Dropdown menus support searching while typing with Latin or Norwegian alphanumeric characters. "City" menu contains a list of 74 municipalities sorted by the number of EPC records they are associated with. "Type" menu contains a list of building types that are available given the constraints specified by the "City" menu and all three sliders. A prefix "NR."/"RE." is used as a convention to denote Non-Residential and Residential building types. Range sliders have either linear (Construction year) or logarithmic ("Heated floor area (sq.m.)" and "Total energy use (kWh/year)") scales. The "Subset totals" section displays the summary of the dataset that is active given the user-defined constraints: total number of records, the sum of heated floor area and the sum of total energy use for these records.

datatab
Fig. 1: The components of the "Dataset" tab in Built Stock Explorer

Fig. 1 illustrates the selection of all certified terraced houses and advanced offices located in Bergen and Tromsø, constructed after 1950, having heated floor area within [100...10 000] m2, and using no more than 100 MWh per annum. In this example, 623 units with the total 8.87·104 m2 and 1.11·107 kWh·y-1 match the criteria.