Dataset Name: Building-Level fUlly labeled Electricity Disaggregation dataset (BLUED)

Data characteristics Multivariate time series Type of building(s) Residential Date 2011
Type of attributes Integer, Real, string Number of buildings 1 Duration 8 days
Level of details Whole house, events of appliances Location of buildings Pennsylvania, US Sample frequency 12 kHz

Access to the dataset

http://nilm.cmubi.org/

Data set description

The BLUED dataset contains high frequency (12 kHz) data of raw current and voltage of the whole house and the corresponding computed active power (60 Hz). The data set also contains an event timestamps list. That is, whenever an appliance state of power consumption changes by 30 watts or more and lasts for at least 5 seconds. Number of appliances is approximately 50. And a number of 2335 events have been recorded in the dataset. Extra 2482 events from un-known sources are recorded in the dataset as well.

Dataset structure

The BLUED includes one txt file and sixteen bzip files, namely location_001_datadictionary.txt and location001_dataset_001.bzip to 016. The data dictionary file describes codes of appliances (e.g. 129 for TV). Each of bzip files contain information for 12 hours of a day including I-V consumption data, an event list, and event lists for both phases, start and end dates and a readme file.

Attributes

Date (yy/mm/dd)

Time (hh:mm:ss)

Current A (in Amps)

Current B (in Amps)

VoltageA (in Volts)

Active power (in watt)

Paper and citation

K. Anderson, A. Ocneanu, D. Benitez, D. Carlson, A. Rowe, and M. Berges, "Blued: a fully labeled public dataset for event-based non-intrusive load monitoring research," in Proceedings of the 2nd KDD Workshop on Data Mining Applications in Sustainability, Beijing, China, 2012, pp. 12-16.