Data characteristics | Multivariate time series | Type of building(s) | Residential | Date | 2011 |
Type of attributes | Integer, Real, String | Number of buildings | 1 | Duration | 8 days |
Level of detail | Whole house, appliance events | Location of buildings | Pennsylvania, US | Sample frequency | 12 kHz |
The BLUED dataset contains high-frequency (12 kHz) raw current and voltage data for the whole house, together with the corresponding computed active power (60 Hz). The dataset also contains a list of event timestamps. An event is recorded whenever an appliance's power consumption changes by 30 watts or more and the change lasts for at least 5 seconds. The number of appliances is approximately 50, and 2335 events have been recorded in the dataset. A further 2482 events from unknown sources are recorded as well.
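The event criterion above (a change of at least 30 W that persists for at least 5 seconds) can be illustrated with a minimal sketch on a 60 Hz active power series. This is not the procedure used to label BLUED; the function name `detect_events`, its parameters, and the fixed-baseline approach are assumptions made for illustration only.

```python
from statistics import fmean


def detect_events(power, sample_rate_hz=60, min_delta_w=30.0, min_hold_s=5.0):
    """Return sample indices where power departs from the current baseline
    by at least min_delta_w and stays that far away for min_hold_s seconds.

    Hypothetical helper: a simple threshold-and-hold rule, not the
    labeling method used by the dataset authors.
    """
    hold = int(round(min_hold_s * sample_rate_hz))  # samples the change must last
    if not power or hold <= 0:
        return []
    events = []
    baseline = power[0]  # assumed steady-state level before any event
    i = 1
    while i + hold <= len(power):
        window = power[i:i + hold]
        # The change counts only if every sample in the hold window
        # is at least min_delta_w away from the baseline.
        if all(abs(p - baseline) >= min_delta_w for p in window):
            events.append(i)
            baseline = fmean(window)  # adopt the new steady-state level
            i += hold
        else:
            i += 1
    return events
```

With a synthetic series such as `[100.0] * 600 + [150.0] * 600`, the sketch reports one event at the sample where the level steps up. A one-second spike is ignored because it does not last 5 seconds. Slow drift and noisy signals would need a more careful baseline than this sketch uses.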
BLUED includes one text file and sixteen bzip files, namely location_001_datadictionary.txt and location001_dataset_001.bzip to 016. The data dictionary file lists the appliance codes (e.g. 129 for TV). Each bzip file covers 12 hours of a day and contains the I-V consumption data, an event list, event lists for both phases, the start and end dates, and a readme file.
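As a quick consistency check on the file layout, the sketch below enumerates the sixteen archive names and works out how much data they cover. The name pattern is taken from the description above and may not match the actual download exactly; the constants and the helper `archive_names` are illustrative assumptions.

```python
HOURS_PER_FILE = 12        # each archive covers 12 hours
NUM_FILES = 16             # dataset_001 to dataset_016
SAMPLE_RATE_HZ = 12_000    # raw current and voltage sampling rate


def archive_names(num_files=NUM_FILES):
    """Build archive names following the pattern quoted in the description
    (assumed pattern; verify against the actual download)."""
    return [f"location001_dataset_{i:03d}.bzip" for i in range(1, num_files + 1)]


# Total coverage: 16 files x 12 hours = 192 hours = 8 days.
total_days = NUM_FILES * HOURS_PER_FILE / 24

# Raw samples per channel in one 12-hour file at 12 kHz.
samples_per_file = SAMPLE_RATE_HZ * HOURS_PER_FILE * 3600
```

Sixteen 12-hour files give 192 hours, which matches the stated duration of 8 days. At 12 kHz, each file holds 518,400,000 raw samples per channel, so the archives are best read in chunks rather than loaded whole.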
Attributes
Date (yy/mm/dd)
Time (hh:mm:ss)
Current A (in Amps)
Current B (in Amps)
VoltageA (in Volts)
Active power (in Watts)
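The attributes above can be represented as a typed record. The sketch below assumes a hypothetical comma-separated row holding the six attributes in the listed order. The actual layout of the BLUED files is described in their readme and may differ, so `Sample` and `parse_row` are illustrative names only.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Sample:
    timestamp: datetime   # Date (yy/mm/dd) and Time (hh:mm:ss) combined
    current_a: float      # Current A, in Amps
    current_b: float      # Current B, in Amps
    voltage_a: float      # VoltageA, in Volts
    active_power: float   # Active power, in Watts


def parse_row(line, sep=","):
    """Parse one row in the ASSUMED order:
    date, time, current A, current B, voltage A, active power."""
    date_s, time_s, ia, ib, va, p = (field.strip() for field in line.split(sep))
    timestamp = datetime.strptime(f"{date_s} {time_s}", "%y/%m/%d %H:%M:%S")
    return Sample(timestamp, float(ia), float(ib), float(va), float(p))
```

For example, `parse_row("11/10/20, 11:58:32, 0.52, 1.10, 120.3, 185.0")` yields a `Sample` with a 2011 timestamp and the four measurements as floats. The values in this example row are made up for illustration.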
K. Anderson, A. Ocneanu, D. Benitez, D. Carlson, A. Rowe, and M. Berges, "BLUED: a fully labeled public dataset for event-based non-intrusive load monitoring research," in Proceedings of the 2nd KDD Workshop on Data Mining Applications in Sustainability, Beijing, China, 2012, pp. 12-16.