Professional Documents
Culture Documents
CIRED2015 0711 Final
CIRED2015 0711 Final
Paper 0711
Paper 0711
Paper 0711
a certain power substation, which provides energy for the geographical size. When a cell "breaths", some SIM
buildings belonging to that section; so, for the rest of this change the cell it is attached to, if the position of the SIM
paper we will refer to these buildings simply as is based on the location of the cell, this may result in an
substation, distinguishing each one by a primal number. apparent change of the SIM location, which instead did
Sample DSO data gave us measure of the amperage of not change its real position.
each substation every 15 minutes for about two weeks for On the other side, we now have to include power
5 months, in order to show us a pretty wide look to their consumption data in this schema, geolocating them inside
behaviour in the course of the year. the city map.
In addition to that, we consider necessary to an even The substations to analyse has been chosen in a limited
more complete study to include also climatic information central area of Milan, whose peculiarities are common to
to better pinpoint the behalf of consumers according to many other zones, in order to simplify the analysis as
the meteorological changes, to acquire a more precise well as to create a model useful to later describe the rest
estimate of their future request of energy. Power requests of the city. Knowing that every substation is related to a
are influenced by the climatic situation around. Let’s group of addresses, it is needed to identify them and
imagine a very cloudy day, warmed by strong hot understand how much of each cell they occupy. We
weather, or a sunny week with each day characterized by know, for example, that the cell a houses a number x of
low temperature all day long. The estimation of energy events and cell b a number y of events, and the substation
demand in these scenarios implies that our analysis must k is constituted by three building, lying on a and b and
consider how many people will be forced to use more occupying them respectively for za and zb percentage.
energy to warm (or cool down) house although the Because in our environment the events in every cell
number of daylight hours or cloud coverage. We have appear as equally distributed, we can assume that in
therefore to collect a proper set of information regarding substation k occurs the sum of the za percentage of the
meteorological situation [13], to add them in the events x and the zb percentage of the events y. About the
prediction model we’ll describe below. We decided to climatic parameters, our assumption is simpler: in this
introduce some supplemental parameters, such as intermediate phase we still consider the central area of
temperature indexes, in particular heat index and Summer Milan, whose dimension is not as wide as requiring more
simmer index (a new index calculated as a function of air than one set of different meteorological indexes. So at a
temperature and relative humidity; for Celsius degrees particular ∆T, the whole group of available substations in
above 22 it pictures closer the complaint caused by heat this study has the same heat/SSI, UV and cloudiness
because it points out the temperature it would feel like in indexes.
a dry environment such as a desert), cloudiness and UV
indexes. As we do for cellular events distribution, we can 3.3 Building the statistical models
correlate these parameters with the power consumption, Starting from the aim of correlating the two main sources
so that we can find the fittest to satisfy our analysis. of data available we tried to find a statistical significant
model to describe this correspondence using linear,
3.2 Cellular network data elaboration process logarithmic and polynomial regression functions. To this
Sampling for the same O-IMSI and for a given ∆T period end, we used univariate models where: network events
the consecutive events, it will provide the pattern of the and mobile users distributions are considered as the
location updated of the given O-IMSI for the observation independent variable of the model, while energy
period ∆T. This process has to be performed for every O- consumption data are considered as dependent variable.
IMSI collected from the mobile network. The resulting In general, univariate regression functions produce the
patterns are cumulated in a sort of 4D matrix, named slope of a line that best fits a single set of data. As for the
Origin Destination Table - OD table - where every case of the other additional independent variables such as
element represents the number of O-IMSI that moved heat indexes, visibility, cloudiness and ultra-violet
from a given Origin cell-id to a Destination cell-id at a radiations index, we used multivariate models to
certain time T for a certain duration ∆T. understand whether these additional data can complement
In this paper we focus on what lie on the diagonal of this cellular network events in providing better estimate of
matrix, the events occurred where origin and destination energy consumption.
cells correspond, that is the ones related to people Including climatic parameters entails changing the
remained in a particular place during a ∆T. perspective a little. The additional variables we suggest
It has to be mentioned that all the location updates have are heat/SSI, UV and cloudiness indexes, but to avoid
to be filtered from fake movements, such as for example using parameters that apparently could satisfy a good
when phenomena, such as cell "breathing", happens. The coefficient of Determination, (named R2, indicating how
size of the area covered by a cell changes depending on well data fit a statistical model) without having an
the number of users attached to the cell. This change in efficient estimation model, we decide to correlate each of
size is called "breathing" because the size of the cell these indexes with the consumption variables to find the
increases or decreases depending on the number of users, fittest variables which help us to match closely to our
as consequence also neighbouring cells changes their objective.
Paper 0711
Figure 2 - Data Plot for Energy Current Consumption [A] vs. VI Cellular Network Events
3.4 Detected Models As for the case of multivariate models, Figure 2 suggests
the identification of two groups of curves with a similar
We examined a group of 9 substations for 6 weeks from “electrical” behaviour: the first group (Group1) relates to
February to June, covering different seasonal conditions substations St1, St2, St3, St8, St9 and the second one
in terms of weather and people presence. (Group2) to substations St4, St7, St11, St12. Accordingly
As for univariate models, we detected several relevant two models have been derived, with respectively a R2
models for almost all the considered variables. equal to 0.87 and 0.82 for Group1 and Group2.
Starting from the analysis of the cell net events as The set of models can be then used at run-time to
independent variable, Figure 2 shows the univariate estimate in advance the energy consumption just starting
logarithmic models that better describe the correlation from the observation of the three selected independent
between dependent and independent variables and the R2 variables. The execution of the statistical models against
value, which indicates how well data fit the statistical the real-time cellular traffic data showed a relative
model. All substations are described by relevant standard error equal to 16.4%. Since the Relative
statistical models with a R2 higher than 0.5 (the threshold Standard Error is lower than 25% [12], the two
we set-up for models validity.) The average R2 for a multivariate models can be considered reliable enough
univariate model, exploiting just the cellular network data for general adoption
as independent variable, is equal to 0.705 with an upper
value of R2 = 0.869 for the substation St2 and a lower
4. CONCLUSION AND FUTURE WORK
value of R2 = 0.505 for substation St7.
Considering one-by-one the climate parameters as single The adoption of the proposed approach can find its
independent variable, we obtained significant models for breathe in the process of estimating the energy demand.
the two variables: heat/SSI and UV index, with an The models derived are relevant from a statistical point-
average R2 = 0.586 and R2 = 0.655, respectively. As for of-view and prove that cellular network data are a strong
the sky coverage variable, the average R2 of the found indicator to forecast in real-time the energy consumption
models for the 9 evaluated substations is R2 = 0.156 and demand in each area of the city. Of course, the
(lower than the threshold we set-up.) quality of the models can be improved by introducing
Summarizing, the univariate models that use the cellular other variables affecting the energy demand and
network events as independent variable to estimate extending the training data set.
energy consumption are more precise (e.g., based on the One of the further steps will focus in tuning the forecast
R2 analysis) than the models based on climate variables. into different timeslots, in order to calibrate the time-
Paper 0711
window of estimation and adapt the analysis to meet Journal of Applied Social Psychology, 9(1), 70-90.
correspondent requirements in terms of advance and 1979.
duration validity of the demand forecast complementing [6] Saab, S., Badr, E., & Nasr, G. Univariate modeling
the ones currently available to the Distributor System and forecasting of energy consumption: the case of
Operator that are not focused on short terms energy electricity in Lebanon. Energy, 26(1), 1-14. 2001.
demand. About demand forecasting, we studied an urban
area where, theoretically, cell events and power [7] Fumo, N., Mago, P., & Luck, R. Methodology to
consumptions are balanced. Our hypothesis is, in fact, estimate building energy consumption using
that network event density in this zone is proportionally EnergyPlus Benchmark Models. Energy and
related to how much energy is used, because we talked Buildings, 42(12), 2331-2337. 2010.
about cell with high thickness of domestic buildings, [8] Harris, C.; Cahill, V., "Exploiting user behaviour for
which consumptions are adjusted for little amounts of context-aware power management," Wireless And
people but deeply concentrated together. This hypothesis Mobile Computing, Networking And
finds confirmation by the levels of statistical correlations Communications (WiMob'2005), IEEE International
discussed above. In addition, we overlooked the weight Conference on, vol.4, no., pp.122,130 Vol. 4, 22-24
of the voltage phases, whose contribution is to be Aug. 2005.
considered as future work. [9] S. Geisser. Predictive Inference. New York, NY:
To enrich our analysis to a more thorough level, the OD Chapman and Hall. ISBN 0-412-03471-9. 1993.
matrix we introduced in Section 3.2 can be further
filtered focusing on particular SIM cards categories, such [10] Subhash, B.; Rajagopal, V., "Overview of smart
as the ones used by Electric Vehicles (EV), in order to metering system in Smart Grid scenario," Power and
foresee energy demand connected to EV stations. Energy Systems Conference: Towards Sustainable
Moreover, notable case studies to be investigated involve Energy, 2014, vol., no., pp.1, 6, 13-15 March 2014.
forecasting grid demand also in non-urban areas, for [11] Seunghyun Park; Hanjoo Kim; Hichan Moon; Jun
example rural zones, or where large industrial plants are Heo; Sungroh Yoon, "Concurrent simulation
located, with very massive consumptions despite low platform for energy-aware smart metering systems,"
people density, therefore less network events. Consumer Electronics, IEEE Transactions on, vol.56,
no.3, pp.1918,1926, Aug. 2010.
ACKNOWLEDGMENTS [12] Klein, RJ. "Healthy People 2010 criteria for data
The research presented in this article was partially funded suppression". Statistical Notes (Hyattsville, MD:
by the European project SMARTC2NET U.S. National Centre for Health Statistics) (24).
[http://www.smartc2net.eu], sponsored by the EU in the [13] AccuWeather. Web published. Accessed: Sept. 2014.
7th FP (grant agreement n. 318023). URL: http://www.accuweather.com/it/it/italy-
weather.
REFERENCES
[1] A. Monticelli, Electric Power System State
Estimation, Proceedings of the IEEE, vol. 88, no. 2,
Feb. 2000.
[2] Geoffrey K.F. Tso, Kelvin K.W. Yau, Predicting
electricity energy consumption: A comparison of
regression analysis, decision tree and neural
networks, Energy, Volume 32, Issue 9, Pages 1761-
1768, ISSN 0360-5442. Sept. 2007
[3] MathWorld. Regression and Least Squares Fitting.
Web published. Accessed: July 2014. URL:
http://mathworld.wolfram.com/LeastSquaresFitting.h
tml
[4] Coşkun Hamzaçebi, Forecasting of Turkey's net
electricity energy consumption on sectorial bases,
Energy Policy, Volume 35, Issue 3, Pages 2009-
2016, ISSN 0301-4215. March 2007.
[5] Seligman, C., Kriss, M., Darley, J. M., Fazio, R. H.,
Becker, L. J., & Pryor, J. B. Predicting Summer
Energy Consumption from Homeowners' Attitudes1.