chapter 4 - shodhgangashodhganga.inflibnet.ac.in/bitstream/10603/27652/9/09_chapter 4.pdf · 86...

41
86 CHAPTER 4 METHODOLOGY The methodology for the development of forecasting models to predict the energy requirement in India for the years 2010, 2020 and 2030 is presented in this chapter. Also the development of an Optimal Electricity Allocation Model (OEAM) for allocating electricity through various energy sources is described. 4.1 DEVELOPMENT OF FORECASTING MODELS TO PREDICT ENERGY REQUIREMENT IN INDIA Since energy becomes a vital factor for future developments of the country, a system of models has to be developed to provide forecasts of the energy demand in various sectors. Over the past two decades, forecasting has gained a widespread acceptance as an integral part of business planning and decision-making. Forecasting is the predicting of future values of a variable based on historical values of the same or other variables. Recent literature on energy forecasting offers a broad range of forecasting tools. The energy consumption is characterized by its large variations over time period. It varies widely from year to year and exhibits seasonal variations. Formulating a forecasting model that can accurately forecast the energy consumption is of prime importance in energy system planning. In the present work, an attempt was made to develop forecasting models for consumption of commercial energy sources in India. This analysis

Upload: dinhmien

Post on 09-May-2018

225 views

Category:

Documents


3 download

TRANSCRIPT

86

CHAPTER 4

METHODOLOGY

The methodology for the development of forecasting models to

predict the energy requirement in India for the years 2010, 2020 and 2030 is

presented in this chapter. Also the development of an Optimal Electricity

Allocation Model (OEAM) for allocating electricity through various energy

sources is described.

4.1 DEVELOPMENT OF FORECASTING MODELS TO

PREDICT ENERGY REQUIREMENT IN INDIA

Since energy becomes a vital factor for future developments of the

country, a system of models has to be developed to provide forecasts of the

energy demand in various sectors. Over the past two decades, forecasting has

gained a widespread acceptance as an integral part of business planning and

decision-making. Forecasting is the predicting of future values of a variable

based on historical values of the same or other variables. Recent literature on

energy forecasting offers a broad range of forecasting tools. The energy

consumption is characterized by its large variations over time period. It varies

widely from year to year and exhibits seasonal variations. Formulating a

forecasting model that can accurately forecast the energy consumption is of

prime importance in energy system planning.

In the present work, an attempt was made to develop forecasting

models for consumption of commercial energy sources in India. This analysis

87

utilizes regression techniques, double moving average method, double

exponential smoothing method, triple exponential smoothing method, Auto

Regressive Integrated Moving Average (ARIMA) model and Artificial Neural

Network (ANN) model (Univariate and Multivariate) for the energy

forecasting purpose of commercial energy sources such as coal, oil, electricity

and natural gas consumption in India. Figure 4.1 illustrates the schematic

representation of the development of forecasting models. The methodologies

of above-mentioned models are discussed below.

4.1.1 Time Series-Regression Methods

Regression is a statistical method of fitting a line through data to

minimize squared error. The actual number of data required depends mainly

upon the nature of the data. The objective of the regression is to estimate the

equation of a line through a set of data that minimizes the sum of the squared

differences between the actual data and the line. Regression analysis is used

to estimate the energy demand in different sectors. In time series technique,

past consumption data was used to predict future energy demand. These are

statistical techniques used when several years of data are available and when

relationships and trends are both clear and relatively stable. Time series

analysis helps to identify and explain any regularity or systematic variation in

the series of data which is due to seasonality, cyclical patterns that repeat any

two or three years or more, trends in the data and growth rates of these trends.

The following time series – regression techniques such as linear model,

exponential model, power model and quadratic model were used in the

analysis. The methodologies of these models are given below.

88

89

4.1.1.1 Linear Model

In the linear model, the relationship that is used to fit the ‘n’

number of data is a linear equation in one variable. The linear model is of the

form,

baxy (4.1)

where y is the consumption of particular energy source in specific sector in a

particular time period, x is the time period in year and a and b are constants.

4.1.1.2 Exponential Model

Exponential growth models are appropriate for long-term

forecasting when growth rates are constant over time. An exponential growth

curve has a constant rate of growth at any point in time, resulting in

staggering increases over longer periods as the increase is compounded

exponentially. This model can be approximated by the exponential curve of

the following form,

)( btat eY (4.2)

where Yt is the time series at time t, e = 2.71828, a is the intercept and b is the

equivalent of the slope and denotes the growth rate. The above equation can

be transformed to make it linear, by taking natural logarithms of both sides of

the equation. The forecasting is obtained by the following equation,

)()(

btamt eF (4.3)

where m is the number of periods ahead to be forecasted.

90

4.1.1.3 Power Model

The power model is also one of the time series regression

techniques considered for the forecasting. The power model is of the form,

baxy (4.4)

where y is the consumption of particular energy source in specific sector in a

particular time period, x is the time period in year and a and b are constants.

4.1.1.4 Quadratic Model

If the data appear to follow a quadratic pattern, then the non-linear

regression has to be applied. The quadratic model is of the form,

cbxaxy 2 (4.5)

where y is the consumption of particular energy source in specific sector in a

particular time period, x is the time period in year and a, b and c are constants.

4.1.2 Double Moving Average Method

The moving averages are applicable to time series data and in many

situations they are more appropriate and easier to use than regression

methods. The term “moving average” is used because, as each new

observation in the series becomes available, the oldest observation is dropped

and a new average is computed. One of the chief difficulties in applying

regression is the need to update coefficients of the least square equation

91

whenever a new data point is obtained. Because a relatively large number of

calculations are required to update the equations, more suitable forecasting

techniques such as moving average methods have been developed. With the

moving average methods, forecasting equations can be quickly revised with a

relatively small number of calculations as each new data point is collected.

Furthermore, moving average methods are able to deal directly with serial

correlation in time series data while regression techniques are not. A moving

average is simply a numerical average of the last N data points that are used

for purposes of making a forecast. As each new data point is acquired, it is

included in calculating the average and the data point for the Nth period

preceding the new data point is discarded. If the data have a linear or

quadratic trend, the simple moving average will be misleading. In the present

work double moving average method was used as one of the methods for

forecasting.

To calculate the double moving average Mt[2], the simple moving

average Mt[1] was treated as an individual data points and a moving average of

these averages was obtained. The values of Mt[2] were used to determine the

equation for forecasting future energy demand. The forecasting equation is of

the form,

Tbay ttTt (4.6)

where T is the number of time periods from the present time, t, to the period

for which the forecasting was made. The values of at and bt were determined

by the equations,

]2[]1[2 ttt MMa (4.7)

92

]MM[)]1N/(2[b ]2[t

]1[tt (4.8)

4.1.3 Exponential Smoothing Method

Exponential smoothing methods are one of the popular methods of

forecasting because they are easy to use, require very little computer time, and

need only a few data points to obtain future predictions. The exponential

smoothing method also eliminates the difficulties faced in the regression

methods. Another advantage of exponential smoothing methods over

regression methods is that it permits the forecaster to place more weight on

current data rather than treating all data points with equal importance.

Exponential smoothing methods also minimize data storage requirements

when calculations are performed manually or with a computer. The moving

average technique requires a relatively large amount of data storage.

Exponential smoothing methods have many of the same advantage as moving

average technique but require a minimum amount of data storage. The basic

exponential smoothing model is,

]1[

1]1[ )1( ttt SXS (4.9)

or

New estimate = [new data] + (1 - ) [Previous estimate]

The term (alpha) is called smoothing constant normally ranges between

0.01 to 0.3. The value of is obtained by trial and error, starting with a

certain value of and then increasing or decreasing it to find the value that

minimizes the percentage error. The double exponential smoothing method

and triple exponential smoothing method were used in the analysis.

93

4.1.3.1 Double Exponential Smoothing Method

The single exponential smoothing cannot deal with non-stationary

data. The linear exponential smoothing is an attempt to deal with linear non-

stationary data. Its only difference from single exponential smoothing is that it

introduces extra formulas that can estimate the trend and subsequently use it

for forecasting. To develop an equation that takes account of a linear trend in

data, double exponentially smoothed statistics St[2] was calculated. The value

of St[2] was determined by the following relation,

]2[1

]1[]2[ )1(ttt SSS (4.10)

The initial values of S0[1] and S0

[2] were assumed in these

calculations. The initial estimates were not important since relatively large

amount of data was available for forecasting. The forecasting equation is of

the form,

Tbay ttTt (4.11)

where T is the number of time periods from the present time, t, to the period

for which the forecasting was made. The values of constants at and bt were

determined by the following relation,

]2[]1[2ttt SSa (4.12)

]SS[)]1/([b ]2[t

]1[tt (4.13)

94

4.1.3.2 Triple Exponential Smoothing Method

The triple exponential smoothing is also called as quadratic

exponential smoothing. It is an extension of linear exponential smoothing. It

aims at dealing with trend of a higher order than linear trend. This is achieved

by introducing triple exponential smoothing which, in addition to the double

exponential smoothing, is used to remove quadratic trends. If the data used for

the analysis exhibits curvature in nature, then double exponential smoothing

is inadequate. In such cases, triple exponential smoothing is used. Triple

exponential smoothed data are sufficient for almost all practical applications.

The triple exponentially smoothed statistics were calculated as follows,

]3[1

]2[]3[ )1( ttt SSS (4.14)

The initial values of S0[2] and S0

[3] were also assumed in these calculations.

The forecasting equation is of the form,

2TcTbay tttTt (4.15)

where T is the number of time periods from the present time, t, to the period

for which the forecasting was made. The values of constants at, bt and ct are

determined by the following relation,

]3[]2[]1[ 33 ttt SSSa (4.16)

])34()45(2)56][()1(2/[ ]3[]2[]1[2tttt SSSb (4.17)

]2][)1(2/[ ]3[]2[]1[22tttt SSSc (4.18)

95

4.1.4 Auto Regressive Integrated Moving Average (ARIMA) Model

A common approach to the forecasting is the Box-Jenkins time

series approach, which is used here for the forecasting of commercial energy

sources in India. It has the attracting features because it is an economical

approach, which can represent stationary and non- stationary stochastic

processes. The objective here was to build an Auto Regressive Integrated

Moving Average (ARIMA) model, which adequately represents the data

generating processes. The Box-Jenkins method involves the following four-

step iterative cycle. They are:

1. Model identification,

2. Model estimation

3. Diagnostic checking, and

4. Forecasting with the final model.

Forecasting with the estimated model is based on the assumption

that the estimated model will hold in the horizon for which the forecasts are

made. The AR (Auto Regressive) part of the model indicates the future values

of yt are weighted averages of current and past realizations. Similarly, the MA

(Moving Average) part of the model shows how current and random shocks

will affect the future values of yt. Since AR models are simple to estimate,

have well developed model selection criteria, and require limited pre testing,

they are the form of ARIMA used here.

In an autoregressive integrated moving average model, the future

value of a variable is assumed to be a linear function of several past

observations and random errors. That is, the underlying process that generate

the time series has the form

96

yt 0 1yt-1 2yt-2 + · · · pyt-p t - 1 t-1 - 2 t-2 - · · · - q t-q

(4.19)

where yt t are the actual value and random error at time period t,

i j (j=0, 1, 2...q) are model parameters. p and q

are t,

are assumed to be independently and identically distributed with a mean of 2.

Equation 4.19 causes several important special cases of the ARIMA

family of models. If q = 0, then the model becomes an AR model of order p.

When p = 0, the model reduces to an MA model of order q. One central task

of the ARIMA model building is to determine the appropriate model order (p

and q). Based on the earlier work Box and Jenkins developed a practical

approach to building ARIMA models, which has the fundamental impact on

the time series analysis and forecasting applications. The Box–Jenkins

methodology includes three iterative steps of model identification, parameter

estimation and diagnostic checking. The basic idea of model identification is

that if a time series is generated from an ARIMA process, it should have some

theoretical autocorrelation properties. By matching the empirical

autocorrelation patterns with the theoretical ones, it is often possible to

identify one or several potential models for the given time series. Box and

Jenkins proposed to use the autocorrelation function and the partial

autocorrelation function of the sample data as the basic tools to identify the

order of the ARIMA model.

In the identification step, data transformation is often needed to

make the time series stationary. Stationary is a necessary condition in building

an ARIMA model that is useful for forecasting. A stationary time series has

the property that its statistical characteristics such as the mean and the

autocorrelation structure are constant over time. When the observed time

97

series presents trend, differencing and power transformation are often applied

to the data to remove the trend and stabilize the variance before an ARIMA

model can be fitted.

Once a tentative model is specified, estimation of the model

parameters is straightforward. The parameters are estimated such that an

overall measure of errors is minimized. This can be done with a nonlinear

optimization procedure. The last step of model building is the diagnostic

checking of model adequacy. If the model is not adequate, a new tentative

model should be identified, which is again followed by the steps of parameter

estimation and model verification. Diagnostic information may help to

suggest alternative model(s).

This three-step model building process is typically repeated several

times until a satisfactory model is finally selected. The final selected model

can then be used for prediction purposes. For applying the above model a

package called Statistical Package for Social Sciences (SPSS) was used in the

present work and the step-by-step procedure is explained below.

4.1.4.1 Step by step procedure

The systematic procedure for applying ARIMA (Box-Jenkins)

model through SPSS (Statistical Package for Social Sciences) software

package for forecasting is given below.

Select one dependent variable and move it into the ‘Dependent

Box’.

Optionally, select one or more independent variables and

move them into the ‘Independent(s) Box’.

98

Transform the series before estimation by choosing one of the options from the ‘Transform List’. The available transform options are:

o None. No transformation is performed.

o Natural log. Log transforms the series before estimation using the natural logarithm (base e).

o Log base 10. Log transforms the series before estimation using the base 10 logarithm.

The model group allows specifying the three parameters of the ARIMA model, autoregressive, difference and moving

average. These parameters are commonly referred to as p, d, and q, respectively. The corresponding seasonal parameters can also be defined if required.

Deselect the option named ‘Include Constant’ in equation if there is no need to estimate a constant term.

By clicking over ‘Save’, new variables were created, containing predicted values and residuals.

By clicking on ‘Options’ one can select convergence criteria, set initial values for the model and to choose how to display parameters in the output.

By following the above-mentioned steps, the ARIMA model was applied to the data and the forecasting of the commercial energy sources in India were arrived.

4.1.5 Artificial Neural Network (ANN) Model (Univariate and

Multivariate)

In general ANN’s are computational paradigms that implements

simplified models of their biological counter part, biological neural structures.

99

Biological neural network are the local assemblage of neurons and their

dendrite connections that form the human brain. Accordingly, ANN’s are

characterized by local processing in artificial neuron i.e parallel processing,

which is implemented by the rich connection pattern between processing

elements. The basic building block of the ANN is artificial neuron. The

neurons are grouped together in parallel to form layers. The layers are

interconnected through the weighting factors. Signals can flow from the input

layer through to the output layer in two ways that is unidirectional or

bi-directional. In unidirectional connections the neurons are connected from

one layer to next but not with in the same layer. The first and last layers of

Feed Forward Neural Network (FFNN) are called the input and output layers

and those in between are termed as hidden layers.

4.1.5.1 Application of Neural Network in Forecasting

The data set involves inputs and outputs of the network. Inputs

were past energy consumption data, GNP (Gross National Product) and

population in the case of multivariate Artificial Neural Network (ANN) model

and only past energy consumption data in the case of univariate Artificial

Neural Network (ANN) model and energy demand of different energy sources

like coal, petroleum, electricity and natural gas as output in both the cases. In

the total data available, 80 % was used for training and remaining data for

validation purpose. Once the network is trained, it was used for forecasting

the future energy demand.

4.2.5.2 Simple Neuron Physiology

The simple neuron physiology is illustrated in Figure 4.2 that

depicts the major components of the typical nerve cell in the central nervous

system. The membrane is a permeable to certain ionic species and acts to

100

maintain a potential difference between the intracellular and extra cellular

fluid. It accomplishes this task primarily by the action of sodium – potassium

membrane.

Figure 4.2 Simple neuron physiology

101

4.1.5.3 General processing element

The general processing element of the Artificial Neural Network

(ANN) is shown in Figure 4.3. The individual computational elements that

make up the most artificial neural system models are rarely called artificial

neurons. They are more often referred as nodes, units or processing elements

(PE’s). It is not always appropriate to think of the processing elements in a

neural network as being one to one relationship with actual biological

neurons.

Figure 4.3 General processing element of ANN

Each processing element is numbered, the one in the Figure 4.3

being ‘i’. For example, like a real neuron the processing element has many

inputs but it has only a single output, which can fan out to many other

processing elements in the network. The input, ith receives from the jth

processing element as indicated as Xj. Each connection to the ith-processing

element has associated with it a quantity called a weight or connection

strength (from here both words are used interchangeably). All these quantities

have analogous in the standard neuron model. The output of the processing

102

element corresponds to the strength of the synaptic connection between

neurons. In the present models, these quantities were represented as real

numbers.

Each processing element determines a net input value based on all

its input connections. In the absence of special connections, the net input have

been calculated by summing the input values gated (multiplied) by their

corresponding weights. In other words the net input to the ith unit can be

written as

Netij = Xj x Wij (4.20)

Where the index j runs all over connections to the processing element.

4.1.5.4 Network properties

Some of the important neural network properties are listed here.

The topology of a neural network refers to its framework as well as its

interconnection scheme. The framework is specified by a number of layers or

slabs and the number of nodes per layer. The types of layers include the input

layer, the hidden layer and the output layer.

4.1.5.4.1 The input layer

The nodes in the input layer are called input units, which encode

the instance presented to the network for processing. For Example, each input

may be designated by an attribute value possessed by an instance of the value.

103

4.1.5.4.2 The hidden layer

The nodes in the hidden layer are called hidden units, which are not

directionally observable and hence hidden. They provide non-linearity for the

network.

4.1.5.4.3 The output layer

The nodes in the output layer are called output units, which encode

possible concepts or values to be assigned to the instance under

considerations. For example, each output units represent a class of objects.

4.1.5.4.4 Feed forward network

All connections point in one direction (from the input toward the

output layer).

4.1.5.4.5 Symmetrical connections

If there is a connection pointing from node ‘i’ to node ‘j’, then there

is also a connection from node ‘j’ to node ‘i’ and the weights associated with

the two connections are equal to notational Wij = Wji.

4.1.5.4.6 Asymmetrical connections

If connections are not symmetrical as defined above, then they are

asymmetrical. Connection weights can be real numbers or integers. They are

adjustable during network training, but some can be fixed deliberately. When

training is completed all of them should be fixed.

104

4.1.5.5 Node Properties

The activation levels of nodes can be discrete (e.g. 0 & 1) or

continuous across in the range or unrestricted. This depends on the activation

(transfer) function chosen. If it is a hard limiting function then the activation

levels are 0 (or -1) and 1. For a sigmoid function the activation levels are

limited to a continuous range of real [0, 1]. The Sigmoid function at can be

mathematically given as

)e1/(1a xt (4.21)

where x is the input variable. In the case of a linear activation function, the

activation levels are open. The activation function is mentioned again in the

case of the system dynamics.

4.1.5.6 Back Propagation Neural (BPN) Network Algorithm

The schematic of the feed forward back propagation neural network

is shown in Figure 4.4. The network learns a predefined set of input-output

variable pairs by using two phases propagate- adept cycle. After an input has

been applied as stimulus to the first layer of neural network units, it is

propagated through each upper layer until an output is generated. This output

pattern is then compared to the desired output and an error signal is computed

for each output unit.

The error signals are then transmitted backward from the output

layer to each node in the intermediate layers organize themselves such that

different nodes learn to recognize different features of the total input space.

After training, when presented with an arbitrary input pattern that resembles

the feature the individual units learned to inhibit their outputs if the input

pattern does not contain feature that they trained to recognize.

105

Figure 4.4 Schematic of back propagation neural network

As the signals propagate through the different layers in the network,

the activity present at each upper layer can be thought of as a pattern with

features that can be recognized by units in the subsequent layer. The output

pattern generated can be thought as a feature map that provides an indication

of the presence or absence of many different feature combinations at the

input. The total effect of this behaviour is that the BPN provides an effective

means of allowing a computer system to examine data may be incomplete or

noisy, and to recognize delicate patterns from the partial input.

The above theory can be summarized and given as step-by-step

algorithmic procedure, which is to be used for developing a C++ code, and

developed as general-purpose user interactive program for this present work

analysis. The various steps involved in the Feed Forward Back Propagation

Network (FFBPN) algorithm, which is used for the forecasting of commercial

energy sources, has been presented below.

106

o Apply the input vector to the input units

Xp = (Xp1, Xp2, Xp3…Xpn) (4.22)

o Calculate the net-input values to the hidden layer units

netpjh = Wji

h Xpi (4.23)

o Calculate the outputs from the hidden layers.

ipj = fjh (netpj

h ) (4.24)

o Move to the output layer. Calculate the net-input

values to each unit.

netpko = Wki

o ipj (4.25)

o Calculate the outputs from the output layer.

Opk = fko ( neto

pk) (4.26)

o Calculate the error terms for the output units.opk = (Ypk - Opk) f k

o (netopk) (4.27)

o Calculate the error terms for the hidden units.opj = f jh (netpj

h) opk Wo

kj (4.28)

o Update the weights on the output layer.

Wkjo (t+1) = Wkj

o (t) + opk ipj (4.29)

o Update the weights on the hidden layer.

Whji (t+1) = Wh

jiopj Xi (4.30)

107

where

h - Number of the hidden layer in the network.

j - Number of nodes in the hidden layer

k - Number of nodes in the output layer.

i - Number of nodes in the input layer.

w - Connection strength or Weight.

- Error between actual and predicted value.

- Learn Rate of the network.

O - Output demand calculated by the network

Xp - Input variable to the neural network

Y - Actual demand

The following sample calculations explain the step by step working

of C++ Artificial Neural Network (ANN) code used for energy demand

forecasting in the present study. The procedure was based on the Feed

Forward Back Propagation Network (FFBPN) algorithm.

Step 1: Generation of initial weights and applying the input vector

to the input units

Random numbers generate initial connection strengths or weights

by using the user defined functions. In general these weights may have any

value. At the end of the each iteration these weights will be modified

according to error calculated in the output layer and later they may assume

either positive or negative values, until the least Mean Square Error (MSE) is

arrived. Figures 4.5 and 4.6 show the modification of error in the starting and

final stages of the network training, respectively.

108

0

0.05

0.1

0.15

0.2

0.25

1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 41 Iterations

Figure 4.5 Error propagation in initial stages of weights modification

0.00E+00

1.00E-05

2.00E-05

3.00E-05

4.00E-05

5.00E-05

6.00E-05

7.00E-05

8.00E-05

9.00E-05

1.00E-04

1 8 15 22 29 36 43 50 57 64 71 78 85 92 99 106 113 120 127 134 141 148 155 162 169 176 183

Iteration Number from last

Figure 4.6 Error propagation in final stages of weights modification

109

In the present total electricity demand case, the network consists of

three input neurons, six hidden neurons and one output neuron. So, eighteen

weights are present between input to hidden layer and six between the hidden

to output layer.

Input to the neural network should always be in the range 0.0 and 1.0.

So, all the actual input values should undergo a process of transformation

called Normalization. Normalization is a process similar to interpolation

defined as the process of conversion of all inputs into the defined zone. In this

present work, intentionally the range was set between 0.1 and 0.9. This is

done to accommodate any further forecast beyond the year 2030. Here 0.1

was made equivalent to year 1950 data and 0.9 to year 2030 data. Due to the

unavailability of 2030 demand some arbitrary value was chosen from

regression analysis, whose validation error for the network was calculated.

Depending on the validation error and training set results, 2030 demand was

changed (similar to a trail and error procedure). The same process was

continued till the validation error was minimized. This way of the input data

optimization is required for the cases where the maximum value in input data

is not available.

As an illustration one data input is chosen and its propagation

through the network for the Artificial Neural Network (ANN) (Multivariate)

model is presented below. Table 4.1 shows the actual and normalized data for

the year 1950 consisting year, GNP, population and total electricity

consumption.

The weights for the input to hidden layer nodes are given in

Table 4.2 and the weights of the hidden to output layer nodes are listed in

Table 4.3.

110

Table 4.1 Actual and normalized input data for year 1950 for the total

electricity consumption

Data Type YearGNP, Billion Rupees

Population Total Electricity Demand, GWh

Actual Data 1950 100 36321100 4156.61

Normalized Data 0.1 0.1 0.1 0.1

Table 4.2 Connection strengths of the input layer to hidden layer

W1 W2 W3 W4 W5 W6

W1 0.492965 0.235099 0.394299 0.001205 0.309076 0.063707

W2 0.029893 0.066073 0.491241 0.057970 0.483245 0.389294

W3 0.192404 0.194601 0.029237 0.178014 0.426878 0.260628

Table 4.3 Connection strengths of hidden layer to output Layer

W11 W21 W31 W41 W51 W61

0.379208 0.135166 0.472167 0.030244 0.201987 0.408155

The following Figure 4.7 gives the pictorial representation of the

multivariate ANN model.

Step 2: Calculate the net-input values to the hidden layer units.

The connection strengths or weights generated in the step 1 were

multiplied with the normalized data and were given as the input to the

corresponding node in the hidden layer.

111

netpjh = Wji

h Xpi

netpjh (First node of the hidden layer) = 0.071526

netpjh (Second node of the hidden layer) = 0.049577

netpjh (Third node of the hidden layer) = 0.091478

netpjh (Fourth node of the hidden layer) = 0.023719

netpjh (Fifth node of the hidden layer) = 0.12192

netpjh (Sixth node of the hidden layer) = 0.071363

Figure 4.7 Pictorial representation of the multivariate ANN model

Step 3: Calculate the outputs from the hidden layers.

The net value calculated in the step 2 was sent into the hidden

neuron, where an activation function (sigmoid function) is applied. Figure 4.8

112

f( netpj) Xp1 x W11

Xp2 x W21

Xp3 x W31

ipj

illustrates the transformation of data in the hidden layer node. After

activation, net value is the output of the hidden neuron to the nodes in the

output layer and is given as

ipj = fjh (netpj

h )

ipj = 1 / ( 1+ e - netpjh

)

Figure 4.8 Transformation of data in the hidden layer node

ipj (First node of the hidden layer) = 0.517874

ipj (Second node of the hidden layer) = 0.512392

ipj (Third node of the hidden layer) = 0.522853

ipj (Fourth node of the hidden layer) = 0.505929

ipj (Fifth node of the hidden layer) = 0.530442

ipj (Sixth node of the hidden layer) = 0.517833

Step 4: Move to the output layer. Calculate the net-input values to

each unit.

Each hidden node of the hidden layer gives single ipj value and the

same is multiplied with the weights of hidden to output layer.

113

netpko = Wki

o ipj

netpko = 0.846314.

Step 5: Calculate the outputs from the output layer.

The value obtained from the previous step is subjected to the

activation or transformation by activation function, which gives the output

from the neural network.

Opk = fko ( neto

pk)

The output from the network was renormalized to get the predicted

value.

Step 6: Calculate the error terms for the output units.

The error terms for the output units were determined by the

following relation.

opk = (Ypk - Opk) f k

o (netopk)

Step 7: Calculate the error terms for the hidden units.

The error terms for the hidden units were determined by the

following relation. Since there are six nodes in the hidden layer, the number

of error terms for the hidden unit is also six.

opj = f jh (netpj

h) opk Wo

kj

op1 = -0.051185

op2 = -0.051219

op3 = -0.051144

114

op4 = -0.051243

op5 = -0.051061

op6 = -0.051186

Step 8: Update the weights on the output layer.

During the training phase, weights were changed continuously until

the optimization of the weights was reached. Modification of the weights

depends on the error of the network. This part will be taken care of by the

back propagation algorithm. Modification of the weights depends on the learn

rate, which is defined as a parameter taking care of the magnitude of change

in the values of connection strengths. The weights for the hidden to output

layer were modified by the following equation

Wkj o (t+1) = Wkjo (t) + o

pk ipj

Table 4.4 Updated weights for the hidden to output layer

W11 W21 W31 W41 W51 W61

0.336742 0.093149 0.429292 -0.011243 0.15849 0.365692

Step 9: Update the weights on the hidden layer.

The weights for the input to hidden layers are modified by the

following equation and the values are given below.

Whji (t+1) = Wh

ji opj Xi

.

115

Table 4.5 Updated weights for the input to hidden layers

W1 W2 W3 W4 W5 W6

W1 0.490918 0.23305 0.392253 -0.00084 0.307034 0.06166

W2 0.027846 0.064024 0.489195 0.05592 0.481203 0.387247

W3 0.190357 0.192552 0.027191 0.175964 0.424836 0.258581

In similar fashion, weights were modified until the least Mean

Square Error (MSE) is reached.

As per the methodologies of the various forecasting techniques

such as regression techniques, double moving average method, double

exponential smoothing method, triple exponential smoothing method,

ARIMA model and Artificial Neural Network (ANN) (Univariate and

Multivariate) models, energy forecasting was carried out for the commercial

energy sources such as coal, oil, electricity and natural gas consumption in

India.

4.2 DEVELOPMENT OF AN OPTIMAL ELECTRICITY

ALLOCATION MODEL (OEAM)

The utilization of commercial energy sources is increasing

enormously which will inevitably lead to a tremendous amount of

environmental pollution and global warming due to the green house effect.

Hence, it is essential to seek for non-polluting renewable energy sources for

the power generation in addition to the commercial energy sources. An

electricity model would facilitate the effective utilization of energy sources

for the power generation in India. In this chapter, an Optimal Electricity

Allocation Model (OEAM) was developed and presented in detail.

116

4.2.1 Fuzzy Linear Programming

In many practical situations, it is not reasonable to require that the

constraints or the objective function in linear programming problems be

specified in precise, crisp terms. In such situations, it is desirable to use some

type of fuzzy linear programming. In general fuzzy linear programming

problems are first converted into equivalent crisp linear or non-linear

problems, which are then solved by standard methods. The results of a fuzzy

linear programming problem are thus real numbers, which represent a

compromise in terms of the fuzzy numbers involved.

In the present research work, the constraints efficiency, emission

and carbon tax were considered as fuzzy linear constraints. These constraints

have the linguistic variables rather than exact quantitative variables to

represent imprecise concepts.

The membership function used in the fuzzy linear programming

model characterizes the fuzziness in a fuzzy set. In the present research study,

the data for the constraints such as efficiency, emission and carbon tax were

not available in single value. But it is available in the form of specific ranges.

Hence, it was identified to use the fuzzy logic concept for these three

constraints. In addition, it was identified to use the trapezoidal shaped

membership function. The intuition method was used in the present work to

assign membership functions to fuzzy variables. Intuition involves contextual

and semantic knowledge about an issue.

The Matlab code was generated to solve the fuzzy linear

programming. The input values such as unit cost of power generation from

different energy options, the predicted energy demand for the year 2020, the

potential of the various energy sources, the efficiency of the various energy

117

systems, the emission released from the different power plants and the carbon

tax imposed for the carbon emission were fed into the fuzzy linear program.

The program first converts the data into fuzzified values and the solution is

arrived, which was then defuzzified to get the actual output of the program.

The model was run for the electricity distribution to meet the electric energy

gap in India for the year 2020.

The Matlab code generated to solve the fuzzy based linear programming problem was programmed in such away that it can solve the problem with any number of variables in the objective function. In addition, it is possible to solve problems with any number of fuzzy linear constraint equations and variables.

4.2.2 Variables in Optimal Electricity Allocation Model (OEAM)

The possible energy options were considered in the model to meet the electricity demand in India. There are 20 energy options considered in the Optimal Electricity Allocation Model (OEAM) as shown in Table 4.6.

The model optimizes and selects the appropriate energy options for the power generation based on the factors such as cost, potential, demand, efficiency, emission and carbon tax. The objective function of the model is minimizing the unit cost of power generation. The other factors were used as constraints in the model.

4.2.2.1 Cost Factor

Since India is a developing country, the cost of power generation is very important in the economic point of view. The renewable energy systems typically have higher capital costs than fossil-fuelled systems, since all the fuel, equivalent over the useful lifetime was purchased at the beginning of the

118

Table 4.6 Different energy options (i) for the power generation (j) in

India

S.No Xi,j Different Energy Systems (i)

1.

2.

3

4.

5.

6.

7.

8.

9.

10.

11.

12.

13.

14.

15.

16.

17.

18.

19.

20.

X1, 1

X2, 1

X3, 1

X4, 1

X5, 1

X6, 1

X7, 1

X8, 1

X9, 1

X10, 1

X11, 1

X12, 1

X13, 1

X14, 1

X15, 1

X16, 1

X17, 1

X18, 1

X19, 1

X20, 1

Coal based power generation

Diesel based power generation

Gas turbine power generation

Nuclear based power generation

Hydro based power generation

Wind based power generation

Biodiesel based power generation

Biomass gasifier based power generation

Biogas based power generation

Solid Waste based power generation

Cogeneration based power generation

Ethanol based power generation

Solar PV based power generation

Solar thermal based power generation

OTEC based power generation

Tidal based power generation

Geothermal based power generation

Mini Hydel based power generation

Fuel cell based power generation

MHD based power generation

system life. Emphasis on life-cycle costs and reduction of the risks of high

capital investments would be necessary for the success of renewables. Costs

for energy from renewable energy systems are expected to be reduced over

119

the next few decades, due primarily to higher production volume. But the cost

of power generation by means of commercial energy systems are found to be

increasing every year considerably. Incremental improvements are also

expected in efficiency, materials, reliability and application of the energy

systems.

4.2.2.2 Potential Factor

The magnitude of each energy resource was dependent on local

conditions. To optimize the use of these resources, better data was needed,

especially on their variation. The comprehensive understanding of local

conditions throughout the world, will take extensive effort, but each country

has indigenous energy resources that should be understood as part of the

national or regional energy planning.

4.2.2.3 Demand Factor

The World Energy Council (WEC) report made a conservative

projection for renewable energy systems, and according to the minimum

possible scenario for 2020, renewable would meet 3-4 percent of the total

energy demand. Under the maximum possible scenario with major policy

initiatives, renewable might provide 8-12 percent of the total energy demand

by the year 2020. The recent Energy Policy review concluded that renewables

offer considerable potential for displacing conventional energy sources, and in

some cases was already competitive with them, in addition to offering

environmental advantages. This emphasized the need to make detailed studies

at the country level, both to estimate the potential of each energy source, to

meet the future demands, and to highlight environmental and socioeconomic

benefits.

120

4.2.2.4 Efficiency Factor

The efficiency factor plays a major role in the selection of the

energy systems for power generation. Many of the technologies necessary to

make efficient use of various energy systems were quite immature and

relatively costly. Renewable energy systems have a relatively low energy

density in their raw form. To convert this to high energy density requires

tremendous costs. Research and development of the emerging, renewable

energy technologies need to be continued and expanded, if these options were

to be rapidly moved to maturity. Also, necessary R&D is required to increase

the efficiency of the commercial energy systems.

4.2.2.5 Emission Factor

The emission factor is very important in the environmental point of

view. The emission released by the power plants leads to the increase in

global warming and Green House Gases (GHG) emissions. Also, it affects the

health of the people living in and around the location of the power plants.

Also, the increase in emission level leads to the ozone layer depletion. It is the

need of the hour to curtail the emission released by the power plants. Since

the emissions are mainly from the fossil- fuelled power plants, these plants

should be replaced with non-polluting plants to some extent.

4.2.2.6 Carbon Tax Factor

Emissions of Green House Gases (GHG) are thought to be a serious

threat to the well being of humankind and other species. The principal culprit

is CO2, generated largely by the extensive use of fossil fuels for the power

generation. The CO2 concentrations in the atmosphere have risen from

285 ppm at the beginning of the industrial revolution to around 370 ppm

121

today, and are still rising. Carbon tax is the tax on CO2 emissions from major

energy sources, and in particular on the burning of coal, oil and natural gas,

unless the CO2 released from such plants are prevented from entering the

atmosphere through sequestration. The eventual rate of tax should be

calibrated to the desired reduction in CO2 emissions. The tax would thus

discharge the use of all fossil fuels relative to alternative renewables. The tax

rate should be set high enough to reduce CO2 emissions significantly.

The mathematical representation of the Optimal Electricity

Allocation Model (OEAM) is given in the following equations:

Minimizel

iijij XCZ

1 (4.31)

Subject to constraints

Potential ][k

P)X(ij

19

1k

m

1i

(4.32)

Demand l

1ijij DX (4.33)

Efficiencyl

ijijij DX

1 (4.34)

Emission ]][[ nnE T)X(ik

19

1k

m

1i

(4.35)

Carbon Tax ]][[ RnnE T )X( rik

19

1k

m

1i

(4.36)

122

where

C = Unit cost of the energy system

= Efficiency of the energy system

l = Number of energy systems for power generation = 20

m = Number of system in respective resources = 20

D = Energy demand (GWh)

k = Resources

P = Potential of sources (GWh)

En = Emission constant (g/GWh)

Tn = Target emission level (g/year)

r = Carbon tax (Rs/ton)

R = Projected carbon tax in 2020 (Rs/ton)

X = Quantum of energy (GWh)

i = Various energy systems

j = Power generation

Among the five constraints considered in the model, efficiency,

emission and carbon tax were considered as fuzzy linear constraints while

demand and potential were considered as ordinary linear constraint.

Here, the constraints such as efficiency, emission and carbon tax do

not have crisp values and hence they were treated as fuzzy linear constraints.

On the other hand, potential and demand were considered as ordinary linear

constraints since they have exact values.

The demand for electricity consumption during 2020 in India would

be 993385 GWh, which was predicted by the Artificial Neural Network

(ANN) forecasting model. The present electricity demand is 565102 GWh.

Hence; the electric energy gap for the year 2020 was calculated as

123

428283 GWh. This electric energy gap should be met by the various energy

options with special consideration to the emission level and carbon tax and

minimizing the unit cost as the objective function. Even though abundant

potential of solar, wind and biomass energy are available in India, factors like

quality of the resources, intermittent nature and technical feasibility would

decide the quantum of electricity utilization from different energy sources. In

this OEAM model, this was considered as potential constraint.

4.2.3 Various Energy Options

The various energy options for the power generation are shown in

Figure 4.9. There are twenty different energy systems namely coal, diesel,

gas, nuclear, hydro, wind, biodiesel, biomass gasifier, biogas, solid waste,

cogeneration, ethanol, solar PV, solar thermal, OTEC, tidal, geothermal, mini

hydel, fuel cell and MHD were considered in the model for the allocation of

electricity to meet the electric energy gap (demand) for the year 2020. The

various inputs to the OEAM model are listed in Table 4.7. The input values

have been fed in the fuzzy linear programming and the model was run for the

electricity distribution to meet the electric energy gap in India for the year

2020. Figure 4.10 shows the schematic representation of the Optimal

Electricity Allocation Model (OEAM). The inputs for different variables were

fed into the model and the model was run to obtain the optimal electricity

distribution pattern for the year 2020.

124

Centralized andDecentralized

PowerGeneration

Diesel

Ethanol

Biodiesel

Nuclear

Wind

Hydro

Biomass Gasifier

Solid Waste

Bio gas

MHD

Fuel Cell

OTEC

Solar

CoalGas

Solar PV Cogeneration

Tidal

Geothermal

Mini Hydel

Figure 4.9 Various energy options for power generation

125

Table 4.7 Inputs to the OEAM model

S.No. Different Energy Systems (i)

Unit cost,Rs. / kWh

Potential,*1010

kWh/year Efficiency,

%

CarbonEmission,

g/kWh1. Coal based power

generation 1.80 34194 29-38 960-1300

2. Diesel based power generation 4.50 767.36 34-42 690-870

3.

Gas turbine power generation 3.30 750.36 28-38 460-1230

4. Nuclear based power generation

3 131.88 33-43 9-100

5. Hydro based power generation

1 19.11 40-50 2-41

6. Wind based power generation

2.75 2.24 30-40 11-75

7. Biodiesel based power generation

4.60 5.067 30-35 550-690

8. Biomass gasifier based power generation

2 4.5518 20-28 37-166

9. Biogas based power generation

1.25 1.4112 22-26 6-10

10. Solid Waste based power generation

2.75 0.84 15-20 30-150

11. Cogeneration based power generation 1.50 3.36 70-80 23-41

12.

Ethanol based power generation 4 6 30-35 500-620

13. Solar PV based power generation 12

500000 10-17.5 Negligible

14.

Solar thermal based power generation 11.70 14-20 Negligible

15. OTEC based power generation 28 24.893 2-6 Negligible

16. Tidal based power generation 9 1.204 15- 25 Negligible

17. Geothermal based power generation 5 5.107 10-20 Negligible

18. Mini Hydel based power generation 2 1.197 40-45 Negligible

19.

Fuel cell based power generation 5 400000 50-60 Negligible

20. MHD based power generation 6 0.005913 50-60 Negligible