Materials Algorithms Project
Program Library

Module MAP_UTIL_NNECODE

Provenance of code.
Purpose of code.
Specification.
Description of module's operation.
References.
Parameter descriptions.
Error indicators.
Accuracy estimate.
Any additional information.
Example of code
Auxiliary routines required.
Keywords.
Download source code.
Links.

Provenance of Source Code

C.A. Walsh,
Phase Transformations Group,
Department of Materials Science and Metallurgy,
Pembroke Street,
University of Cambridge,
Cambridge CB2 3QZ, U.K.

E-mail: caw4@cus.cam.ac.uk

Added to MAP: July 2002.

Top | Next

The BIGBACK program by David Mackay can be used to train a neural network from a series of experimental datasets. These routines read the output files from the BIGBACK program and use them to make predictions for any given set of inputs. The size of the error bars on the predicted values are also evaluated.

Top | Next | Prev

Specification

Language:	FORTRAN
Product form:	Source code

MODEL1(ND,INM,IHU,SNU,PMAX,PMIN,WGTFILE)

DOUBLE PRECISION  PMAX(20), PMIN(20), SNU(50)
CHARACTER*9       WGTFILE(50)
INTEGER           ND, INM, IHU(50)



NORM(ND,X0,XN,PMAX,PMIN)

DOUBLE PRECISION  X0(20), XN(20), PMAX(20), PMIN(20)
INTEGER           ND



GETV(ND,IHU,SNU,PMAX,PMIN,XN,WGTFILE,ERR0,ANS0,ANS0P,ANS0M)

DOUBLE PRECISION  PMAX(20), PMIN(20), XN(20)
DOUBLE PRECISION  SNU, ERR0, ANS0, ANS0P, ANS0M
CHARACTER*9       WGTFILE
INTEGER           ND, IHU

Top | Next | Prev

Description

A neural network is a non-linear mathematical equation for expressing one physical quantity, y, as a function of a set of input parameters, x_j:

The weights, w_i and w_ij, and biases are determined using the BIGBACK program written by David Mackay^1,2 from a series of known input parameters and measured output values (known as training the network), and are stored in a weights file. The BIGBACK program is also capable of calculating error bars, based on the probability distribution of the dataset used to train the network. In order to facilitate these calculations, the lu decomposition of the Hessian matrix is stored in a weights.lu output file. Further details about lu decomposition can be found by reference to Numerical Recipes⁴.

In general the best results are obtained by averaging the outputs from several different neural networks (committee result)³. The error from a committee of N networks is given in reference 3 as

This module contains a group of routines which can be used to obtain predictions from a neural network or committee of networks and calculate the size of the error bar. The weights and weights.lu output files are required as input files. These files should be stored in a sub-directory called w. The names of the weights files for each network in the committee should have exactly 7 characters (excluding the sub-directory name), and the corresponding lu decomposition files should have the suffix .lu.
e.g.

wgts101 and wgts101.lu
wgts102 and wgts102.lu
wgts103 and wgts103.lu

The souce code contains a complete program (except for routine LUBKSB) which gives an example of how the routines are used, including how to evaluate the overall error from a committee of neural networks.

NB The BIGBACK program is capable of considering a number of different forms to the non-linear equation relating the output to the input parameters. This program is strictly limited to the form given above. i.e., a linear output-layer activation function and a tanh activation function for the hidden layer.

NB For copyright reasons, the subroutine LUBKSB (absolutely essential for the operation of this module) is not included with this set of routines. It is a Numerical Recipes⁴ subroutine and must be acquired from a suitable source. (It is not a long routine and can be typed in very quickly and easily, if necessary.)

NB Users may find it necessary to add a carriage return to the end of each weights file in order to enable the program to read them.

The routines contained in the module are:

MODEL1
This sets up the required arrays and parameters with details of the neural networks: number of input variables and their normalisation factors; number of models in the committee; number of hidden nodes in each network; sigma_nu values; filenames of the weight files. All these need to be inserted into the subroutine by the user.

NORM
This subroutine is called to normalise the input parameters.

GETV
This subroutine is the core part of the program. It is called once for each network in the committee. Each time it returns the calculated output value and its error. This routine makes calls to the following subroutines:

WGTREAD: Reads the weights and biases from the weights file.
LUHREAD: Reads the lu decomposition of the hessian matrix from the *.lu file.
GGCALC: Calculates the g-vector (differentials of the output with respect to the weights and biases) and the inverse of the g-vector.
ERRCALC: Evaluates the error from the lu decomposition matrix and the inverse of the g-vector. This routine calls subroutine LUBKSB.
YCALC: Calculates the output value from the weights and input variables.
UNNORM: Unnormalises the output value.
LUBKSB: Refer to Numerical Recipes.

Top | Next | Prev

References

D.J.C. MacKay, 1997, Mathematical Modelling of Weld Phenomena 3, eds. H Cerjak & H.K.D.H. Bhadeshia, Inst. of Materials, pp 359.
D.J.C MacKay's website at https://wol.ra.phy.cam.ac.uk/mackay/README.html#Source_code
H. Fujii, D.J.C. Mackay and H.K.D.H. Bhadeshia, 1996, ISIJ International, 36, 1373-1382.
Numerical Recipes, 1986, W.H. Press, B.P. Flannery, S.A. Teukolsky and W.T. Vetterling, (CUP: Cambridge), pp 33.

Top | Next | Prev

Parameters

Input parameters

ND - integer: Number of input variables into the neural networks.
INM - integer: Number of networks in the committee.
IHU - integer: Array containing the number of hidden units for each network. Only one element of this array is input into subroutine GETV.
SNU - double precision: Array containing the sigma_nu values for each network. Only one element of this array is input into subroutine GETV.
PMIN - double precision: Array containing the lowest value of each input variable. These are used for normalising the inputs and can be found in the MINMAX output file of the BIGBACK program.
PMAX - double precision: Array containing the highest value of each input variable. These are used for normalising the inputs and can be found in the MINMAX output file of the BIGBACK program.
WGTFILE - character*9: Array containing the filenames of the weights files. The recommended format is of the form w/wgts101 etc. See above. Only one element of this array is input into subroutine GETV.
X0 - double precision: Array containing the input variables for which a prediction is to be made.
XN - double precision: Array containing the normalised input variables.

Output parameters

ANS0 - double precision: The predicted value from a given neural network, as output from GETV (after un-normalisation).
ERR0 - double precision: The error in the predicted value from a given neural network, as output from GETV.
ANS0P - double precision: ANS0 + ERR0
ANS0M - double precision: ANS0 - ERR0.
ANS1 - double precision: The predicted value from a committee of neural networks.
ERR1 - double precision: The error in the predicted value from a committee of neural networks.
ANS1P - double precision: ANS1 + ERR1.
ANS1M - double precision: ANS1 - ERR1.

Top | Next | Prev

1. Program text

      IMPLICIT DOUBLE PRECISION (A-H,O-Z), INTEGER(I-N)
      DOUBLE PRECISION SNU(50),PMAX(20),PMIN(20),XN(20),X0(20)
      INTEGER IHU(50)
      CHARACTER*9 WGTFILE(50),BLANK
C
      BLANK  = '         '
      DO 100 L1=1,50
         WGTFILE(L1) = BLANK
  100 CONTINUE
C
C Call subroutine to set the maximum and minimum parameter values for the
C model and set the number of input parameters and hidden variables etc.
C
      CALL MODEL1(ND,INM,IHU,SNU,PMAX,PMIN,WGTFILE)
C
C Read in the parameters (not normalised) and store in X0
C
      READ(*,*) (X0(J),J=1,ND)
C
C Normalise the parameters. Put normalised parameters in XN
C
      CALL NORM(ND,X0,XN,PMAX,PMIN)
      ANSR = 0D0
      ANSS = 0D0
      ERR1 = 0D0
      DO 120 L1=1,INM
         CALL GETV(ND,IHU(L1),SNU(L1),PMAX,PMIN,XN,WGTFILE(L1),
     +                                         ERR0,ANS0,ANS0P,ANS0M)
         ANSR = ANSR + ANS0
         ANSS = ANSS + ANS0*ANS0
         ERR1 = ERR1 + ERR0*ERR0
  120 CONTINUE
      ERR1 = ( ERR1 + ANSS - ANSR*ANSR/INM )/INM
      ERR1 = SQRT(ERR1)
      ANS1 = ANSR / INM
      ANS1M = ANS1 - ERR1
      ANS1P = ANS1 + ERR1
C
C Write out the results
C
      WRITE(*,1)
      WRITE(*,2) ANS1,ERR1,ANS1M,ANS1P
      STOP
    1 FORMAT(///'   VALUE   ERROR        RANGE')
    2 FORMAT(2X,F6.2,2X,F6.3,2X,F6.2,' - ',F6.2)
      END

2. Program data


0.8 1.5 1600.0 30.0


The sample weights files can be downloaded with the source code.

3. Program results


   VALUE   ERROR        RANGE
    1.62   0.243    1.38 -   1.86

Top | Next | Prev

MAP originated from a joint project of the National Physical Laboratory and the University of Cambridge.

MAP Website administration / map@msm.cam.ac.uk

Top | Module Index | MAP Homepage

Materials Algorithms Project
Program Library

Module MAP_UTIL_NNECODE

Provenance of Source Code

Purpose

Specification

Description

References

Parameters

Input parameters

Output parameters

Error Indicators

Accuracy

Further Comments

Example

1. Program text

2. Program data

3. Program results

Auxiliary Routines

Keywords

Download

Materials Algorithms ProjectProgram Library

Module MAP_UTIL_NNECODE

Input parameters

Output parameters

1. Program text

2. Program data

3. Program results

Materials Algorithms Project
Program Library