
Bipolar Data Representation in Neural Networks

Original post 2015-07-31 18:52:26 1 1 c#/ neural-network

Why do we use bipolar data representation in neural networks? For example, -0.5 and 0.5 in place of 0 and 1, or -1 and 1 in place of 0 and 1, as in this article: http://www.codeproject.com/Articles/11285/Neural-Network-OCR?fid=206868&df=90&mpp=25&noise=3&prof=True&sort=Position&view=Normal&spc=Relaxed&fr=26#xx0xx

1 answer

Your question is motivated, I'm guessing, by this statement from your reference:

But, in many neural network training tasks, it's preferred to represent training patterns in so called "bipolar" way , placing into input vector "0.5" instead of "1" and "-0.5" instead of "0".

There are two considerations that go into the use of 'bipolar' scaling:

a) The general choice of range is usually determined by the transfer functions used by the neural network, in cases where the distribution of the input is Gaussian or similar, i.e. most of the values are centred around some mean with only a relatively small number of outliers. For example, if you use a logistic function for your nodes (output range [0, +1]), then you would scale your inputs to [0, +1]. Similarly, if you use a tanh function (output range [-1, +1]), then you would scale your inputs to [-1, +1]. All of this assumes that your inputs are continuous.
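The rescaling described in (a) is a plain linear map from one interval to another. Below is a minimal sketch in Python, chosen only for brevity since the idea is language-independent. The helper name `rescale` and the sample pattern are hypothetical and do not come from the referenced article.

```python
def rescale(values, src_min, src_max, dst_min, dst_max):
    """Linearly map each value from [src_min, src_max] to [dst_min, dst_max]."""
    if src_max == src_min:
        raise ValueError("source range must have non-zero width")
    scale = (dst_max - dst_min) / (src_max - src_min)
    return [dst_min + (v - src_min) * scale for v in values]


# Hypothetical binary input pattern (e.g. pixels of a character image).
binary_pattern = [0, 1, 1, 0]

# Bipolar encoding as quoted from the article: 0 -> -0.5, 1 -> 0.5.
bipolar_half = rescale(binary_pattern, 0, 1, -0.5, 0.5)

# Full-range bipolar encoding: 0 -> -1, 1 -> 1.
bipolar_unit = rescale(binary_pattern, 0, 1, -1.0, 1.0)

print(bipolar_half)  # [-0.5, 0.5, 0.5, -0.5]
print(bipolar_unit)  # [-1.0, 1.0, 1.0, -1.0]
```

The same helper works for continuous inputs once you know (or estimate) their minimum and maximum.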

b) The range is further refined because of how learning takes place. NN learning usually uses the derivative of the transfer function, and learning is fastest where that derivative is largest, i.e. on the steepest part of the transfer function. At either extreme of the transfer function the curve flattens out and the derivative is small, so learning is minimal or slow. To avoid those regions, if you are certain of the value range of your inputs, you scale them so that they lie well within the steep part of the transfer function, typically something like [-0.8, +0.8] for tanh(), but in your reference [-0.5, +0.5] for 'BipolarSigmoidFunction'.
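To make the flattening in (b) concrete, here is a small Python sketch that evaluates the derivative of tanh and of a bipolar sigmoid at a few points. The bipolar sigmoid form used here, 2 / (1 + exp(-alpha * x)) - 1 with alpha = 2, is an assumption about what 'BipolarSigmoidFunction' computes, not something stated in the question; the function names are hypothetical.

```python
import math


def tanh_derivative(x):
    """d/dx tanh(x) = 1 - tanh(x)^2."""
    return 1.0 - math.tanh(x) ** 2


def bipolar_sigmoid(x, alpha=2.0):
    """Assumed form: 2 / (1 + exp(-alpha * x)) - 1, output in (-1, +1)."""
    return 2.0 / (1.0 + math.exp(-alpha * x)) - 1.0


def bipolar_sigmoid_derivative(x, alpha=2.0):
    """Derivative of the assumed form: alpha * (1 - f(x)^2) / 2."""
    y = bipolar_sigmoid(x, alpha)
    return alpha * (1.0 - y * y) / 2.0


# The derivative peaks at 0 and shrinks towards the flat extremes.
for x in (0.0, 0.5, 0.8, 2.0, 4.0):
    print(f"x={x:4.1f}  tanh'={tanh_derivative(x):.4f}  "
          f"bipolar sigmoid'={bipolar_sigmoid_derivative(x):.4f}")
```

Both derivatives are largest at 0 and fall off quickly for large |x|, which is why values such as ±0.5 or ±0.8 are preferred over values that push a node towards its flat regions.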

TL;DR - the choice of bipolar scaling is determined by the transfer function (your reference uses 'BipolarSigmoidFunction'), and the bipolar values are arbitrary but centred on the steepest part of the transfer function curve.





solution1 0 2015-08-02 17:14:08
