
Restricting output classes in multi-class classification in Tensorflow

Original post 2016-07-25 15:42:45 2 1 classification/ tensorflow/ text-classification

I am building a bidirectional LSTM to do multi-class sentence classification. I have 13 classes in total, and I multiply the output of my LSTM network by a matrix of shape [2*num_hidden_unit, num_classes], then apply softmax to get the probability of the sentence falling into each of the 13 classes.

So if we consider output[-1] as the network output:

W_output = tf.Variable(tf.truncated_normal([2*num_hidden_unit, num_classes]))
result = tf.matmul(output[-1], W_output) + bias

and I get my [1, 13] matrix (assuming I am not working with batches for the moment).

Now, I also have information that a given sentence definitely does not fall into certain classes, and I want to restrict the classes considered for that sentence. So let's say, for instance, that I know a given sentence can fall into only 6 classes, so the output should really be a matrix of dimensionality [1, 6].

One option I was thinking of is to put a mask over the result matrix, multiplying the entries corresponding to the classes I want to keep by 1 and the ones I want to discard by 0, but this way I will just lose some of the information instead of redirecting it.
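To make the concern about "losing" versus "redirecting" probability concrete, here is a minimal sketch in plain Python (no TensorFlow). The logits, the `allowed` vector, and the helper names are hypothetical values chosen for illustration. It compares zeroing the probabilities after softmax with a swapped-in alternative, masking the logits before softmax, which is one possible way to redistribute the probability mass over the allowed classes.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def masked_softmax(logits, allowed):
    """Softmax restricted to the classes where allowed[i] is 1."""
    if not any(allowed):
        raise ValueError("at least one class must be allowed")
    # Disallowed classes get a logit of -inf, so exp() maps them to exactly 0.
    masked = [z if a else -math.inf for z, a in zip(logits, allowed)]
    return softmax(masked)

# Hypothetical example: 13 logits, only 6 classes allowed.
logits = [0.5, -1.2, 2.0, 0.1, 0.0, 1.5, -0.3, 0.7, -2.0, 0.9, 0.2, -0.8, 1.1]
allowed = [1, 0, 1, 0, 0, 1, 0, 1, 0, 1, 0, 0, 1]

probs = softmax(logits)
zeroed = [p * a for p, a in zip(probs, allowed)]  # mask applied after softmax
restricted = masked_softmax(logits, allowed)      # mask applied before softmax

print(sum(zeroed))      # below 1: the mass on discarded classes is lost
print(sum(restricted))  # approximately 1: the mass is spread over allowed classes
```

The restricted distribution equals the zeroed one divided by its sum, so dividing the masked probabilities by their total would be an equivalent route under the same assumptions.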

Does anyone have a clue what to do in this case?

1 answer

I think your best bet is, as you seem to have described, to use a weighted cross-entropy loss function where the weights are 0 for your "impossible" classes and 1 for the possible classes. TensorFlow has a weighted cross-entropy loss function.
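One possible reading of this suggestion, sketched in plain Python rather than with any specific TensorFlow call: compute a per-class sigmoid cross-entropy and scale each class's term by a 0/1 weight. The function name, logits, targets, and weights below are hypothetical. The sketch uses a per-class sigmoid loss as an assumption, because with a one-hot target under softmax cross-entropy the terms for non-target classes are already zero.

```python
import math

def weighted_sigmoid_cross_entropy(logits, targets, weights):
    """Sum of per-class binary cross-entropy terms, each scaled by its weight."""
    loss = 0.0
    for z, y, w in zip(logits, targets, weights):
        # Stable form of -(y*log(sigmoid(z)) + (1-y)*log(1-sigmoid(z))).
        term = max(z, 0.0) - z * y + math.log1p(math.exp(-abs(z)))
        loss += w * term
    return loss

# Hypothetical example: 13 classes, the true class is index 2.
logits = [0.5, -1.2, 2.0, 0.1, 0.0, 1.5, -0.3, 0.7, -2.0, 0.9, 0.2, -0.8, 1.1]
targets = [0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
weights = [1, 0, 1, 0, 0, 1, 0, 1, 0, 1, 0, 0, 1]  # 0 marks an "impossible" class

# Change the logit of an impossible class (index 1) and compare losses.
perturbed = list(logits)
perturbed[1] = 5.0

loss_a = weighted_sigmoid_cross_entropy(logits, targets, weights)
loss_b = weighted_sigmoid_cross_entropy(perturbed, targets, weights)
unweighted_a = weighted_sigmoid_cross_entropy(logits, targets, [1] * 13)
unweighted_b = weighted_sigmoid_cross_entropy(perturbed, targets, [1] * 13)

print(loss_a == loss_b)              # the weighted loss ignores the impossible class
print(unweighted_a < unweighted_b)   # the unweighted loss does not
```

With the zero weights, no gradient would flow from the impossible classes, so the network is neither rewarded nor penalized for what it predicts there.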

Another interesting but probably less effective method is to feed whatever information you have about which classes your sentence can or cannot fall into to the network at some point (probably towards the end).
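As a rough illustration of feeding that information in "towards the end", the sketch below appends a 0/1 allowed-class vector to the final LSTM output before the output projection. It is plain Python with random stand-in values. The sizes, the `allowed` vector, and the choice of concatenation are assumptions, not something the answer specifies.

```python
import random

random.seed(0)

num_hidden_unit = 4   # hypothetical size, kept small for the example
num_classes = 13

# Stand-in for output[-1] of the bidirectional LSTM: 2*num_hidden_unit values.
lstm_output = [random.gauss(0.0, 1.0) for _ in range(2 * num_hidden_unit)]

# Hypothetical prior knowledge: 1 if the class is possible, 0 otherwise.
allowed = [1, 0, 1, 0, 0, 1, 0, 1, 0, 1, 0, 0, 1]

# Concatenate the class information onto the network output.
features = lstm_output + [float(a) for a in allowed]

# The projection matrix now has 2*num_hidden_unit + num_classes rows.
W_output = [[random.gauss(0.0, 0.1) for _ in range(num_classes)]
            for _ in range(len(features))]
bias = [0.0] * num_classes

# Equivalent of matmul(features, W_output) + bias for a single sentence.
result = [sum(f * W_output[i][j] for i, f in enumerate(features)) + bias[j]
          for j in range(num_classes)]

print(len(features), len(result))
```

Under this assumption the only structural change is the first dimension of the output matrix, and the network would have to learn from data how to use the extra inputs.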



The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address. For any questions, please contact: yoyou2525@163.com.



solution1 0 2017-07-13 20:30:39
