Dealing with Overconfidence in Neural Networks: Bayesian Approach

1 · Jonathan Ramkissoon · July 29, 2020, 12:22 p.m.
Summary
I trained a multi-class classifier on images of cats, dogs and wild animals and passed an image of myself, it’s 98% confident I’m a dog. The problem isn’t that I passed an inappropriate image, because models in the real world are passed all sorts of garbage. It’s that the model is overconfident about an image far away from the training data. Instead we expect a more uniform distribution over the classes. The overconfidence makes it difficult to post-process model output (setting a threshold on p...