machine-learning - Digit Recognition on CNN

Question

I am testing printed digits (0-9) on a Convolutional Neural Network. It is giving 99+ % accuracy on the MNIST Dataset, but when I tried it using fonts installed on computer (Ariel, Calibri, Cambria, Cambria math, Times New Roman) and trained the images generated by fonts (104 images per font(Total 25 fonts - 4 images per font(little difference)) the training error rate does not go below 80%, i.e. 20% accuracy. Why?

Here is "2" number Images sample -

I resized every image 28 x 28.

Here is more detail :-

Training data size = 28 x 28 images. Network parameters - As LeNet5 Architecture of Network -

Input Layer -28x28
| Convolutional Layer - (Relu Activation);
| Pooling Layer - (Tanh Activation)
| Convolutional Layer - (Relu Activation)
| Local Layer(120 neurons) - (Relu)
| Fully Connected (Softmax Activation, 10 outputs)

This works, giving 99+% accuracy on MNIST. Why is so bad with computer-generated fonts? A CNN can handle lot of variance in data.

score 2 · Accepted Answer

考えられる問題が 2 つあります。

前処理: MNIST は 28px x 28px だけでなく、

NIST の元の白黒 (2 値) 画像は、縦横比を維持しながら 20x20 ピクセルのボックスに収まるようにサイズが正規化されています。結果として得られる画像には、正規化アルゴリズムで使用されるアンチエイリアシング技術の結果としてグレーレベルが含まれます。画像は、ピクセルの重心を計算し、この点が 28x28 フィールドの中心に位置するように画像を変換することにより、28x28 画像の中心に配置されました。

出典: MNIST ウェブサイト

オーバーフィッティング:

MNIST には 60,000 のトレーニングサンプルと 10,000 のテストサンプルがあります。いくつ持っていますか？
ドロップアウトを試しましたか (論文を参照)?
データセットの拡張技術を試しましたか? (たとえば、画像を少しずらしたり、おそらく縦横比を少し変更したり、ノイズを追加することもできますが、それらは役に立たないと思います)
小規模なネットワークを試してみましたか? (そして、あなたのフィルターの大きさ/フィルターの数は?)

備考

面白いアイデア！トレーニング済みの MNIST ネットワークをデータに単純に適用してみましたか? 結果は？

machine-learning - Digit Recognition on CNN

3 に答える 3

Related

Reference