deep-learning - prototxt でバッチサイズ 1 を使用する場合と、pycaffe でバッチサイズを 1 に強制する場合の結果の不一致

Question

レイヤに手動で変更を加えてMNISTの例を実行しています。トレーニング中はすべてがうまく機能し、最終的なテスト精度は ~99% に達しました。私は現在、pycaffe を使用して python で生成されたモデルを操作しようとしており、ここに記載されている手順に従っています。混同行列を計算したいので、LMDB からテスト画像を 1 つずつループしてから、ネットワークを実行しています。コードは次のとおりです。

net = caffe.Net(args.proto, args.model, caffe.TEST)
...
datum = caffe.proto.caffe_pb2.Datum()
datum.ParseFromString(value)
label = int(datum.label)
image = caffe.io.datum_to_array(datum).astype(np.uint8)
...
net.blobs['data'].reshape(1, 1, 28, 28) # Greyscale 28x28 images
net.blobs['data'].data[...] = image
net.forward()
# Get predicted label
print net.blobs['label'].data[0] # use this later for confusion matrix

これが私のネットワーク定義prototxtです

name: "MNISTNet"
layer {
  name: "mnist"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TRAIN
  }
  transform_param {
    scale: 0.00390625
  }
  data_param {
    source: "train_lmdb"
    batch_size: 64
    backend: LMDB
  }
}
layer {
  name: "mnist"
  type: "Data"
  top: "data"
  top: "label"
  include {
    phase: TEST
  }
  transform_param {
    scale: 0.00390625
  }
  data_param {
    source: "test_lmdb"
    batch_size: 100
    backend: LMDB
  }
}
layer {
  name: "conv1"
  type: "Convolution"
  bottom: "data"
  top: "conv1"
  param {
    lr_mult: 1
  }
  param {
    lr_mult: 2
  }
  convolution_param {
    num_output: 20
    kernel_size: 5
    stride: 1
    weight_filler {
      type: "xavier"
    }
    bias_filler {
      type: "constant"
    }
  }
}
layer {
  name: "relu1"
  type: "ReLU"
  bottom: "conv1"
  top: "conv1"
}
layer {
  name: "pool1"
  type: "Pooling"
  bottom: "conv1"
  top: "pool1"
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}
layer {
  name: "conv2"
  type: "Convolution"
  bottom: "pool1"
  top: "conv2"
  param {
    lr_mult: 1
  }
  param {
    lr_mult: 2
  }
  convolution_param {
    num_output: 50
    kernel_size: 5
    stride: 1
    weight_filler {
      type: "xavier"
    }
    bias_filler {
      type: "constant"
    }
  }
}
layer {
  name: "relu2"
  type: "ReLU"
  bottom: "conv2"
  top: "conv2"
}
layer {
  name: "pool2"
  type: "Pooling"
  bottom: "conv2"
  top: "pool2"
  pooling_param {
    pool: MAX
    kernel_size: 2
    stride: 2
  }
}
layer {
  name: "fc1"
  type: "InnerProduct"
  bottom: "pool2"
  top: "fc1"
  param {
    lr_mult: 1
  }
  param {
    lr_mult: 2
  }
  inner_product_param {
    num_output: 500
    weight_filler {
      type: "xavier"
    }
    bias_filler {
      type: "constant"
    }
  }
}
layer {
  name: "relu3"
  type: "ReLU"
  bottom: "fc1"
  top: "fc1"
}
layer {
  name: "fc2"
  type: "InnerProduct"
  bottom: "fc1"
  top: "fc2"
  param {
    lr_mult: 1
  }
  param {
    lr_mult: 2
  }
  inner_product_param {
    num_output: 10
    weight_filler {
      type: "xavier"
    }
    bias_filler {
      type: "constant"
    }
  }
}
layer {
  name: "loss"
  type: "SoftmaxWithLoss"
  bottom: "fc2"
  bottom: "label"
}
layer {
  name: "accuracy"
  type: "Accuracy"
  bottom: "fc2"
  bottom: "label"
  top: "accuracy"
  include {
    phase: TEST
  }
}

テストバッチサイズは 100 であるため、Python コードで形状を変更する必要があります。ここで、テストバッチサイズを 1 に変更したとします。まったく同じ Python コードが、異なる (そしてほとんど正しい) 予測クラスラベルを出力します。したがって、バッチサイズ 1 で実行されているコードは、バッチサイズ 100 がひどいのに対し、最大 99% の精度で期待される結果を生成します。ただし、Imagenet pycaffe チュートリアルに基づいて、何が間違っているのかわかりません。最後の手段として、テスト用にバッチサイズ 1 の prototxt のコピーを作成し、それを Python コードで使用して、トレーニング中に元のファイルを使用することができますが、これは理想的ではありません。

また、バッチサイズ1でうまく機能する理由が説明されていないため、前処理の問題ではないと思います。

任意のポインタをいただければ幸いです！

deep-learning - prototxt でバッチ サイズ 1 を使用する場合と、pycaffe でバッチ サイズを 1 に強制する場合の結果の不一致

0 に答える 0

Related

Reference

deep-learning - prototxt でバッチサイズ 1 を使用する場合と、pycaffe でバッチサイズを 1 に強制する場合の結果の不一致