简写:
LB - Large Batch
SB - Small Batch
优缺点:
LB方法探索性太差,容易离起始点附近很近的地方停下来
参考:
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
简写:
LB - Large Batch
SB - Small Batch
优缺点:
LB方法探索性太差,容易离起始点附近很近的地方停下来
参考:
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour