I think it would be difficult to even test a batch size of 1 in a reasonable amount of time, even assuming there exists a learning rate small enough to allow progress. How many wall-clock centuries would it take to verify that a minibatch of 1 works for the OA5 Dota 2 bot?
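To make the question concrete, here is a rough back-of-envelope sketch. It assumes that training with a minibatch of 1 has to consume the same total number of samples and that every update runs strictly one after another, which may well be wrong in either direction. The function name `serial_centuries` and both inputs are hypothetical placeholders, not OA5's actual figures; plug in whatever numbers you trust.

```python
SECONDS_PER_CENTURY = 100 * 365.25 * 24 * 3600  # Julian years, about 3.16e9 s


def serial_centuries(total_samples: float, seconds_per_step: float) -> float:
    """Wall-clock centuries needed at batch size 1.

    Assumption: one sample per optimizer step, steps run sequentially,
    and the total sample count is unchanged from the large-batch run.
    """
    steps = total_samples  # batch size 1 means one step per sample
    return steps * seconds_per_step / SECONDS_PER_CENTURY


# Placeholder inputs, NOT measured values: 1e12 samples, 1 ms per update.
estimate = serial_centuries(total_samples=1e12, seconds_per_step=1e-3)
print(f"{estimate:.2f} centuries")
```

The point of writing it out is just that the answer scales linearly in both inputs, so any realistic per-step latency on a large model pushes this into "nobody will ever check" territory pretty quickly.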