I have some questions about your kernels.
1. Would you mind providing a CPU version (serial or OpenMP) of the Metropolis kernel for reference and verification?
2. When a GPU's maximum block size is 256 threads, what values should BX, BY, and BZ take?
/* warning: changing the block dimensions can lead to undefined behavior. */
#define WARPSIZE 32
#define BX 32
#define BY 8
#define BZ 4
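For context on question 2: the defaults above multiply out to 32 × 8 × 4 = 1024 threads per block, which is over a 256-thread limit. Below is a minimal sketch of the check I have in mind. `MAX_THREADS_PER_BLOCK` and `block_fits` are hypothetical names of my own, not anything from the repository, and I am not assuming any particular replacement values for BX, BY, and BZ.

```c
/* Block dimensions copied from the snippet above. */
#define BX 32
#define BY 8
#define BZ 4

/* Hypothetical per-block thread limit for the GPU in question
   (my assumption, not a value taken from the repository). */
#define MAX_THREADS_PER_BLOCK 256

/* Returns 1 if a bx * by * bz block stays within `limit` threads, else 0. */
static int block_fits(int bx, int by, int bz, int limit)
{
    return bx * by * bz <= limit;
}
```

Calling `block_fits(BX, BY, BZ, MAX_THREADS_PER_BLOCK)` with the defaults returns 0, which is why I am asking which combination is safe given the warning about changing the block dimensions.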