Auto schedular performance on AMDGPU: the first attempt

cc @jcf94 who did experiment on mac’s amd gpu using opencl.