Unable to get started with Metal on macOS 10.13 with Radeon Pro 560 4096 MB

@haichen Under the strategy design, should we create a separate strategy for the tensor-core code and register it only for cuda (not gpu), so that it won’t affect other GPU targets such as Metal and OpenCL?
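
To make the proposal concrete, here is a rough sketch of the dispatch behaviour I have in mind. It is a toy registry in plain Python, not TVM's actual code: `StrategyRegistry`, `TARGET_KEYS`, and the implementation names are all hypothetical, and the key lists only assume that each target carries its own key ahead of the shared gpu key.

```python
from typing import Callable, Dict, List

# Assumption: each target lists its keys from most to least specific.
TARGET_KEYS: Dict[str, List[str]] = {
    "cuda": ["cuda", "gpu"],
    "metal": ["metal", "gpu"],
    "opencl": ["opencl", "gpu"],
}


class StrategyRegistry:
    """Toy key-based dispatcher (hypothetical, for illustration only)."""

    def __init__(self, fallback: Callable[[], List[str]]) -> None:
        self._fallback = fallback
        self._by_key: Dict[str, Callable[[], List[str]]] = {}

    def register(self, keys: List[str]):
        """Decorator that registers a strategy under each given key."""
        def decorator(fn: Callable[[], List[str]]):
            for key in keys:
                self._by_key[key] = fn
            return fn
        return decorator

    def dispatch(self, target: str) -> Callable[[], List[str]]:
        """Return the strategy for the first matching key, else the fallback."""
        for key in TARGET_KEYS.get(target, []):
            if key in self._by_key:
                return self._by_key[key]
        return self._fallback


def dense_strategy_generic() -> List[str]:
    return ["dense_generic"]


dense_strategy = StrategyRegistry(fallback=dense_strategy_generic)


@dense_strategy.register(["gpu"])
def dense_strategy_gpu() -> List[str]:
    # Shared by every GPU target that has no more specific strategy.
    return ["dense_small_batch", "dense_large_batch"]


@dense_strategy.register(["cuda"])
def dense_strategy_cuda() -> List[str]:
    # Registered for cuda only, so the tensor-core entry never reaches
    # metal or opencl.
    return dense_strategy_gpu() + ["dense_tensorcore"]


if __name__ == "__main__":
    for target in ("cuda", "metal", "opencl"):
        print(target, dense_strategy.dispatch(target)())
```

With this layout, metal and opencl keep resolving to the shared gpu strategy, and only cuda sees the tensor-core implementation.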