Complete output
One or more operators have not been tuned. Please tune your model for better performance. Use DEBUG logging level to see more details.
{'mean': 6.638219979995483, 'median': 7.205596800031344, 'std': 2.198990796372162}
class='n02123045 tabby, tabby cat' with probability=0.610552
class='n02123159 tiger cat' with probability=0.367179
class='n02124075 Egyptian cat' with probability=0.019365
class='n02129604 tiger, Panthera tigris' with probability=0.001273
class='n04040759 radiator' with probability=0.000261
[Task 1/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 1/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 13.75 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 3, 224, 224), 'float32'), ('TENSOR', (64, 3, 7, 7), 'float32'), (2, 2), (3, 3, 3, 3), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 3, 224, 224), 'float32'), ('TENSOR', (64, 3, 7, 7), 'float32'), (2, 2), (3, 3, 3, 3), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_7doepwif.log.
Done.
[Task 2/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 2/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 14.36 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 64, 56, 56), 'float32'), ('TENSOR', (64, 64, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 64, 56, 56), 'float32'), ('TENSOR', (64, 64, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_wohln07u.log.
Done.
[Task 3/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 3/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 15.52 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 64, 56, 56), 'float32'), ('TENSOR', (64, 64, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 64, 56, 56), 'float32'), ('TENSOR', (64, 64, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_rzg9iv15.log.
Done.
[Task 4/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 4/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 10.91 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw_winograd.cuda, args=(('TENSOR', (1, 64, 56, 56), 'float32'), ('TENSOR', (64, 64, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw_winograd.cuda', ('TENSOR', (1, 64, 56, 56), 'float32'), ('TENSOR', (64, 64, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_4naydwvt.log.
Done.
[Task 5/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 5/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 11.21 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 64, 56, 56), 'float32'), ('TENSOR', (256, 64, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 64, 56, 56), 'float32'), ('TENSOR', (256, 64, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_fqta3hmb.log.
Done.
[Task 6/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 6/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 10.03 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 256, 56, 56), 'float32'), ('TENSOR', (64, 256, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 256, 56, 56), 'float32'), ('TENSOR', (64, 256, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_xrzw_zb1.log.
Done.
[Task 7/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 7/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 10.57 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 256, 56, 56), 'float32'), ('TENSOR', (128, 256, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 256, 56, 56), 'float32'), ('TENSOR', (128, 256, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_yx3111i7.log.
Done.
[Task 8/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 8/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 10.98 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 128, 56, 56), 'float32'), ('TENSOR', (128, 128, 3, 3), 'float32'), (2, 2), (1, 1, 1, 1), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 128, 56, 56), 'float32'), ('TENSOR', (128, 128, 3, 3), 'float32'), (2, 2), (1, 1, 1, 1), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_75eb995i.log.
Done.
[Task 9/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 9/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 8.82 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 256, 56, 56), 'float32'), ('TENSOR', (512, 256, 1, 1), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 256, 56, 56), 'float32'), ('TENSOR', (512, 256, 1, 1), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_b4b6seui.log.
Done.
[Task 10/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 10/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 12.91 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 128, 28, 28), 'float32'), ('TENSOR', (512, 128, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 128, 28, 28), 'float32'), ('TENSOR', (512, 128, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_6mh2ss5h.log.
Done.
[Task 11/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 11/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 12.04 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 512, 28, 28), 'float32'), ('TENSOR', (128, 512, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 512, 28, 28), 'float32'), ('TENSOR', (128, 512, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_upcbvqql.log.
Done.
[Task 12/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 12/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 18.84 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 128, 28, 28), 'float32'), ('TENSOR', (128, 128, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 128, 28, 28), 'float32'), ('TENSOR', (128, 128, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_cd_pbydl.log.
[Task 13/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 13/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 9.46 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw_winograd.cuda, args=(('TENSOR', (1, 128, 28, 28), 'float32'), ('TENSOR', (128, 128, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw_winograd.cuda', ('TENSOR', (1, 128, 28, 28), 'float32'), ('TENSOR', (128, 128, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_j8bamdxe.log.
Done.
[Task 14/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 14/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 13.52 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 512, 28, 28), 'float32'), ('TENSOR', (256, 512, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 512, 28, 28), 'float32'), ('TENSOR', (256, 512, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_crae7902.log.
Done.
[Task 15/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s Done.
[Task 15/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 13.58 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 256, 28, 28), 'float32'), ('TENSOR', (256, 256, 3, 3), 'float32'), (2, 2), (1, 1, 1, 1), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 256, 28, 28), 'float32'), ('TENSOR', (256, 256, 3, 3), 'float32'), (2, 2), (1, 1, 1, 1), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_7_e_pdx1.log.
[Task 16/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 16/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 13.90 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 512, 28, 28), 'float32'), ('TENSOR', (1024, 512, 1, 1), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 512, 28, 28), 'float32'), ('TENSOR', (1024, 512, 1, 1), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_i6kaclzm.log.
Done.
[Task 17/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 17/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 14.02 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 256, 14, 14), 'float32'), ('TENSOR', (1024, 256, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 256, 14, 14), 'float32'), ('TENSOR', (1024, 256, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_w8f3rgi9.log.
Done.
[Task 18/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 18/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 8.49 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 1024, 14, 14), 'float32'), ('TENSOR', (256, 1024, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 1024, 14, 14), 'float32'), ('TENSOR', (256, 1024, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_zbsfyik6.log.
Done.
[Task 19/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 19/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 11.89 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 256, 14, 14), 'float32'), ('TENSOR', (256, 256, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 256, 14, 14), 'float32'), ('TENSOR', (256, 256, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_qu68tp6r.log.
Done.
[Task 20/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 20/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 11.14 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw_winograd.cuda, args=(('TENSOR', (1, 256, 14, 14), 'float32'), ('TENSOR', (256, 256, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw_winograd.cuda', ('TENSOR', (1, 256, 14, 14), 'float32'), ('TENSOR', (256, 256, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_l15zjzzj.log.
Done.
[Task 21/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 21/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 4.07 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 1024, 14, 14), 'float32'), ('TENSOR', (512, 1024, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 1024, 14, 14), 'float32'), ('TENSOR', (512, 1024, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_ycqg1qr0.log.
Done.
[Task 22/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 22/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 10.27 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 512, 14, 14), 'float32'), ('TENSOR', (512, 512, 3, 3), 'float32'), (2, 2), (1, 1, 1, 1), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 512, 14, 14), 'float32'), ('TENSOR', (512, 512, 3, 3), 'float32'), (2, 2), (1, 1, 1, 1), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_ybjo14d5.log.
Done.
[Task 23/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 23/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 15.11 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 1024, 14, 14), 'float32'), ('TENSOR', (2048, 1024, 1, 1), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 1024, 14, 14), 'float32'), ('TENSOR', (2048, 1024, 1, 1), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_vh0nsr41.log.
[Task 24/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 24/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 3.57 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 512, 7, 7), 'float32'), ('TENSOR', (2048, 512, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 512, 7, 7), 'float32'), ('TENSOR', (2048, 512, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_cjrlba0_.log.
Done.
[Task 25/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 25/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 10.34 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 2048, 7, 7), 'float32'), ('TENSOR', (512, 2048, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 2048, 7, 7), 'float32'), ('TENSOR', (512, 2048, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_fqcr_b68.log.
Done.
[Task 26/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s Done.
[Task 26/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 13.92 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw.cuda, args=(('TENSOR', (1, 512, 7, 7), 'float32'), ('TENSOR', (512, 512, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw.cuda', ('TENSOR', (1, 512, 7, 7), 'float32'), ('TENSOR', (512, 512, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_9ksvboer.log.
[Task 27/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 27/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 12.91 sWARNING:root:Could not find any valid schedule for task Task(func_name=conv2d_nchw_winograd.cuda, args=(('TENSOR', (1, 512, 7, 7), 'float32'), ('TENSOR', (512, 512, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32'), kwargs={}, workload=('conv2d_nchw_winograd.cuda', ('TENSOR', (1, 512, 7, 7), 'float32'), ('TENSOR', (512, 512, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_dear1xqt.log.
Done.
[Task 28/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (0/10) | 0.00 s
[Task 28/28] Current/Best: 0.00/ 0.00 GFLOPS | Progress: (10/10) | 8.51 sWARNING:root:Could not find any valid schedule for task Task(func_name=dense_small_batch.gpu, args=(('TENSOR', (1, 2048), 'float32'), ('TENSOR', (1000, 2048), 'float32'), None, 'float32'), kwargs={}, workload=('dense_small_batch.gpu', ('TENSOR', (1, 2048), 'float32'), ('TENSOR', (1000, 2048), 'float32'), None, 'float32')). A file containing the errors has been written to /tmp/tvm_tuning_errors_rmg3_378.log.
Done.
Done.
Done.
class='n02123045 tabby, tabby cat' with probability=0.610552
class='n02123159 tiger cat' with probability=0.367179
class='n02124075 Egyptian cat' with probability=0.019365
class='n02129604 tiger, Panthera tigris' with probability=0.001273
class='n04040759 radiator' with probability=0.000261
optimized: {'mean': 18.921251980018496, 'median': 21.12027845000739, 'std': 6.681149567023143}
unoptimized: {'mean': 6.638219979995483, 'median': 7.205596800031344, 'std': 2.198990796372162}
While with AVX-512 the optimized model runs slightly slower than the unoptimized one, the autotuning seems to work.
With CUDA though, the output does not show any measured speed in GFLOPS and outputs these warnings:
Is there something obviously wrong with my setup? Is the GPU kernel invalid because the setup in the tutorial is inherently incompatible with GPUs, or is this an TVM-internal issue?