Autoscheduler and VM

ok, it seems there are four tasks with UnsupportedReduce. For now I hacked the task extraction code in compile_engine.cc to only send conv2d op to the auto scheduler. Still there are 46 tuning tasks to keep my gpu busy for a while.