Graph Partitioning and Heterogeneous Execution

Or could we skip them during lowering and instead create another type of op, say tvm_copy_op?
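To make the question concrete, here is a rough sketch of what I have in mind. It is a toy model in plain Python, not TVM code: I am assuming "them" means the cross-device copy nodes, and `Node`, `lower`, and the `device_copy` tag are hypothetical names made up for illustration. Only the name `tvm_copy_op` comes from the question itself, and it is a proposed op, not an existing one.

```python
from dataclasses import dataclass, field

# Proposed op name from the question; not an existing TVM op.
COPY_OP = "tvm_copy_op"


@dataclass
class Node:
    """Hypothetical graph node: an op placed on a device."""
    name: str
    op: str                      # e.g. "conv2d", or "device_copy" for a copy node
    device: str
    inputs: list = field(default_factory=list)


def lower(nodes):
    """Toy lowering pass.

    Compute nodes are "compiled" into a per-device kernel name.
    Copy nodes are not compiled at all; they are retagged as COPY_OP
    so that a runtime could recognize and execute them directly.
    """
    lowered = []
    for node in nodes:
        if node.op == "device_copy":
            # Skip codegen for the copy node and emit the special op instead.
            lowered.append(("runtime", COPY_OP, node.name))
        else:
            lowered.append(("compiled", f"{node.op}_{node.device}", node.name))
    return lowered


# A three-node graph: GPU conv2d, a copy to the CPU, then CPU softmax.
graph = [
    Node("conv", "conv2d", "gpu"),
    Node("copy0", "device_copy", "cpu", inputs=["conv"]),
    Node("softmax", "softmax", "cpu", inputs=["copy0"]),
]
result = lower(graph)
```

The point of the sketch is only that the copy node never reaches codegen; whether the real lowering pipeline can be made to pass such nodes through is exactly what I am asking.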