The network I am working on is essentially a TensorFlow ResNet-101 with FPN, and I believe it contains no control flow operations. These are all the op types in the graph:
Add
MaxPool
Placeholder
FusedBatchNorm
Conv2D
Const
Pad
BiasAdd
Relu
Identity
Transpose
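For anyone who wants to double-check the "no control flow" claim on their own graph, here is a minimal sketch of how the op list above could be collected and compared against the TF1-style control flow op types (Switch, Merge, Enter, Exit, NextIteration, LoopCond). The helper name `summarize_ops` and the stand-in graph object are my own for illustration; with a real model you would pass the loaded GraphDef instead, since the function only relies on `graph_def.node` and each node's `.op` field.

```python
from collections import Counter
from types import SimpleNamespace

# TF1-style control flow op types, as emitted by tf.cond / tf.while_loop.
CONTROL_FLOW_OPS = {"Switch", "Merge", "Enter", "Exit", "NextIteration", "LoopCond"}


def summarize_ops(graph_def):
    """Count op types in a GraphDef-like object and list any control flow ops."""
    counts = Counter(node.op for node in graph_def.node)
    control_flow = sorted(set(counts) & CONTROL_FLOW_OPS)
    return counts, control_flow


# Stand-in for a real GraphDef (hypothetical): one node per op type listed above.
op_names = ["Add", "MaxPool", "Placeholder", "FusedBatchNorm", "Conv2D", "Const",
            "Pad", "BiasAdd", "Relu", "Identity", "Transpose"]
fake_graph = SimpleNamespace(node=[SimpleNamespace(op=name) for name in op_names])

counts, control_flow = summarize_ops(fake_graph)
print(sorted(counts))   # the distinct op types
print(control_flow)     # an empty list means no control flow ops were found
```

An empty second result only rules out the op types in `CONTROL_FLOW_OPS`; it says nothing about other unsupported ops.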
These are all the workloads extracted from the network on Linux:
('conv2d_nchw.cuda', ('TENSOR', (1, 256, 232, 232), 'float32'), ('TENSOR', (256, 256, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 1024, 58, 58), 'float32'), ('TENSOR', (2048, 1024, 1, 1), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 512, 116, 116), 'float32'), ('TENSOR', (1024, 512, 1, 1), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 256, 232, 232), 'float32'), ('TENSOR', (512, 256, 1, 1), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 3, 933, 933), 'float32'), ('TENSOR', (64, 3, 7, 7), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 64, 232, 232), 'float32'), ('TENSOR', (64, 64, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 256, 232, 232), 'float32'), ('TENSOR', (64, 256, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw_winograd.cuda', ('TENSOR', (1, 64, 232, 232), 'float32'), ('TENSOR', (64, 64, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 64, 232, 232), 'float32'), ('TENSOR', (64, 64, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 64, 232, 232), 'float32'), ('TENSOR', (256, 64, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 256, 232, 232), 'float32'), ('TENSOR', (128, 256, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 128, 233, 233), 'float32'), ('TENSOR', (128, 128, 3, 3), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 512, 116, 116), 'float32'), ('TENSOR', (128, 512, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw_winograd.cuda', ('TENSOR', (1, 128, 116, 116), 'float32'), ('TENSOR', (128, 128, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 128, 116, 116), 'float32'), ('TENSOR', (128, 128, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 128, 116, 116), 'float32'), ('TENSOR', (512, 128, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 512, 116, 116), 'float32'), ('TENSOR', (256, 512, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 256, 117, 117), 'float32'), ('TENSOR', (256, 256, 3, 3), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 1024, 58, 58), 'float32'), ('TENSOR', (256, 1024, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw_winograd.cuda', ('TENSOR', (1, 256, 58, 58), 'float32'), ('TENSOR', (256, 256, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 256, 58, 58), 'float32'), ('TENSOR', (256, 256, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 256, 58, 58), 'float32'), ('TENSOR', (1024, 256, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 1024, 58, 58), 'float32'), ('TENSOR', (512, 1024, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 512, 59, 59), 'float32'), ('TENSOR', (512, 512, 3, 3), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 2048, 29, 29), 'float32'), ('TENSOR', (512, 2048, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw_winograd.cuda', ('TENSOR', (1, 512, 29, 29), 'float32'), ('TENSOR', (512, 512, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 512, 29, 29), 'float32'), ('TENSOR', (512, 512, 3, 3), 'float32'), (1, 1), (1, 1, 1, 1), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 512, 29, 29), 'float32'), ('TENSOR', (2048, 512, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
('conv2d_nchw.cuda', ('TENSOR', (1, 2048, 29, 29), 'float32'), ('TENSOR', (256, 2048, 1, 1), 'float32'), (1, 1), (0, 0, 0, 0), (1, 1), 'float32')
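As a sanity check on the list above, the output spatial size of each workload can be recomputed from the tuple itself. The sketch below assumes the tuple layout is (task name, data, kernel, strides, padding, dilation, out dtype) and that the padding is ordered (top, left, bottom, right); both are my reading of the printed workloads, not something confirmed here. The helper name `conv2d_out_hw` is hypothetical.

```python
import ast


def conv2d_out_hw(workload):
    """Return (out_h, out_w) for a conv2d workload tuple in the format printed above."""
    _, data, kernel, strides, padding, dilation, _ = workload
    _, _, in_h, in_w = data[1]        # NCHW input shape
    _, _, k_h, k_w = kernel[1]        # OIHW kernel shape
    pad_top, pad_left, pad_bottom, pad_right = padding  # assumed ordering
    # Effective kernel extent once dilation is taken into account.
    eff_h = dilation[0] * (k_h - 1) + 1
    eff_w = dilation[1] * (k_w - 1) + 1
    out_h = (in_h + pad_top + pad_bottom - eff_h) // strides[0] + 1
    out_w = (in_w + pad_left + pad_right - eff_w) // strides[1] + 1
    return out_h, out_w


# Two workloads copied from the list above.
lines = [
    "('conv2d_nchw.cuda', ('TENSOR', (1, 3, 933, 933), 'float32'), ('TENSOR', (64, 3, 7, 7), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')",
    "('conv2d_nchw.cuda', ('TENSOR', (1, 128, 233, 233), 'float32'), ('TENSOR', (128, 128, 3, 3), 'float32'), (2, 2), (0, 0, 0, 0), (1, 1), 'float32')",
]
for line in lines:
    workload = ast.literal_eval(line)
    print(workload[1][1], "->", conv2d_out_hw(workload))
# (1, 3, 933, 933) -> (464, 464)
# (1, 128, 233, 233) -> (116, 116)
```

The second result matches the 116 x 116 inputs of the following stage, which suggests that the odd sizes (933, 233, 117, 59) come from the explicit Pad ops placed before the strided convolutions.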
If you are interested, here is the TensorFlow graph_def I am working on.