Why mali gpu is slower than arm cpu?

Good performance with mobile GPUs requires autotuning as in this example.