I totally dropped the ball by using a wrong perspective on the 'better than' comment. I work a lot with embedded hardware and from a hardware perspective it runs at higher speeds and does not suffer from (albeit) intended slowdown.
I didn't mean to claim the emulation was now running cycle-accurate as on a real board