> It has to do with syncing the video/sound and the refresh rate of your monitor > or something
On real hardware the cpu can update video ram the instant before the pixels appear on screen & it can read inputs immediately.
Emulators generally have the inputs from the start of the frame, for the next 1/60th of a second you'll get the same inputs while on arcade hardware they could change during that time. Then at the end of the frame you pass the image to the OS to display, which is 1/60th of a second later than real hardware.
There isn't really anything you can do about it, the FPGA emulators that plug into real controls and monitors are likely to be much better but we're a long way off that being something that everyone has.
But even real monitors aren't great, LCD displays for example often add another frame or more of latency.
|