Part of the issue is that the client was designed during the Win9x-era, where the GPUs weren't all that great with 3D rendering and CPUs were single-core (I still use this kind of system as main, so I can compare). Graphics are rendered by DirectX and all graphics card power makes difference in the game fluency (even an upgrade from a 8MB GPU to 32MB GPU is like two different worlds).
But there are few factors in the modernization, that may be partly at fault, why the client chokes on newer systems:
Back then your typical resolution was 640x480 or 800x600, or if you were very high-end you could play on a 1024x768(!) resolution. The code in use is increasingly inefficient with today's resolutions like FullHD or 4K (that also are no longer 4:3 / 3:2 ratio as was common back then, which causes screw ups on certain ultra wide resolutions).
There is something, that is not rendered by DirectX and that is all text. Text is rendered by the GDI, the Windows raster graphics engine. While this engine was hardware-accelerated (just like DirectX) in Windows XP and older, since Vista the GDI is (mostly) software-based to allow hardware-accelerated DWM for 3D-effects in Windows UI (e.g. Aero). This is also why plain GDI applications run slower on Vista and newer than they did before on the same hardware. Under certain conditions text rendering by GDI can cause delays on Windows XP and newer due to some Unicode handling, that is absent on the Windows 9x platform.
The client is a multi-threaded application, designed for single-core systems with the assumption that two threads will never run at the same time, so some synchronization between the threads can take out any advantages from multi-core systems, or dead-lock the client in the worst case. Also some update in the last few years introduces an unidentified thread-leak into the client, so there may be delays from contention between synchronized resources.
Another issue introduced in the last years, mainly with the introduction of Lua and new features based on it, is increased RAM usage (not just the amount of memory consumed, but also the amount of memory blocks), mostly during startup. This incurs delays during allocation/freeing, not all that visible on powerful systems today, but on the older ones you notice a sudden increase from a 5 second start up time, to 2 minutes.
Hope this helps your investigation somewhat.