If my analysis is halfway correct, the mismatch you get is proportional to
time_spent_waiting * windspeed * boundary_slowdown_factor
So for weak winds you may have a lot of time before anything goes wrong, while for strong winds or rugged terrain that time may be much shorter.
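To get a feeling for how the product behaves, here is a rough Python sketch. It only evaluates the proportionality above, so the result is in arbitrary units; the function name and all the numbers (waiting time, wind speeds, slowdown factors) are made-up assumptions for illustration, not values taken from the weather code.

```python
def relative_mismatch(time_spent_waiting_s, windspeed_ms, boundary_slowdown_factor):
    """Return a number proportional to the expected mismatch (arbitrary units).

    The constant of proportionality is unknown, so only ratios between
    two results are meaningful.
    """
    return time_spent_waiting_s * windspeed_ms * boundary_slowdown_factor


# Hypothetical cases: same waiting time (10 minutes), different conditions.
weak_wind_flat = relative_mismatch(600.0, 2.0, 0.1)       # light wind, smooth terrain
strong_wind_rugged = relative_mismatch(600.0, 15.0, 0.3)  # strong wind, rugged terrain

# How much faster the mismatch builds up in the second case.
print(strong_wind_rugged / weak_wind_flat)
```

With these assumed inputs the second case accumulates mismatch many times faster, so the "safe" waiting time shrinks by the same ratio.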
In general, either you or the weather tile needs to move by one weather tile size (= 40 km) to trigger new weather generation.
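As a back-of-the-envelope check, the time until that happens is just the tile size divided by the relative speed between you and the tile. A small Python sketch, assuming straight-line motion over a full tile width; the function name and the example speeds are my own illustration, only the 40 km figure comes from above:

```python
TILE_SIZE_KM = 40.0  # weather tile size quoted above


def minutes_until_new_tile(relative_speed_kmh):
    """Minutes needed to cover one tile size at the given relative speed.

    relative_speed_kmh is the speed of the aircraft relative to the
    weather tile (for a parked aircraft this is just the wind drift).
    """
    if relative_speed_kmh <= 0.0:
        return float("inf")  # nothing moves, so no new tile is ever triggered
    return TILE_SIZE_KM / relative_speed_kmh * 60.0


print(minutes_until_new_tile(20.0))  # parked, tile drifting at 20 km/h -> 120.0
print(minutes_until_new_tile(80.0))  # 80 km/h relative speed -> 30.0
```

So if you sit on the ground in light wind, it can take hours before a new tile is generated, while at cruise speed it is a matter of minutes.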