Ok , so I did quite bit of digging and I think I found the root cause of the problem
the motor optimization was just a distraction, since the decoder was already loaded with shay files the settings were readily optimal and had nothing to do with CS3 either.
I ran the locomotive indoor on a simple layout with no issues. My outdoor layout has a large number of isolated contact tracks sections attached to the Marklin S88 DC. the S88 DC senses motor return current that is connected to ground of the system internally via a diode.
The ESU decoders reliers on motor's BEMF sensing to compensate the motor torque for the load, the BEMF is sampled at very high frequency like 40KHz, this creates a very sensitive feedback loop by its nature.... now when the locomotive crosses from one contact section to another the return path has a delay to establish feedback in the circuit via the S88. the delay is in the order of msec... until the feedback is established there is some ground bounce from the motor return current (also evident in flickering of layout lights that feed directly from that section of track)..... the ground bounce screws up the BEMF measurement (it can even reverse polarity) so the feedback starts to oscillate causing stutter in the motor.... I had actually seen the same thing with Piko with Soundtrax DCC decoders in the Piko moguls but never quite nailed down the root cause
so why LGB locomotives don't have this problem the answer is that LGB decoders use a carefully designed low pass filter in the feedback loop to reject this momentary noise, and when feedback is not reliable they fall back on the factory calibrated load curve that is provided by the MFX controller, so they are much less sensitive to "noise on the track" especially from this ground bounce when passing between the isolated sections....
I measured the track waveform by a scope, you can see the difference when LGB 28442 is running vs. when shay is running , the current oscillation from the motor PWM modulation is distorting the track voltage in case of Shay with ESU Loksound whereas in case of LGB 28442 the waveform is clean. so safe to say Marklin/LGB designed the decoder with this detail in mind as they also make the S88
Next to be 100% sure I will install an LGB MFX decoder for the shay and confirm smooth operation, unfortunately this decoder does not have Shay sound and some other features found on the LokSound so it is not a solution
I have a few ideas how to solve this problem, I will report back if I am succesful.
I thought this information might be useful for others. it applies to folks who have a layout with many isolated contact track sections (for automation) fed back to an S88DC.... otherwise ESU should work find with marklin cs3 , in fact I also confirmed it works fine with MS2.....
LGB 28442.
ESU Loksound 5XL
