When training with Trainer in a DDP setting, does the logging_steps argument trigger an all-reduce at each logging step just to obtain a single loss value?