We are seeing a roughly 10-15% reduction in training performance for Llama 8B and BERT when moving from torch-xla v2.8.0-rc3 to torch-xla v2.8.0. We narrowed the problem down to the range v2.8.0-rc3...v2.8.0, specifically #9547. Building the wheels with commit ad76b20 reverted restores the previous performance.
Note: This is an additional issue that we observed after resolving the logging issue #9569.
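For anyone comparing builds, the reduction quoted above can be computed from throughput as in the minimal sketch below. The function name `percent_reduction` and the throughput values are hypothetical placeholders for illustration, not measurements from this issue.

```python
def percent_reduction(baseline: float, candidate: float) -> float:
    """Return the percentage drop in throughput from baseline to candidate."""
    if baseline <= 0:
        raise ValueError("baseline throughput must be positive")
    return (baseline - candidate) / baseline * 100.0


# Hypothetical throughput values (samples/sec); substitute your own measurements.
rc3_throughput = 100.0      # assumed: measured with torch-xla v2.8.0-rc3
release_throughput = 87.5   # assumed: measured with torch-xla v2.8.0

print(f"{percent_reduction(rc3_throughput, release_throughput):.1f}% reduction")
```

Averaging throughput over several steps after warm-up (compilation) should give a more stable comparison than a single step.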
To Reproduce
Steps to reproduce the behavior:
Install the latest Neuron torch-neuronx, torch-xla, torch, and torchvision packages, then replace torch-xla and torch with 2.8.0.