New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

[colossal-llama2] fix tensor data update for gemini loss calculation #5442

Merged

Camille7777 merged 1 commit into hpcaitech:main from Camille7777:fix/colossal_llama2

Mar 11, 2024

Merged

[colossal-llama2] fix tensor data update for gemini loss calculation #5442

applications/Colossal-LLaMA-2/train.py

-Original file line number
+Diff line change
@@ Expand Up / @@ -56,6 +56,7 @@ def format_numel_str(numel: int) -> str: @@
     def all_reduce_mean(tensor: torch.Tensor) -> torch.Tensor:
         dist.all_reduce(tensor=tensor, op=dist.ReduceOp.SUM)
+        tensor = tensor.data
         tensor.div_(dist.get_world_size())
         return tensor
@@ Expand Down @@